SignalFeed - 技术信息流

1

The AI Vampire

📝 Simon Willison's Weblog

🏷️ 软件工程 🏷️ 职业发展

↗ 打开原文

📌 AI 摘要: 文章核心指出，过度依赖AI提升个人生产力可能导致价值被公司榨取，并引发严重的职业倦怠与认知疲劳。

💡 核心要点:

个人独自高强度使用AI工作，产生的超额价值可能被雇主全部获取。
作者认为AI代理工作认知负担重，每天四小时是更现实的节奏。
AI自动化了简单任务，但将困难的决策和问题解决留给了人类。

🧠 深度分析:

这揭示了AI工具普及下新的劳资关系风险，个人需警惕无偿的自我剥削。
文章提醒开发者需合理设定AI辅助工作的强度与时长，以维持可持续的创造力。
从团队管理角度看，需建立公平的价值分配机制，避免因AI使用加剧内部竞争与消耗。

📖 站内阅读原文（RSS全文）

The AI Vampire

Steve Yegge's take on agent fatigue, and its relationship to burnout.

Let's pretend you're the only person at your company using AI.

In Scenario A, you decide you're going to impress your employer, and work for 8 hours a day at 10x productivity. You knock it out of the park and make everyone else look terrible by comparison.

In that scenario, your employer captures 100% of the value from you adopting AI. You get nothing, or at any rate, it ain't gonna be 9x your salary. And everyone hates you now.

And you're exhausted. You're tired, Boss. You got nothing for it.

Congrats, you were just drained by a company. I've been drained to the point of burnout several times in my career, even at Google once or twice. But now with AI, it's oh, so much easier.

Steve reports needing more sleep due to the cognitive burden involved in agentic engineering, and notes that four hours of agent work a day is a more realistic pace:

I’ve argued that AI has turned us all into Jeff Bezos, by automating the easy work, and leaving us with all the difficult decisions, summaries, and problem-solving. I find that I am only really comfortable working at that pace for short bursts of a few hours once or occasionally twice a day, even with lots of practice.

Via Tim Bray

Tags: steve-yegge , ai , generative-ai , llms , ai-assisted-programming , ai-ethics , coding-agents

2

Race between primes of the forms 4k + 1 and 4k + 3

📝 John D. Cook

🏷️ 数学 🏷️ 其他

↗ 打开原文

📌 AI 摘要: 文章探讨了形如4k+1和4k+3的素数分布竞赛，指出虽然两者在无穷意义上数量相等，但4k+3型素数因梅森素数搜索优势而长期领先，并存在切比雪夫偏差现象。

💡 核心要点:

形如4k+1的素数可表为两平方和，而4k+3型素数则不能。
已知最大素数多为梅森素数（4k+3型），因卢卡斯-莱默测试使其更易被发现。
素数计数差g(n)在无穷区间内不收敛，存在长期为正的切比雪夫偏差。

🧠 深度分析:

该偏差揭示了素数分布的非随机性，对理解数论深层结构有理论意义。
梅森素数的易测性影响了密码学等领域对超大素数的实际选择。
可视化分析有助于直观理解数学概念的长期行为，是有效的科普与教学工具。

📖 站内阅读原文（RSS全文）

The last few posts have looked at expressing an odd prime p as a sum of two squares. This is possible if and only if p is of the form 4 k + 1. I illustrated an algorithm for finding the squares with p = 2 255 − 19, a prime that is used in cryptography. It is being used in bringing this page to you if the TLS connection between my server and your browser is uses Curve25519 or Ed25519.

World records

I thought about illustrating the algorithm with a larger prime too, such as a world record. But then I realized all the latest record primes have been of the form 4 k + 3 and so cannot be written as a sum of squares. Why is p mod 4 equal to 3 for all the records? Are more primes congruent to 3 than to 1 mod 4? The answer to that question is subtle; more on that shortly.

More record primes are congruent to 3 mod 4 because Mersenne primes are easier to find, and that’s because there’s an algorithm, the Lucas-Lehmer test, that can test whether a Mersenne number is prime more efficiently than testing general numbers. Lucas developed his test in 1878 and Lehmer refined it in 1930.

Since the time Lucas first developed his test, the largest known prime has always been a Mersenne prime, with exceptions in 1951 and in 1989.

Chebyshev bias

So, are more primes congruent to 3 mod 4 than are congruent to 1 mod 4?

Define the function f ( n ) to be the ratio of the number of primes in each residue class.

f ( n ) = (# primes p < n with p = 3 mod 4) / (# primes p < n with p = 1 mod 4)

As n goes to infinity, the function f ( n ) converges to 1. So in that sense the number of primes in each category are equal.

If we look at the difference rather than the ratio we get a more subtle story. Define the lead function to be how much the count of primes equal to 3 mod 4 leads the number of primes equal to 1 mod 4.

g ( n ) = (# primes p < n with p = 3 mod 4) − (# primes p < n with p = 1 mod 4)

For any n , f ( n ) > 1 if and only if g ( n ) > 0. However, as n goes to infinity the function g ( n ) does not converge. It oscillates between positive and negative infinitely often. But g ( n ) is positive for long stretches. This phenomena is known as Chebyshev bias.

Visualizing the lead function

We can calculate the lead function at primes with the following code.

from numpy import zeros from sympy import primepi, primerange

N = 1_000_000 leads = zeros(primepi(N) + 1) for index, prime in enumerate(primerange(2, N), start=1): leads[index] = leads[index - 1] + prime % 4 - 2 Here is a list of the primes at which the lead function is zero, i.e. when it changes sign.

[ 0, 1, 3, 7, 13, 89, 2943, 2945, 2947, 2949, 2951, 2953, 50371, 50375, 50377, 50379, 50381, 50393, 50413, 50423, 50425, 50427, 50429, 50431, 50433, 50435, 50437, 50439, 50445, 50449, 50451, 50503, 50507, 50515, 50517, 50821, 50843, 50853, 50855, 50857, 50859, 50861, 50865, 50893, 50899, 50901, 50903, 50905, 50907, 50909, 50911, 50913, 50915, 50917, 50919, 50921, 50927, 50929, 51119, 51121, 51123, 51127, 51151, 51155, 51157, 51159, 51161, 51163, 51177, 51185, 51187, 51189, 51195, 51227, 51261, 51263, 51285, 51287, 51289, 51291, 51293, 51297, 51299, 51319, 51321, 51389, 51391, 51395, 51397, 51505, 51535, 51537, 51543, 51547, 51551, 51553, 51557, 51559, 51567, 51573, 51575, 51577, 51595, 51599, 51607, 51609, 51611, 51615, 51617, 51619, 51621, 51623, 51627] This is OEIS sequence A038691 .

Because the lead function changes more often in some regions than others, it’s best to plot the function over multiple ranges.

The lead function is more often positive than negative. And yet it is zero infinitely often. So while the count of primes with remainder 3 mod 4 is usually ahead, the counts equal out infinitely often. The post Race between primes of the forms 4k + 1 and 4k + 3 first appeared on John D. Cook .

3

Hideki Sato has died

📝 Old Vintage Computing Research

🏷️ 硬件 🏷️ 其他

↗ 打开原文

📌 AI 摘要: 世嘉前高管、传奇硬件设计师佐藤秀树逝世，他主导了世嘉从SG-1000到Dreamcast的所有主机开发。

💡 核心要点:

佐藤秀树参与了世嘉SG-1000至Dreamcast所有主机的设计或开发。
他于1971年加入世嘉，2001至2003年担任代理总裁，2008年退休。
他于本周末去世，享年77岁，其个人曾撰写过详细的硬件回顾文章。

🧠 深度分析:

他的逝世标志着一个经典游戏硬件设计时代的远去，其设计理念（如Dreamcast的前瞻性）至今被玩家和开发者研究。
作为从工程师到高管的职业路径范例，他的经历对技术人员的职业发展具有参考价值。

📖 站内阅读原文（RSS全文）

Remember when Sega made consoles? Hideki Sato remembered, because he was involved in or designed all of them — from the 1982 SG-1000 under Sega Enterprises Ltd. president Hayao Nakayama, later reworked as the SC-3000 home computer , to of course the extremely popular Mega Drive/Genesis and the technologically overwrought Saturn, to the flawed but ahead-of-its-time 1999 Dreamcast , the very last console the company released to date and one of my favourite machines. Joining Sega in 1971, he later became acting president from 2001 to 2003, and finally retired from Sega in 2008. I can think of no better summation of his career than his own , a detailed retrospective on each machine translated from the Japanese. He passed away this weekend at the age of 77 (X.com link). Rest in peace.

4

Cost of Housing

📝 the singularity is nearer

🏷️ 其他 🏷️ 产品设计

↗ 打开原文

📌 AI 摘要: 文章核心观点是，美国住房可负担性危机的根源并非供给问题，而是现有房主（尤其是婴儿潮一代）需要维持高房价以保护自身资产价值，导致降价不可行。

💡 核心要点:

作者认为住房价格下跌将导致大量有抵押贷款的房主陷入资产贬值的困境。
文章指出，维持房价上涨已成为一种必须被维护的社会承诺或现实。
作者将住房危机归因于现有资产持有者需要将高成本转嫁给新买家。

🧠 深度分析:

这一观点揭示了住房问题的金融和政治属性，挑战了单纯从土地、 zoning 或环保审批找原因的流行叙事。
如果此观点成立，意味着解决住房危机需要复杂的金融或社会政策干预，而非简单的增加供给。
对于技术领域从业者，这提醒我们许多社会问题背后存在根深蒂固的利益结构，技术方案可能无法直接解决。

📖 站内阅读原文（RSS全文）

Many people in America are complaining about the cost of housing. But do they understand the damage it will do if it prices go down?

Everyone who owns a house will suffer. Some of those people don’t even fully own the house, they have a mortgage. So when prices go down, they will be underwater, having put money for years into an asset that now has no value.

So it’s simply out of the question for housing prices to go down. If you want to buy a house to live in, sorry. The boomers were told that houses are appreciating assets, and now we must bend reality to make that true.

Until you solve this problem, you will never solve the housing affordability crisis. It has nothing to do with houses, zoning, or environmental reviews. It has to do with people holding bags they need to dump on you.

5

My Courses Site is Moving to a New Home

📝 Miguel Grinberg's Blog

🏷️ Web开发 🏷️ 其他

↗ 打开原文

📌 AI 摘要: 作者宣布其付费课程网站已迁移至新平台，并告知老用户如何转移账户。

💡 核心要点:

课程托管平台从旧站点迁移至新域名 https://learn.miguelgrinberg.com。
此次迁移主要影响直接从作者处购买课程或电子书的用户。
文章提供了将旧账户转移到新站点的具体操作指引。

🧠 深度分析:

平台迁移是技术内容创作者维护服务的重要操作，直接影响用户体验和访问连续性。
作者主动通知并提供转移指南，是维护客户关系和信任的关键举措，避免用户因迁移而丢失访问权限。

📖 站内阅读原文（RSS摘要）

This is a short blog post to announce that I'm migrating the site in which I host my paid courses to a new platform at https://learn.miguelgrinberg.com . If you have purchased a course or ebook directly from me, this article tells you how to transfer your account to the new site.

6

Social Media Payments and Perverse Incentives

📝 Terence Eden’s Blog

🏷️ 产品设计 🏷️ 技术趋势

↗ 打开原文

📌 AI 摘要: 文章探讨了在社交媒体平台直接集成小额支付功能（如打赏）的设想，并重点分析了此举可能引发的负面激励问题，如内容同质化、欺诈和版权盗窃风险。

💡 核心要点:

作者提出在社交平台直接打赏创作者或记者的设想，但隐藏技术复杂性。
现有平台A/B测试和算法已导致内容同质化与愤怒营销泛滥。
支付功能可能激励内容盗窃和欺诈，增加平台责任与安全风险。

🧠 深度分析:

该功能设计需平衡便捷性与安全监管，否则可能加剧平台生态恶化，损害原创者利益。
尽管存在风险，但类似GitHub Sponsors的成功案例表明，在特定社区或受控环境下，直接支付模式可行。
对于Mastodon等新兴去中心化平台，谨慎实验支付功能或可探索出更健康的创作者激励模式。

📖 站内阅读原文（RSS全文）

At the recent " Protocols for Publishers " event, a group of us were talking about news paywalls, social media promotion, and the embarrassment of having to ask for money.

What if, we said, you could tip a journalist directly on social media? Or reward your favourite creator without leaving the platform? Or just say thanks by buying someone a pint?

Here's a trivial mock-up:

Of course, this hides a ton of complexity. Does it show your local currency symbol? Does the platform take a cut or does it just pass you to the poster's preferred platform? Do users want to be able to tip as well as / instead of reposting and favouriting?

But I think the real problem is the perverse incentives it creates. We already know that relentless A|B testing of monetisation strategies leads to homogeneity and outrage farming. Every YouTuber has the same style of promotional thumbnail. Rage-baiters on Twitter know what drives the algorithm and pump out unending slurry.

Even if we ignore those who want to burn the world, content stealers like @CUTE_PUPP1E5 grab all the content they can and rip-off original creators. At the moment that's merely annoying, but monetisation means a strong incentive to steal content.

When people inevitably get scammed, would that damage the social media platform? Would promoting a payment link lead to liability? Now that money is involved, does that make hacking more attractive?

And yet… Accounts add payment links to their profiles all the time. Lots of accounts regularly ask for donor and sponsors. GitHub sponsors exist and I don't see evidence of people impersonating big projects and snaffling funds.

It is somewhat common for platforms to pay for publishers to be on their site. If you're starting up a new service then you need to give people an incentive to be there. That might be as a payer or receiver.

Personally, I'd love a frictionless way to throw a quid to a helpful blog post, or effortlessly donate to a poster who has made me laugh. Selfishly, I'd like it if people paid me for my Open Source or (micro)blogging.

I don't know whether Mastodon or BlueSky will ever have a payments button - and I have no influence on their decision-making process - but I'd sure like to see them experiement.

You can read more discussion on Mastodon .

Or, feel free to send me a tip!

• Buy me a gift from my Amazon wishlist

• Sponsor me on GitHub

• Send me money via PayPal

• Support me on Ko-Fi

• Become a Patreon

• Join my Open Collective

• Donate using LiberaPay

• Pay with Wise

7

How Generative and Agentic AI Shift Concern from Technical Debt to Cognitive Debt

📝 Simon Willison's Weblog

🏷️ 软件工程 🏷️ AI/机器学习

↗ 打开原文

📌 AI 摘要: 文章核心阐述了生成式AI和智能体AI的兴起，正在使开发者的核心担忧从“技术债”转向“认知债”，即团队对系统理解的缺失比代码混乱更致命。

💡 核心要点:

认知债指因快速开发导致的理解缺失，它存在于开发者脑中，影响其修改和决策能力。
文中学生团队案例表明，认知债（不理解设计决策和系统交互）比技术债（混乱代码）更易导致项目瘫痪。
作者亲身体验了使用AI快速生成功能但未审查，导致自身对项目心智模型模糊，难以进行后续开发。

🧠 深度分析:

这标志着软件工程关注点的重大转变：AI辅助编程下，维护‘系统理论’和团队共识比修复代码更关键。
实践建议：团队需建立强制性的知识同步机制（如文档、评审），即使AI生成‘可读’代码，也需确保理解其意图和实现。
长期影响：可能催生新的工程实践和工具，专注于知识管理和认知负载降低，而不仅是代码质量分析。

📖 站内阅读原文（RSS全文）

How Generative and Agentic AI Shift Concern from Technical Debt to Cognitive Debt

This piece by Margaret-Anne Storey is the best explanation of the term cognitive debt I've seen so far.

Cognitive debt , a term gaining traction recently, instead communicates the notion that the debt compounded from going fast lives in the brains of the developers and affects their lived experiences and abilities to “go fast” or to make changes. Even if AI agents produce code that could be easy to understand, the humans involved may have simply lost the plot and may not understand what the program is supposed to do, how their intentions were implemented, or how to possibly change it.

Margaret-Anne expands on this further with an anecdote about a student team she coached:

But by weeks 7 or 8, one team hit a wall. They could no longer make even simple changes without breaking something unexpected. When I met with them, the team initially blamed technical debt: messy code, poor architecture, hurried implementations. But as we dug deeper, the real problem emerged: no one on the team could explain why certain design decisions had been made or how different parts of the system were supposed to work together. The code might have been messy, but the bigger issue was that the theory of the system, their shared understanding, had fragmented or disappeared entirely. They had accumulated cognitive debt faster than technical debt, and it paralyzed them.

I've experienced this myself on some of my more ambitious vibe-code-adjacent projects. I've been experimenting with prompting entire new features into existence without reviewing their implementations and, while it works surprisingly well, I've found myself getting lost in my own projects.

I no longer have a firm mental model of what they can do and how they work, which means each additional feature becomes harder to reason about, eventually leading me to lose the ability to make confident decisions about where to go next.

Via Martin Fowler

Tags: definitions , ai , generative-ai , llms , ai-assisted-programming , vibe-coding

8

The empire always falls

📝 Westenberg.

🏷️ AI/机器学习 🏷️ 技术趋势

↗ 打开原文

📌 AI 摘要: 文章通过历史类比，驳斥了AI巨头及其技术路线将线性发展并永久主导未来的“AI必然性”论调，指出任何看似永恒的体系终将因内部僵化与外部变化而崩溃。

💡 核心要点:

历史帝国与科技巨头常因坚信自身永恒而忽视内部衰败与外部颠覆，最终崩溃。
当前对基础模型公司线性发展的预测，与历史上失败的直线预测如出一辙。
主导系统（如科学范式、公司架构）的成功会使其内部人员无法感知自身的弱点与威胁。

🧠 深度分析:

对AI从业者与投资者而言，盲目相信技术线性进步是危险的，应警惕市场集中、战略自满与创新者窘境。
文章提醒技术决策者需保持开放与谦逊，避免被现有成功模式束缚，为潜在的范式转变做好准备。
这为评估AI行业长期前景提供了一个批判性视角，即技术演进路径更可能是曲折、充满意外与非线性的。

📖 站内阅读原文（RSS全文）

A citizen of Rome in 117 AD, under Emperor Trajan, would've found it difficult to imagine the empire not existing. The roads, the aqueducts, the legal system, the trade networks stretching from Britain to Mesopotamia: all of it seemed to be a near-fact of nature, like gravity // the Mediterranean itself. Edward Gibbon gave us six volumes explaining how that feeling turned out to be wrong, and even he couldn't fully untangle all the causes. But the overarching theme might be this: the permanence was a mirage, and belief in the permanence a catastrophic delusion. Popular AI commentary treats the current crop of foundation model companies the way those Roman citizens treated the legions: as inevitable, as the only possible structure the world could take. The posting classes assume that because OpenAI and Google and Anthropic and Meta have built impressive things, those impressive things will continue to compound in a linear fashion until every job is automated and every economy is restructured, leaving a permanent underclass of unemployable humans in a world that no longer needs them. This is treated as so obvious that questioning it marks you as either naive or sentimental. But companies destroy themselves and empires rot from within , and the people living inside these systems almost never see the collapse coming, because the system itself is the lens through which they view the world. Permanence is the most dangerous feeling in history Thomas Kuhn argued in The Structure of Scientific Revolutions that the scientists working within a dominant framework don't use it as a tool so much as inhabit it. Normal science is puzzle-solving within a framework that nobody questions, until the anomalies pile up so high that someone proposes a new framework entirely, and the old guard spends twenty years insisting nothing's changed. The Ptolemaic model of the solar system survived for over a thousand years, largely because everyone concerned was brilliant enough to keep adding epicycles to make the data fit, making every new complication feel like...well, progress. In the "AI inevitability thesis" every limitation gets explained away as a temporary obstacle on the path to AGI. Reasoning will improve, costs will fall etc and to be fair, they might. But the confidence with which these predictions are delivered should remind you of the confidence with which the British Empire's administrators, circa 1900, reviewed the permanent nature of their civilizational project. They had the world's largest navy and the world's most extensive telegraph network, plus control of roughly a quarter of the earth's land surface. Within fifty years, nearly all of it was gone. And that dissolution happened because the underlying conditions that made the empire possible changed in ways that no amount of naval power could address. Sure things fill graveyards In 2007, Research In Motion controlled roughly half the US smartphone market and had a market capitalization north of $60 billion. RIM's co-CEO Mike Lazaridis reportedly studied the iPhone at launch and concluded it was impossible for the device to work as advertised on a cellular network. He was, in a narrow technical sense, almost right. The first iPhone had appalling battery life and a network that could barely support it. But he was catastrophically wrong about everthing else. Nokia held about 40% of the global mobile phone market at its peak. Internal documents later revealed that middle management had become so terrified of senior leadership's reactions to bad news that critical information about competitive threats stopped flowing upward. The company suffocated on its own hierarchy. Xerox PARC invented the graphical user interface, the mouse, the laser printer and Ethernet, and then Xerox managed to commercialize approximately one of those things while Steve Jobs walked out of a demo and built Apple's future on what he'd seen. The Soviet Union fell because the gap between its internal model of reality and actual reality became unsustainable. The Ottoman Empire spent its last century implementing increasingly frantic reforms, each one an attempt to bolt modernity onto a structure that couldn't support it. I'm reminded of Shelley's Ozymandias : the lone and level sands stretch far away, and they stretch away from every kingdom that ever declared itself eternal. And yes, that could still include Google, Anthropic and anyone // everyone else. Straight lines never stay straight The current AI narrative draws a line from model 1 to model 2 to model 3 to whatever comes next, projects it forward, and concludes that human labor // existence is finished. But straight-line projections are the most reliably wrong predictions in the history of forecasting, tech or otherwise. What actually happens, in empires // companies alike, is that progress hits unexpected walls and leaders make strategic blunders while some force that nobody took seriously finds an approach that makes the incumbent architecture look like Ptolemy's epicycles: elaborate and technically sophisticated but pointed in entirely the wrong direction. The blunder creates the opening creates the backlash creates the opportunity for the insurgent. Why should the AI industry be exempt from this? What is it about foundation models that repeals the laws of entropy that have governed every dominant system in recorded history? Clayton Christensen documented the corporate version of this in The Innovator's Dilemma . Across industries and decades, incumbents fail to respond to disruptive threats because responding would require cannibalizing their existing business (or identity, or vision, or mission) and admitting that the strategy everyone got promoted for executing was wrong. AKA: Dominant systems produce the very conditions that destroy them, because the success of the system makes it impossible for the people inside it to perceive its weaknesses. What collapse looks like before it arrives We haven't seen the first great AI collapse. We haven't seen a foundation model company make the BlackBerry mistake or the Nokia mistake, or the Roman mistake, or the Ottoman mistake or reach their Bunker-in-Berlin mistake. But we will, and we'll see it multiple times, because these mistakes = features of power concentration. The hubris that makes a company or an empire dominant in one era is frequently the quality that blinds it to the next one. If you could ask Lazaridis in 2006, or a British colonial administrator in 1900, whether their model of the world was permanent, each would've given you a very convincing explanation for why it in fact was. When someone tells you that AGI is inevitable and the permanent economic displacement of most humans is a foregone conclusion, what they're really telling you is that they believe the current leaders of the AI industry will execute flawlessly, indefinitely, against challenges they can't yet foresee, in an environment that's changing faster than perhaps any technological enviroment in history. They believe that this particular set of institutions, at this particular moment, has broken the pattern that has held for every empire and every corporation in human history. But roads crumble and legions go home, the epicycles collapse into a simpler truth, and something else, something nobody predicted, grows in the spaces left behind. It always has and it always will.

9

Two different tricks for fast LLM inference

📝 seangoedecke.com RSS feed

🏷️ AI/机器学习 🏷️ 性能优化

↗ 打开原文

📌 AI 摘要: 文章分析了Anthropic与OpenAI实现“快速模式”的两种不同技术路径：Anthropic通过降低批处理大小来提速，而OpenAI则利用Cerebras巨型芯片实现全内存推理，但后者使用了能力稍逊的蒸馏模型。

💡 核心要点:

Anthropic快速模式提速2.5倍，成本高6倍，推测基于低批处理大小推理。
OpenAI快速模式提速15倍，使用Cerebras芯片全内存运行，但模型是能力较弱的GPT-5.3-Codex-Spark。
作者推测Anthropic此举是为在新闻周期中与OpenAI竞争，而非真正专注于快速推理。

🧠 深度分析:

这揭示了AI推理服务在速度、成本与模型能力间的核心权衡，厂商需根据场景（如实时对话与批量任务）选择不同优化策略。
Cerebras等专用硬件可能推动小型化、高性能推理模型的发展，但经济性与通用性仍是待解问题。
对于开发者，选择快速模式需评估任务对速度与准确性的敏感度，避免为速度牺牲过多可靠性。

📖 站内阅读原文（RSS全文）

Anthropic and OpenAI both recently announced “fast mode”: a way to interact with their best coding model at significantly higher speeds.

These two versions of fast mode are very different. Anthropic’s offers up to 2.5x tokens per second (so around 170, up from Opus 4.6’s 65). OpenAI’s offers more than 1000 tokens per second (up from GPT-5.3-Codex’s 65 tokens per second, so 15x). So OpenAI’s fast mode is six times faster than Anthropic’s 1 .

However, Anthropic’s big advantage is that they’re serving their actual model. When you use their fast mode, you get real Opus 4.6, while when you use OpenAI’s fast mode you get GPT-5.3-Codex-Spark, not the real GPT-5.3-Codex. Spark is indeed much faster, but is a notably less capable model: good enough for many tasks, but it gets confused and messes up tool calls in ways that vanilla GPT-5.3-Codex would never do.

Why the differences? The AI labs aren’t advertising the details of how their fast modes work, but I’m pretty confident it’s something like this: Anthropic’s fast mode is backed by low-batch-size inference, while OpenAI’s fast mode is backed by special monster Cerebras chips . Let me unpack that a bit.

How Anthropic’s fast mode works

The tradeoff at the heart of AI inference economics is batching , because the main bottleneck is memory . GPUs are very fast, but moving data onto a GPU is not. Every inference operation requires copying all the tokens of the user’s prompt 2 onto the GPU before inference can start. Batching multiple users up thus increases overall throughput at the cost of making users wait for the batch to be full.

A good analogy is a bus system. If you had zero batching for passengers - if, whenever someone got on a bus, the bus departed immediately - commutes would be much faster for the people who managed to get on a bus . But obviously overall throughput would be much lower, because people would be waiting at the bus stop for hours until they managed to actually get on one.

Anthropic’s fast mode offering is basically a bus pass that guarantees that the bus immediately leaves as soon as you get on. It’s six times the cost, because you’re effectively paying for all the other people who could have got on the bus with you, but it’s way faster 3 because you spend zero time waiting for the bus to leave.

Obviously I can’t be fully certain this is right. Maybe they have access to some new ultra-fast compute that they’re running this on, or they’re doing some algorithmic trick nobody else has thought of. But I’m pretty sure this is it. Brand new compute or algorithmic tricks would likely require changes to the model (see below for OpenAI’s system), and “six times more expensive for 2.5x faster” is right in the ballpark for the kind of improvement you’d expect when switching to a low-batch-size regime.

How OpenAI’s fast mode works

OpenAI’s fast mode does not work anything like this. You can tell that simply because they’re introducing a new, worse model for it. There would be absolutely no reason to do that if they were simply tweaking batch sizes. Also, they told us in the announcement blog post exactly what’s backing their fast mode: Cerebras.

OpenAI announced their Cerebras partnership a month ago in January. What’s Cerebras? They build “ultra low-latency compute”. What this means in practice is that they build giant chips . A H100 chip (fairly close to the frontier of inference chips) is just over a square inch in size. A Cerebras chip is 70 square inches.

You can see from pictures that the Cerebras chip has a grid-and-holes pattern all over it. That’s because silicon wafers this big are supposed to be broken into dozens of chips. Instead, Cerebras etches a giant chip over the entire thing.

The larger the chip, the more internal memory it can have. The idea is to have a chip with SRAM large enough to fit the entire model , so inference can happen entirely in-memory. Typically GPU SRAM is measured in the tens of megabytes . That means that a lot of inference time is spent streaming portions of the model weights from outside of SRAM into the GPU compute 4 . If you could stream all of that from the (much faster) SRAM, inference would a big speedup: fifteen times faster, as it turns out!

So how much internal memory does the latest Cerebras chip have? 44GB . This puts OpenAI in kind of an awkward position. 44GB is enough to fit a small model (~20B params at fp16, ~40B params at int8 quantization), but clearly not enough to fit GPT-5.3-Codex. That’s why they’re offering a brand new model, and why the Spark model has a bit of “small model smell” to it: it’s a smaller distil of the much larger GPT-5.3-Codex model 5 .

OpenAI’s version is much more technically impressive

It’s interesting that the two major labs have two very different approaches to building fast AI inference. If I had to guess at a conspiracy theory, it would go something like this:

• OpenAI partner with Cerebras in mid-January, obviously to work on putting an OpenAI model on a fast Cerebras chip

• Anthropic have no similar play available, but they know OpenAI will announce some kind of blazing-fast inference in February, and they want to have something in the news cycle to compete with that

• Anthropic thus hustles to put together the kind of fast inference they can provide: simply lowering the batch size on their existing inference stack

• Anthropic (probably) waits until a few days before OpenAI are done with their much more complex Cerebras implementation to announce it, so it looks like OpenAI copied them

Obviously OpenAI’s achievement here is more technically impressive. Getting a model running on Cerebras chips is not trivial, because they’re so weird. Training a 20B or 40B param distil of GPT-5.3-Codex that is still kind-of-good-enough is not trivial. But I commend Anthropic for finding a sneaky way to get ahead of the announcement that will be largely opaque to non-technical people. It reminds me of OpenAI’s mid-2025 sneaky introduction of the Responses API to help them conceal their reasoning tokens .

Is fast AI inference the next big thing?

Seeing the two major labs put out this feature might make you think that fast AI inference is the new major goal they’re chasing. I don’t think it is. If my theory above is right, Anthropic don’t care that much about fast inference, they just didn’t want to appear behind OpenAI. And OpenAI are mainly just exploring the capabilities of their new Cerebras partnership. It’s still largely an open question what kind of models can fit on these giant chips, how useful those models will be, and if the economics will make any sense.

I personally don’t find “fast, less-capable inference” particularly useful. I’ve been playing around with it in Codex and I don’t like it. The usefulness of AI agents is dominated by how few mistakes they make , not by their raw speed. Buying 6x the speed at the cost of 20% more mistakes is a bad bargain, because most of the user’s time is spent handling mistakes instead of waiting for the model 6 .

However, it’s certainly possible that fast, less-capable inference becomes a core lower-level primitive in AI systems. Claude Code already uses Haiku for some operations. Maybe OpenAI will end up using Spark in a similar way.

• This isn’t even factoring in latency. Anthropic explicitly warns that time to first token might still be slow (or even slower), while OpenAI thinks the Spark latency is fast enough to warrant switching to a persistent websocket (i.e. they think the 50-200ms round trip time for the handshake is a significant chunk of time to first token).

↩

• Either in the form of the KV-cache for previous tokens, or as some big tensor of intermediate activations if inference is being pipelined through multiple GPUs. I write a lot more about this in Why DeepSeek is cheap at scale but expensive to run locally , since it explains why DeepSeek can be offered at such cheap prices (massive batches allow an economy of scale on giant expensive GPUs, but individual consumers can’t access that at all).

↩

• Is it a contradiction that low-batch-size means low throughput, but this fast pass system gives users much greater throughput? No. The overall throughput of the GPU is much lower when some users are using “fast mode”, but those user’s throughput is much higher.

↩

• Remember, GPUs are fast, but copying data onto them is not. Each “copy these weights to GPU” step is a meaningful part of the overall inference time.

↩

• Or a smaller distil of whatever more powerful base model GPT-5.3-Codex was itself distilled from. I don’t know how AI labs do it exactly, and they keep it very secret. More on that here .

↩

• On this note, it’s interesting to point out that Cursor’s hype dropped away basically at the same time they released their own “much faster, a little less-capable” agent model. Of course, much of this is due to Claude Code sucking up all the oxygen in the room, but having a very fast model certainly didn’t help .

↩

10

Separating Download from Install in Docker Builds

📝 Andrew Nesbitt

🏷️ DevOps 🏷️ 性能优化

↗ 打开原文

📌 AI 摘要: 文章核心指出，将依赖下载与安装步骤分离是优化Docker构建层缓存、减少对公共包注册中心冗余请求的关键，并列举了各语言包管理器的支持现状。

💡 核心要点:

多数包管理器将下载与安装合并，导致源码变更触发依赖全量重下，浪费构建时间和社区注册中心带宽。
Go、pnpm、Cargo、pip、Bundler等工具提供了下载命令，可实现依赖层与源码层分离缓存。
npm和Yarn缺乏原生下载命令，而BuildKit缓存挂载是从容器侧解决，非包管理器原生方案。

🧠 深度分析:

此优化能显著提升CI/CD流水线效率，减少因依赖层失效带来的构建时间波动，对大型项目或频繁构建场景价值巨大。
减少对社区资助注册中心（如crates.io、PyPI）的冗余流量，是开发者践行资源节约和社会责任的具体体现。
技术选型时，包管理器对Docker友好性的支持（如pnpm fetch）应成为考量因素，尤其在微服务和容器化部署成为主流的背景下。

📖 站内阅读原文（RSS全文）

Docker layer caching works best when each layer’s inputs are narrow, and a layer that only depends on a lockfile can survive most builds untouched because you’re usually changing application code, not dependencies. Most package managers combine downloading and installing into a single command though, so the layer that fetches from the registry also depends on source files, and any source change invalidates the layer and forces every dependency to re-download even when the lockfile is identical to last time.

That costs more than build time. crates.io, rubygems.org, and pypi.org all run on bandwidth donated by Fastly, and every redundant download in a Docker build is a cost someone else is volunteering to cover. npm is backed by Microsoft and Go’s module proxy by Google, so they can absorb it, but for the community-funded registries it adds up. It feels instant from the developer’s side, a few seconds of progress bars, so nobody thinks about the hundreds of HTTP requests firing against those services on every build where the lockfile has changed by even one line, or when you’re debugging a failed install and rebuilding the same image over and over.

If package managers exposed a download that populates the local cache from the lockfile and an install that works offline from that cache, Docker layer caching would handle the rest:

COPY lockfile . RUN pkg download COPY . . RUN pkg install --offline

go mod download

Go modules shipped with Go 1.11 in August 2018, and the community figured out the Docker pattern within weeks . It’s now the canonical Go Dockerfile pattern, recommended by Docker’s own documentation :

COPY go.mod go.sum ./ RUN go mod download COPY . . RUN CGO_ENABLED = 0 go build -o /app .

go mod download reads go.mod and go.sum and fetches everything without doing any resolution or building, and the layer caches when those two files haven’t changed.

Before Go 1.11, GOPATH -based dependency management didn’t have a clean two-file manifest that could be separated from source code for layer caching, and the design of go.mod and go.sum as small standalone files made this Docker pattern fall out naturally once modules landed.

go build can still contact the checksum database ( sum.golang.org ) after go mod download to verify modules not yet in go.sum . Setting GOFLAGS=-mod=readonly after the download step prevents any network access during the build.

pnpm fetch

pnpm is the only JavaScript package manager with a download-only command, and pnpm fetch was designed specifically for Docker. It reads pnpm-lock.yaml and downloads all packages into pnpm’s content-addressable store without reading package.json at all:

COPY pnpm-lock.yaml pnpm-workspace.yaml ./ RUN pnpm fetch --prod COPY . . RUN pnpm install -r --offline --prod

The download layer only depends on the lockfile, and the install step uses --offline so it never touches the network. In monorepos this is particularly useful because you don’t need to copy every workspace’s package.json before the download step, and pnpm’s authors thinking about container builds when they designed the CLI is the same kind of design awareness that made go mod download standard in Go.

cargo fetch

cargo fetch reads Cargo.lock and downloads all crate source into the registry cache. After fetching, --frozen (which combines --locked and --offline ) prevents any network access during the build:

COPY Cargo.toml Cargo.lock ./ RUN mkdir src && touch src/main.rs RUN cargo fetch --locked COPY . . RUN cargo build --release --frozen

The dummy src/main.rs is needed because cargo fetch requires a valid project structure even though it’s only reading the lockfile, and there’s been an open issue about removing that requirement since 2016.

Almost nobody uses cargo fetch in Dockerfiles. The Rust community skipped straight to caching compilation with cargo-chef , because compiling hundreds of crates is where builds spend most of their wall-clock time and downloads feel cheap by comparison. But every cargo build without a prior cargo fetch is still hitting crates.io for every crate whenever the layer rebuilds, and Fastly is absorbing that traffic whether it takes three seconds or thirty.

pip download

pip download fetches distributions into a directory, and pip install --no-index --find-links installs from that directory offline:

COPY requirements.txt . RUN pip download -r requirements.txt -d /tmp/pkgs COPY . . RUN pip install --no-index --find-links /tmp/pkgs -r requirements.txt

There’s a known bug where build dependencies like setuptools aren’t included in the download, so packages that ship only as source distributions can fail during the offline install, though most Python projects in 2026 ship as prebuilt wheels unless you’re doing something unusual with C extensions.

Neither Poetry nor uv have download-only commands. Poetry has had an open issue since 2020, and uv has one with over a hundred upvotes. Both suggest exporting to requirements.txt and falling back to pip.

bundle cache

Bundler has bundle cache --no-install , which fetches .gem files into vendor/cache without installing them, and bundle install --local installs from that cache without hitting the network:

COPY Gemfile Gemfile.lock ./ RUN bundle cache --no-install COPY . . RUN bundle install --local

In practice this has enough rough edges that it rarely gets used in Dockerfiles. Git-sourced gems still try to reach the remote even with --local , and platform-specific gems need --all-platforms plus bundle lock --add-platform to work across macOS development and Linux containers. The command was designed for vendoring gems into your repository rather than for Docker layer caching.

npm and yarn

npm has no download-only command. npm ci reads the lockfile and skips resolution, but downloads and installs as one atomic operation with no way to separate them, and there’s no --download-only flag or RFC proposing one.

Yarn Classic has an offline mirror that saves tarballs as a side effect of install, but no standalone download command. Yarn Berry has no fetch command either, despite multiple open issues requesting one.

The standard JavaScript Docker pattern is still:

COPY package.json package-lock.json ./ RUN npm ci COPY . .

When the lockfile hasn’t changed the layer caches and nothing gets downloaded, but when it has changed every package re-downloads from the registry, and pnpm is the only JavaScript package manager where you can avoid that.

BuildKit cache mounts

Docker BuildKit has --mount=type=cache , which persists a cache directory across builds so package managers can reuse previously downloaded packages even when the layer invalidates:

RUN --mount = type = cache,target = /root/.npm npm ci

Cache mounts solve the problem from the wrong end. The package manager has the lockfile and knows the cache format, but Docker doesn’t know any of that, which is why the Dockerfile author has to specify internal cache paths that vary between tools and sometimes between versions of the same tool. Not every build system supports BuildKit cache mounts either, and not every CI environment preserves them between builds, so a download command in the package manager itself would be more broadly useful.

Registry Funding Download command Offline install Used in practice?

Go module proxy Google go mod download implicit Yes, canonical

npm registry Microsoft pnpm fetch (pnpm only; npm and yarn have nothing) --offline pnpm yes, others no

crates.io Fastly (donated) cargo fetch --frozen Rarely

PyPI Fastly (donated) pip download (pip only; Poetry and uv have nothing) --no-index --find-links Rarely

rubygems.org Fastly (donated) bundle cache --no-install --local Rarely

Most package managers were designed around a persistent local cache on a developer’s laptop, ~/.cache or ~/.gem or ~/.npm , that warms up over time and stays warm. Ephemeral build environments start clean every time, and Docker layers are the only caching mechanism available, which means the network-dependent part of a build needs to be isolated from the rest for caching to work.

Opportunities:

• npm could add an npm fetch that reads package-lock.json and populates the cache without installing

• Poetry has had an open issue requesting a download command since 2020, and uv has one with strong community interest

• Bundler’s bundle cache --no-install would work if it handled git gems and cross-platform builds more reliably

• Cargo’s cargo fetch shouldn’t need a dummy source file to run a command that only reads the lockfile

11

Quoting Boris Cherny

📝 Simon Willison's Weblog

🏷️ AI/机器学习 🏷️ 职业发展

↗ 打开原文

📌 AI 摘要: Anthropic的Claude Code创造者认为，在AI时代，工程师的角色正在转变，但优秀工程师的重要性有增无减。

💡 核心要点:

AI时代工程师需承担提示工程、客户沟通、跨团队协调等新职责。
工程领域正在发生变革，其工作内容与方式正在被重塑。
Anthropic等AI前沿公司仍在积极招聘开发人员。

🧠 深度分析:

这揭示了AI辅助编程趋势下，工程师的核心价值正从纯编码向更高层次的决策与协调迁移。
对于从业者而言，提升沟通、产品定义与AI协作能力，可能比单纯追求编码速度更重要。

📖 站内阅读原文（RSS全文）

Someone has to prompt the Claudes, talk to customers, coordinate with other teams, decide what to build next. Engineering is changing and great engineers are more important than ever.

— Boris Cherny , Claude Code creator, on why Anthropic are still hiring developers

Tags: careers , anthropic , ai , claude-code , llms , coding-agents , ai-assisted-programming , generative-ai

12

Wagon’s algorithm in Python

📝 John D. Cook

🏷️ 编程语言 🏷️ 软件工程

↗ 打开原文

📌 AI 摘要: 文章完整实现了Stan Wagon的算法，用于在Python中寻找满足x² + y² = p的整数解，其中p是奇素数，并成功应用于大整数2²⁵⁵ - 19。

💡 核心要点:

算法核心是找到模p下的-1平方根，并应用修改的欧几里得算法。
实现中利用Python的isqrt函数处理大整数平方根，避免浮点精度问题。
通过寻找二次非剩余来构造模p下的-1平方根。

🧠 深度分析:

该算法为求解经典数论问题提供了高效、可实现的Python方案，尤其适用于密码学等领域中的大素数处理。
文章展示了如何结合数论（二次剩余）与基础算法（欧几里得算法）解决实际问题，具有教学和工程参考价值。
使用Python标准库的isqrt等函数处理大整数，体现了现代编程语言对高精度计算的原生支持优势。

📖 站内阅读原文（RSS全文）

The last three posts have been about Stan Wagon’s algorithm for finding x and y satisfying

x ² + y ² = p

where p is an odd prime.

The first post in the series gives Gauss’ formula for a solution, but shows why it is impractical for large p . The bottom of this post introduces Wagon’s algorithm and says that it requires two things: finding a quadratic non-residue mod p and a variation on the Euclidean algorithm.

The next post shows how to find a quadratic non-residue.

The reason Wagon requires a non-residue is because he need to find a square root of −1 mod p . The previous post showed how that’s done.

In this post we will complete Wagon’s algorithm by writing the modified version of the euclidean algorithm.

Suppose p is an odd prime, and we’ve found x such that x ² = −1 mod p as in the previous posts. The last step in Wagon’s algorithm is to apply the Euclidean algorithm to x and p and stop when the numbers are both less than √ p .

When we’re working with large integers, how do we find square roots? Maybe p and even √ p are too big to represent as a floating point number, so we can’t just apply the sqrt function. Maybe p is less than the largest floating point number (around 10 308 ) but the sqrt function doesn’t have enough precision. Floats only have 53 bits of precision, so an integer larger than 2 53 cannot necessarily be represented entirely accurately.

The solution is to use the isqrt function, introduced in Python 3.8. It returns the largest integer less than the square root of its argument.

Now we have everything necessary to finish implementing Wagon’s algorithm.

from sympy import legendre_symbol, nextprime from math import isqrt

def find_nonresidue(p): q = 2 while legendre_symbol(q, p) == 1: q = nextprime(q) return q

def my_euclidean_algorithm(a, b, stop): while a > stop: a, b = b, a % b return (a, b)

def find_ab(p): assert(p % 4 == 1) k = p // 4 c = find_nonresidue(p) x = pow(c, k, p) return my_euclidean_algorithm(p, x, isqrt(p)) Let’s use this to find a and b such that x ² + y ² = p where p = 2 255 − 19.

>>> a, b = find_ab(p := 2**255 - 19) >>> a 230614434303103947632580767254119327050 >>> b 68651491678749784955913861047835464643 >>> a**2 + b**2 - p 0 Finis . The post Wagon’s algorithm in Python first appeared on John D. Cook .

13

Instruction decoding in the Intel 8087 floating-point chip

📝 Ken Shirriff's blog

🏷️ 硬件 🏷️ 系统架构

↗ 打开原文

📌 AI 摘要: 本文通过逆向工程揭示了 Intel 8087 浮点协处理器如何与 8086/8088 CPU 协同工作，并详细解析了其复杂的指令解码机制。

💡 核心要点:

通过监控总线上的 ESCAPE 操作码识别指令，并利用 8086 计算内存地址。
指令结构基于 8086 的 ModR/M 字节，通过 MOD 位区分内存访问与内部操作。
芯片解码电路分散各处，微码 ROM 控制指令执行，总线接口单元负责通信与地址捕获。

🧠 深度分析:

该设计展示了早期异构计算中硬件协同的巧妙思路，将复杂地址计算卸载给主 CPU，简化了协处理器设计。
对指令位域的精细规划体现了硬件设计中在有限编码空间内实现丰富功能与简化解码的平衡艺术。
这种通过总线监听实现协作的机制，为理解现代多核/异构系统中处理器间通信提供了历史视角。

📖 站内阅读原文（RSS全文）

In the 1980s, if you wanted your IBM PC to run faster, you could buy the Intel 8087 floating-point coprocessor chip. With this chip, CAD software, spreadsheets, flight simulators, and other programs were much speedier. The 8087 chip could add, subtract, multiply, and divide, of course, but it could also compute transcendental functions such as tangent and logarithms, as well as provide constants such as π. In total, the 8087 added 62 new instructions to the computer.

But how does a PC decide if an instruction was a floating-point instruction for the 8087 or a regular instruction for the 8086 or 8088 CPU? And how does the 8087 chip interpret instructions to determine what they mean? It turns out that decoding an instruction inside the 8087 is more complicated than you might expect. The 8087 uses multiple techniques, with decoding circuitry spread across the chip. In this blog post, I'll explain how these decoding circuits work.

To reverse-engineer the 8087, I chiseled open the ceramic package of an 8087 chip and took numerous photos of the silicon die with a microscope. The complex patterns on the die are formed by its metal wiring, as well as the polysilicon and silicon underneath. The bottom half of the chip is the "datapath", the circuitry that performs calculations on 80-bit floating point values. At the left of the datapath, a constant ROM holds important constants such as π. At the right are the eight registers that the programmer uses to hold floating-point values; in an unusual design decision, these registers are arranged as a stack . Floating-point numbers cover a huge range by representing numbers with a fractional part and an exponent; the 8087 has separate circuitry to process the fractional part and the exponent.

Die of the Intel 8087 floating point unit chip, with main functional blocks labeled. The die is 5 mm×6 mm. Click this image (or any others) for a larger image.

The chip's instructions are defined by the large microcode ROM in the middle. 1 To execute an instruction, the 8087 decodes the instruction and the microcode engine starts executing the appropriate micro-instructions from the microcode ROM. In the upper right part of the chip, the Bus Interface Unit (BIU) communicates with the main processor and memory over the computer's bus. For the most part, the BIU and the rest of the chip operate independently, but as we will see, the BIU plays important roles in instruction decoding and execution.

Cooperation with the main 8086/8088 processor

The 8087 chip acted as a coprocessor with the main 8086 (or 8088) processor. When a floating-point instruction was encountered, the 8086 would let the 8087 floating-point chip carry out the floating-point instruction. But how do the 8086 and the 8087 determine which chip executes a particular instruction? You might expect the 8086 to tell the 8087 when it should execute an instruction, but this cooperation turns out to be more complicated.

The 8086 has eight opcodes that are assigned to the coprocessor, called ESCAPE opcodes. The 8087 determines what instruction the 8086 is executing by watching the bus, a task performed by the BIU (Bus Interface Unit). 2 If the instruction is an ESCAPE , the instruction is intended for the 8087. However, there's a problem. The 8087 doesn't have any access to the 8086's registers (and vice versa), so the only way that they can exchange data is through memory. But the 8086 addresses memory through a complicated scheme involving offsest registers and segment registers. How can the 8087 determine what memory address to use when it doesn't have access to the registers?

The trick is that when an ESCAPE instruction is encountered, the 8086 processor starts executing the instruction, even though it is intended for the 8087. The 8086 computes the memory address that the instruction references and reads that memory address, but ignores the result. Meanwhile, the 8087 watches the memory bus to see what address is accessed and stores this address internally in a BIU register. When the 8087 starts executing the instruction, it uses the address from the 8086 to read and write memory. In effect, the 8087 offloads address computation to the 8086 processor.

The structure of 8087 instructions

To understand the 8087's instructions, we need to take a closer look at the structure of 8086 instructions. In particular, something called the ModR/M byte is important since all 8087 instructions use it.

The 8086 uses a complex system of opcodes with a mixture of single-byte opcodes, prefix bytes, and longer instructions. About a quarter of the opcodes use a second byte, called ModR/M, that specifies the registers and/or memory address to use through a complicated encoding. For instance, the memory address can be computed by adding the BX and SI registers, or from the BP register plus a two-byte offset. The first two bits of the ModR/M byte are the "MOD" bits. For a memory access, the MOD bits indicate how many address displacement bytes follow the ModR/M byte (0, 1, or 2), while the "R/M" bits specify how the address is computed. A MOD value of 3, however, indicates that the instruction operates on registers and does not access memory.

Structure of an 8087 instruction

The diagram above shows how an 8087 instruction consists of an ESCAPE opcode, followed by a ModR/M byte. An ESCAPE opcode is indicated by the special bit pattern 11011 , leaving three bits (green) available in the first byte to specify the type of 8087 instruction. As mentioned above, the ModR/M byte has two forms. The first form performs a memory access; it has MOD bits of 00 , 01 , or 10 and the R/M bits specify how the memory address is computed. This leaves three bits (green) to specify the address. The second form operates internally, without a memory access; it has MOD bits of 11 . Since the R/M bits aren't used in the second form, six bits (green) are available in the R/M byte to specify the instruction.

The challenge for the designers of the 8087 was to fit all the instructions into the available bits in such a way that decoding is straightforward. The diagram below shows a few 8087 instructions, illustrating how they achieve this. The first three instructions operate internally, so they have MOD bits of 11; the green bits specify the particular instruction. Addition is more complicated because it can act on memory (first format) or registers (second format), depending on the MOD bits. The four bits highlighted in bright green ( 0000 ) are the same for all ADD instructions; the subtract, multiplication, and division instructions use the same structure but have different values for the dark green bits. For instance, 0001 indicates multiplication and 0100 indicates subtraction. The other green bits ( MF , d , and P ) select variants of the addition instruction, changing the data format, direction, and popping the stack at the end. The last three bits select the R/M addressing mode for a memory operation, or the stack register ST(i) for a register operation.

The bit patterns for some 8087 instructions. Based on the datasheet .

Selecting a microcode routine

Most of the 8087's instructions are implemented in microcode, implementing each step of an instruction in low-level "micro-instructions". The 8087 chip contains a microcode engine; you can think of it as the mini-CPU that controls the 8087 by executing a microcode routine, one micro-instruction at a time. The microcode engine provides an 11-bit micro-address to the ROM, specifying the micro-instruction to execute. Normally, the microcode engine steps through the microcode sequentially, but it also supports conditional jumps and subroutine calls.

But how does the microcode engine know where to start executing the microcode for a particular machine instruction? Conceptually, you could feed the instruction opcode into a ROM that would provide the starting micro-address. However, this would be impractical since you'd need a 2048-word ROM to decode an 11-bit opcode. 3 (While a 2K ROM is small nowadays, it was large at the time; the 8087's microcode ROM was a tight fit at just 1648 words.) Instead, the 8087 uses a more efficient (but complicated) instruction decode system constructed from a combination of logic gates and PLAs (Programmable Logic Arrays). This system holds 22 microcode entry points, much more practical than 2048.

Processors often use a circuit called a PLA (Programmable Logic Array) as part of instruction decoding. The idea of a PLA is to provide a dense and flexible way of implementing arbitrary logic functions. Any Boolean logic function can be expressed as a "sum-of-products", a collection of AND terms (products) that are OR'd together (summed). A PLA has a block of circuitry called the AND plane that generates the desired sum terms. The outputs of the AND plane are fed into a second block, the OR plane, which ORs the terms together. Physically, a PLA is implemented as a grid, where each spot in the grid can either have a transistor or not. By changing the transistor pattern, the PLA implements the desired function.

A simplified diagram of a PLA.

A PLA can implement arbitrary logic, but in the 8087, PLAs often act as optimized ROMs. 4 The AND plane matches bit patterns, 5 selecting an entry from the OR plane, which holds the output values, the micro-address for each routine. The advantage of the PLA over a standard ROM is that one output column can be used for many different inputs, reducing the size.

The image below shows part of the instruction decoding PLA. 6 The horizontal input lines are polysilicon wires on top of the silicon. The pinkish regions are doped silicon. When polysilicon crosses doped silicon, it creates a transistor (green). Where there is a gap in the doped silicon, there is no transistor (red). (The output wires run vertically, but are not visible here; I dissolved the metal layer to show the silicon underneath.) If a polysilicon line is energized, it turns on all the transistors in its row, pulling the associated output columns to ground. (If no transistors are turned on, the pull-up transistor pulls the output high.) Thus, the pattern of doped silicon regions creates a grid of transistors in the PLA that implements the desired logic function. 7

Part of the PLA for instruction decoding.

The standard way to decode instructions with a PLA is to take the instruction bits (and their complements) as inputs. The PLA can then pattern-match against bit patterns in the instruction. However, the 8087 also uses some pre-processing to reduce the size of the PLA. For instance, the MOD bits are processed to generate a signal if the bits are 0, 1, or 2 (i.e. a memory operation) and a second signal if the bits are 3 (i.e. a register operation). This allows the 0, 1, and 2 cases to be handled by a single PLA pattern. Another signal indicates that the top bits are 001 111xxxxx ; this indicates that the R/M field takes part in instruction selection. 8 Sometimes a PLA output is fed back in as an input, so a decoded group of instructions can be excluded from another group. These techniques all reduce the size of the PLA at the cost of some additional logic gates.

The result of the instruction decoding PLA's AND plane is 22 signals, where each signal corresponds to an instruction or group of instructions with a shared microcode entry point. The lower part of the instruction decoding PLA acts as a ROM that holds the 22 microcode entry points and provides the selected one. 9

Instruction decoding inside the microcode

Many 8087 instructions share the same microcode routines. For instance, the addition, subtraction, multiplication, division, reverse subtraction, and reverse division instructions all go to the same microcode routine. This reduces the size of the microcode since these instructions share the microcode that sets up the instruction and handles the result. However, the microcode obviously needs to diverge at some point to perform the specific operation. Moreover, some arithmetic opcodes access the top of the stack, some access an arbitrary location in the stack, some access memory, and some reverse the operands, requiring different microcode actions. How does the microcode do different things for different opcodes while sharing code?

The trick is that the 8087's microcode engine supports conditional subroutine calls, returns, and jumps, based on 49 different conditions ( details ). In particular, fifteen conditions examine the instruction. Some conditions test specific bit patterns, such as branching if the lowest bit is set, or more complex patterns such as an opcode matching 0xx 11xxxxxx . Other conditions detect specific instructions such as FMUL . The result is that the microcode can take different paths for different instructions. For instance, a reverse subtraction or reverse division is implemented in the microcode by testing the instruction and reversing the arguments if necessary, while sharing the rest of the code.

The microcode also has a special jump target that performs a three-way jump depending on the current machine instruction that is being executed. The microcode engine has a jump ROM that holds 22 entry points for jumps or subroutine calls. 10 However, a jump to target 0 uses special circuitry so it will instead jump to target 1 for a multiplication instruction, target 2 for an addition/subtraction, or target 3 for division. This special jump is implemented by gates in the upper right corner of the jump decoder.

The jump decoder and ROM. Note that the rows are not in numerical order; presumably, this made the layout slightly more compact. Click this image (or any other) for a larger version.

Hardwired instruction handling

Some of the 8087's instructions are implemented directly by hardware in the Bus Interface Unit (BIU), rather than using microcode. For example, instructions to enable or disable interrupts, or to save or restore stat

内容较长，当前仅展示前 14000 字。可点击“打开原文”查看完整内容。

14

Design Deconstruction

📝 Tedium: The Dull Side of the Internet.

🏷️ 产品设计 🏷️ 工具

↗ 打开原文

📌 AI 摘要: 文章核心探讨了设计工具脱离图形界面、以文本（代码）驱动的可能性，认为这种解构方式能结合设计的创意与数学精确性，并通过作者的个人实验验证了其可行性。

💡 核心要点:

作者尝试用ffmpeg在手机终端处理视频，再结合Canva完成设计，实现文本驱动图形输出。
文中提及Vimjoyer使用Motion Canvas等工具，以代码优先的方式在Vim中制作动画和视频。
作者利用Markdown、HTML/CSS及Remotion等命令行工具，自动化生成符合个人风格的设计素材。

🧠 深度分析:

这挑战了设计必须依赖GUI的传统范式，为设计师提供了更精确、可编程和可重复的工作流，可能提升效率与一致性。
文本驱动设计降低了工具复杂性，允许在资源受限环境（如手机）进行创作，拓展了设计场景和工具选择。
结合AI助手（如Claude Code）可降低技术门槛，使非专业开发者也能构建自定义设计管道，推动个性化工具创新。

📖 站内阅读原文（RSS全文）

Design is perhaps the software paradigm most wedded to the mouse and the GUI. But there’s no reason it can’t be text-driven. To me, the hard part about being creative is that you’re always trying to look for a new path. Sure, you’ve done things a certain way for a long time, and it’s worked for you. But it’s hard not to want to dabble in new directions just to see where it takes you, and hope that it shakes out a new idea or two. Which is perhaps the reason I’ve started to fixate on a weird idea—that design tools might sometimes work better without an attached graphical interface. Rather than graphics in, graphics out, maybe sometimes it should be text in, graphics out. The myth about design is that it’s a function of the creativity-driven right side of the brain. But I think that’s only half the story. See, with design, there’s a lot of hidden math involved. Ask your favorite newspaper or magazine designer about pica rulers and column lengths, and you’ll get what I’m saying. Put another way: Designers need to be creative problem solvers, painting the perfect canvas, but they also need to be pragmatic, considering the realities of “yes, it’s long, but we have to fit this text.” Tools like InDesign and Final Cut Pro have traditionally combined the canvas and the broader frameworks that make a good design, mixing tools with differing cognitive loads into one interface. But what if design needs to be a bit more deconstructed, where pieces are more separated out, perhaps not even graphical? What if you designed with code? Would that lead to better results? I wanted to find out. Hey, you never know when you’re gonna need a terminal in Android. The spark that caused my weird design-with-code obsession I stumbled upon the idea accidentally, but this weird interest grew out of some genuine frustration. I wanted to try a couple of experiments with vertical video, seeing if I liked it and how comfortable I felt with the idea. The problem is, I wanted it to match my general style, which is strongly built around a heavily filtered grayscale imagery. Every app I tried kind of sucked. CapCut, the ByteDance-produced app for creating TikTok videos, seemed unstable. A lot of other stuff came with spammy upsells. Plus I couldn’t quite get the design I wanted—a faded black and white look that’s a little pixelated, with a slightly choppy frame count. The only thing I actually liked that could edit mobile videos was Canva. However, it could only get me so far. So, to fill the gap, I did something weird: I started testing whether I could filter videos with ffmpeg to my liking in Termux, the Linux terminal program for Android. Then, in a second step, I’d move the videos to Canva, to finish the edit (including adding the text in my desired font/design). And I’ll be damned, it worked: https://www.youtube.com/shorts/Cuyd8H2fvd4 I became curious about pushing this idea further, to social objects, and started working on tools to build quick graphics from Markdown files all on my phone—something you can make happen with HTML and CSS, basically. Cool idea, worked pretty simply: I became curious about pushing this idea further, to social objects, and started working on tools to build quick graphics from Markdown files all on my phone—something you can make happen with HTML and CSS, basically. Cool idea, worked pretty simply: Every tech journalist in 1995 overestimated, then underestimated, the Zip drive. I thought that was enough, and I didn’t need to take this unusual thought any further, until I saw something that blew my mind: a full YouTube video—complete with animation, graphics, and so on, made in a terminal.

Sponsored By … You? If you find weird or unusual topics like this super-fascinating, the best way to tell us is to give us a nod on Ko-Fi . It helps ensure that we can keep this machine moving, support outside writers, and bring on the tools to support our writing. (Also it’s heartening when someone chips in.) We accept advertising, too! Check out this page to learn more .

This guy is nuts. I love what he’s doing. The guy who edits videos with Vim Even with my rendering experiment, there’s no way I would have said yes before a month ago, but then I saw something that really threw me for a loop: A dude who edits his YouTube videos in Vim . Even with my rendering experiment, there’s no way I would have said yes before a month ago, but then I saw something that really threw me for a loop: A dude who edits his YouTube videos in Vim . Look at this crazy-ass video. He made this in Vim! For the uninitiated, this is basically saying that you use scissors to cut a watermelon. I will admit it was by a guy named “Vimjoyer” whose gimmick is basically doing everything with the popular text editor. (I personally use nano like a lamer.) But fortunately, the how behind it doesn’t need vim to be useful. Essentially, he is using a tool called Motion Canvas to push his content around so that he can create animations on the fly, shifting them around as desired. This is not totally dissimilar to what Flash could do with ActionScript back in the day, but it’s deconstructed so it’s code-first, GUI interface second. I was curious, so I started messing around with it using the same on-my-phone format as the earlier ffmpeg experiment. Alas, Motion Canvas didn’t work all that well for such a constrained setting, as it required use of a browser. However, I spotted a similar tool, Remotion , that worked entirely within the command line. But one change precludes another—it needed Playwright , a headless browser tool. As it’s made, that doesn’t work in Termux at all, as Playwright doesn’t have any builds compatible with Qualcomm chips. But I found someone who had solved this exact problem , and that let me do this: I can write the copy for these social objects in Markdown—even chain them together—and have it make a bunch of social objects for me, all meticulously set up in my style. Sound like a lot of work to avoid working in a graphical interface? You bet your ass it is. On the plus side, you only really have to do a complex, repeatable task once (perhaps with some maintenance down the line). But the thing is, you can use tools like Claude Code to make these sorts of weird connections work—and maybe tell them, after the agent insists you can’t run Playwright on your phone, that it’s actually possible. Then, if you want to dive in further, that’s when you take the time to learn it yourself and build upon the idea you’ve been conjuring. (The trick I’ve been using lately: Tapping into the super-cheap DeepSeek Chat model via Claude Code Router , an implementation of Claude Code that lets you use models not made by Anthropic. That gives me additional room to screw around with oddball experiments like these, while being relatively minimal resource-wise. I put in $10 a month ago and have yet to run out, while still getting fairly decent results.) An example of what Typst can do. (via the Typst website) A new script for page layout This is a very cool idea, and it’s more than just a novelty. I honestly believe this basic text-driven ideal could be taken to some amazing new frontiers. Lately, I’ve been fascinated by Typst , a scripting technology that is seen as a competitor to LaTeX. (Let me take a pause here to admit that LaTeX users have been designing with code for a long time. And there are probably some people who build stuff using PostScript they coded by hand. I bow before you, as a guy who started out as designer.) It’s a tool that is designed for laying out technical documents, with an emphasis on things like math equations. But it could also be used to make all sorts of documents, like zines or even wall calendars. This is actually the perfect format to build a wall calendar, because it’s a highly templated format that can get very complex to manage in something like Affinity or InDesign. Here’s an example I built as a test: Longtime readers know that I have been threatening for years to sell a wall calendar, and 2027 might just be the year. But it goes further than that. To me, I think there’s an opportunity to separate concerns inventively. For example: Let’s say you go into Affinity or Inkscape to build an SVG with the basic shape of your layout, or even a basic background, but then you import that graphic into Typst format. That moves you from texture to copy-layout. This is what I mean about separating concerns. Too often, design software tries to awkwardly mesh together these processes in a way that makes nobody happy. Typst won’t get you all the way there, I will admit. It does not currently support blend modes, for example, meaning that you have to import raster graphics or SVGs to handle all of that. Same with clipping paths and masks. But I think there’s a world where Typst could have all of these things, making it an effective publishing tool without forcing you in canvas mode when you’d be better served by a framework. We have a pretty good text-based web design framework in the form of HTML, JavaScript, and CSS. With a few additions or some extensions, Typst could become that for print. It’s too bad the creator of Mou disappeared and took his project’s goodwill with him, because this was a genuinely influential idea. The popular blogging platform Ghost was initially based off of this design. Graphic designers are secretly left-brained people One thing that I think people don’t realize about graphic design, particularly the print form, is that it’s creativity, but there’s also math going on. It’s not that far removed from architecture, if you think about it. Any newspaper designer will tell you about pica rulers and column inches until the cows come home. The secret about news design if that it’s a bunch of right-brained people who can think left-brained when the moment shows itself. If you had asked me about this 15 years ago, I might have considered editorial design all right-brain thinking. But I think the left side of the brain was always there. I think the thing that ultimately made this all click was probably Markdown, particularly an editor that presented the split in a way I couldn’t ignore. Fairly forgotten at this point, but deeply influential at the time, the 2010s-era MacOS Markdown editor Mou basically let you lay out Markdown and see the visual output in real time. The story of Mou ended in tears—the designer basically ghosted a bunch of people after a crowdfunding campaign —but it still inspired me, personally. (The popular open-source editor MacDown, recently revived as MacDown 3000 , is something of a spiritual successor to the defunct Mou.) I’ve been trying to figure out a way to convey all of this, probably, ever since I started ShortFormBlog in 2009. That site began with the provocative idea that you could design individual posts at a micro level rather than making absolutely everything look the same—as long as you were willing to give everything the right framework to work within. We can translate that idea to all sorts of objects. We just need to think beyond the parameters in front of us. I’m not quite at the level of Vim video editor guy just yet, but it’s something to aspire to.

Non-Designy Links I’ve been on the lookout for interesting tools that support Linux, and one I caught was Neep , a paid tool that removes noise from voice calls. Krisp has this killer feature, too, but it doesn’t support Linux. We’ve lost some great musicians of late, particularly Greg Brown, the original guitarist of Cake, who wrote “ The Distance ,” easily one of the best songs of the ’90s. Still hods up. (Also, RIP to Brad Arnold of 3 Doors Down, who made an appearance in our “ Songs About Superman ” piece.) The AI-generated viral video of Brad Pitt and Tom Cruise fighting feels like a strong enough turning point for tech that Hollywood just lost its minds over it on Friday. Perhaps not a strong enough response. -- Alright, that’s all I’ve got. Find this one an interesting read? Share it with a pal ! And speaking of deconstructing things, you can’t get more back-to-basics than the simple brilliance of la machine .

15

tiny corp’s product – a training box

📝 the singularity is nearer

🏷️ AI/机器学习 🏷️ 硬件

↗ 打开原文

📌 AI 摘要: Tiny Corp 计划推出一种能持续学习、更新权重的本地AI硬件产品，以对抗当前云端大模型同质化、用户被边缘化的未来。

💡 核心要点:

当前主流LLM权重固定，导致思维多样性崩溃，未来可能只有少数几个同质化模型。
本地模型要胜出，关键在于能为每个用户或组织进行真正的全权重学习。
Tiny Corp 的长期产品愿景是销售能像生命一样学习用户价值观的本地硬件设备。

🧠 深度分析:

这挑战了当前以云端API为中心的AI服务模式，为追求个性化、数据隐私和低延迟的应用场景提供了新路径。
如果实现，将推动边缘计算和个性化AI硬件的发展，可能重塑AI产业链的价值分配。
该愿景的实现面临巨大技术挑战，包括高效的小规模持续训练算法和配套基础设施的成熟。

📖 站内阅读原文（RSS全文）

Our new Hong Kong office.

It’s starting to shape up what tiny corp’s product will be. It’s not much of a change from what we sell and do now, but the vision is clearer.

Every month, we see these LLMs become more and more human. However, there’s a major difference. They do not learn. Everyone has the same Claude/Codex/Kimi, with the same weights, the same desires, and the same biases. If current trends continue, the collapse in diversity will be staggering. To paraphrase:

I think there is a world market for maybe five people.

This is not the future I want to live in.

If trends continue where there’s a single model with frozen weights and all learning is in-context, the cloud will win. Except in some highly latency sensitive (fighting robots) or connectivity critical (self driving cars) environments, it will be cheaper to run in batch on the cloud.

The enshittification that came to the web won’t be the driving force to local models. We either live in a world where open models are so bad even user-hostile closed models are better, or open models are good enough, and competition to run them through sites like openrouter will prevent enshittification.

The only way local models win is if there’s some value in full on learning per user or organization. At that point, with entirely different compute needing to run per user, local will beat out cloud.

The open question is if everything that’s unique about you can fit in a 10 kB CLAUDE.md. If that’s true, we have a pretty sad future ahead. It’s the Attack of the Clones, swarms of identical minds you have no say over all varying in a small boxed-in way. This isn’t learning, it’s costuming . Everyone who has used these things knows how little of an impact prompting makes compared to the model. It’s the Internet funneled into a little box you can edit on your profile. Write 3 paragraphs about what makes you unique.

We have to build for a future where that isn’t true. 90% of people will choose the cloud, and what they will find is that they are no longer meaningfully in the loop. The dream is an AI product that will do your job for you while you continue to get paid. But this cannot exist, that’s way too much of a fee to pay to the middleman. If you choose the homogenous mind, you are superfluous and will be cut out. Is there anything uniquely valuable about you? And I mean honestly, not the self-esteem pumping speeches you may have heard in school. If there’s not, I have some bad news for you…

We already sell the hardware . Consumer GPUs still are the cheapest way to run models. There’s tons of work required on the infrastructure . The frontend will be the future iterations of OpenClaw and opencode . But the key distinction from what you have today is that your tinybox will learn. It will update the weights based on its interactions with you. Like living things.

This is many years away. Currently, we are focused on large LLM training (even running these things is hard, have you tried to use vLLM not on NVIDIA?) and generic infrastructure for driving GPUs. But this is the long term idea.

Not API keyed SaaS clones. Something that lives in your house and learns your values. Your child.

16

Microsoft Game Pass Ultimate Billing Fraud

📝 Jayden’s Blog

🏷️ 网络安全 🏷️ 软件工程

↗ 打开原文

📌 AI 摘要: 作者指控微软在用户已明确关闭自动续费的情况下，三年后仍从其一次性信用卡扣费，认为此举涉嫌欺诈。

💡 核心要点:

作者为购买Xbox Series X并利用金会员转换技巧订阅了Game Pass Ultimate。
订阅时作者确认已关闭所有自动续费设置，并使用了单次信用卡号以防万一。
三年后，作者收到微软扣费邮件，发现自动续费被莫名重新开启。

🧠 深度分析:

此事件暴露了订阅服务中‘暗模式’或后台设置变更的风险，损害用户信任，可能引发集体诉讼或监管审查。
对用户而言，使用虚拟/一次性信用卡是有效的防御措施，但企业应确保用户设置不被擅自修改。
这提醒所有技术服务提供商，清晰的账单设置和用户授权是商业伦理与合规的基本要求。

📖 站内阅读原文（RSS全文）

I purchased an Xbox Series X out of some misplaced sense of nostalgia for the 360 and because I needed a 4K player. At the time you could still do the trick where you load up on Xbox Live Gold and then convert it to Game Pass Ultimate cheaply.

I signed up for it and then made absolutely sure to disable any autorenewing settings everywhere I could. I remember seeing something to the effect of “Your subscription will expire 2/2026 and will not renew.

At the time I still trusted Microsoft a little, but I made sure to use a one time use credit card number, just in case.

Lo and behold, I just got this email:

Conveniently for those liars and cheats at Microsoft, somehow in the intervening three years autorenew got turned back on. Oopsie whoopsie sowwy 👉👈!

I don’t know how this isn’t outright fraud.

17

Reading list 02/14/26

📝 Construction Physics

🏷️ 硬件 🏷️ 技术趋势

↗ 打开原文

📌 AI 摘要: 本周阅读清单聚焦于建筑、住房与制造业的技术与政策动态，揭示了智能家居发展停滞、住房政策改革及制造业供应链竞争等核心议题。

💡 核心要点:

智能家居技术仍面临用户体验差、不友好的困境，多年未得到根本改善。
美国国会通过法案，取消了预制房屋必须使用钢底盘的规定，以降低住房成本。
AI数据中心需求激增，推动康宁公司研发更薄更坚韧的光纤电缆。

🧠 深度分析:

住房政策改革（如取消钢底盘要求）可能降低预制房屋成本，是应对住房短缺问题的具体技术性尝试。
制造业动态（如光纤需求、中国低价竞争）反映了全球供应链重组和技术竞争对本土产业构成的挑战与机遇。
智能家居技术长期停滞，表明该领域在易用性和系统集成上存在深层技术或商业模式障碍，影响其大规模普及。

📖 站内阅读原文（RSS全文）

• •

The Chronicle of Georgia, via Wikipedia . Welcome to the reading list, a weekly list of news and links related to buildings, infrastructure, and industrial technology. Roughly 2/3rds of the reading list is paywalled, so for full access become a paid subscriber. Housekeeping items this week: • My book is a finalist for the Manhattan Institute’s Hayek Book Prize .

Housing The Atlantic has a piece on how difficult and user-unfriendly most smart home technology still is. This was true when Gizmodo published its 2015 article Why Is My Smart Home So Fucking Dumb? , and it seems like it’s still true today. [ The Atlantic ] The Department of Justice is apparently considering opening an antitrust probe into US homebuilders, possibly due to their coordination on prices through a trade group. “Leading Builders of America”. [ Bloomberg ] The US house of representatives passes the Housing in the 21st Century Act. This is the house version of the ROAD to Housing Act which was passed by the senate back in October, and which I talked about on Statecraft . Among other things these bills remove the requirement that manufactured (mobile) homes have steel chassis, which the industry has long complained about. [ X ] Trump and newly elected New York mayor Zohran Mamdani are apparently both enthusiastic about NYC zoning reform. [ Politico ] Americans are taking on more and more credit card debt, but mortgage delinquencies so far remain fairly flat. [ X ] • •

Buildings are apparently more frequently collapsing in the Southern Mediterranean, possibly due to increased erosion due to rising sea levels. “Alexandria, a historic and densely populated port city in Egypt representative of several coastal towns in the Southern Mediterranean, has experienced over 280 building collapses along its shorelines over the past two decades, and the root causes are still under investigation.” [ Wiley ] [ Usc ] Sunderji’s Paradox: the rich spend a smaller fraction of their income on their housing than the poor, but as countries get richer these fractions don’t change. [ Substack ] • •

“London has been set a target of building 88,000 new homes per year over the next decade. Last year construction started on just 5,891 — 94 per cent below target, a 75 per cent year-on-year decline, the steepest drop in the country, the lowest tally since records began almost 40 years ago and the lowest figure for any major city in the developed world this century.” [ FT ] • •

Manufacturing The WSJ has a piece on Corning, the glass company that’s manufacturing the suddenly-in-demand fiber optic cable for AI data centers. “In 2018, Weeks and O’Day went to Dallas to tour a data center owned by Meta, then known as Facebook. They marveled at the demand for fiber-optic cabling to connect all the servers inside that giant warehouse. Facebook was using a mix of copper cables and existing fiber optics, but found both ill-suited to the task. This spurred Corning’s engineers to make their cables thinner, but also tougher, so they could withstand tight bends, says Claudio Mazzali, Corning’s head of research. Five years later, ChatGPT made its debut, and demand for fiber-powered data centers exploded. “We’re thankful that we made the trip in 2018 and thankful that we made the bet,” says O’Day. At the time, they had no idea whether it would be a good investment or a dud, he adds.” [ Wall Street Journal ] In other glass manufacturing news, the WSJ also has a piece about windshield manufacturers upset about a US-based, Chinese-owned windshield factory making windshields for far cheaper than they can. [ Wall Street Journal ] Bloomberg has a piece on whether its only a matter of time before Chinese cars are available in the US. One interesting point is that it’s actually Korean and Japanese imports (which dominate the low end of the US market), not US brands, which might be most threatened by an influx of low-priced Chinese cars. [ Bloomberg ] BYD’s January sales were 30% lower than last year. [ X ] A US drone manufacturer was booted out of their space at the Brooklyn Navy Yard, apparently in part due to activist pressure upset that they were supplying drones to Israel. [ Mondoweiss ] [ X ] The Whitehouse released a maritime action plan for revitalizing US shipbuilding. I haven’t had a chance to read through it closely, but it seems to be a collection of a few dozen policy recommendations. [ White House ] We’ve previously noted that a big drawback of tariffs is that they can make domestic manufacturing less competitive by jacking up the price for inputs to manufacturing. Now the Trump Administration plans to relax the tariffs of metal and aluminum. [ FT ]

18

Book Review: 20 Goto 10 - 10101001 facts about retro computers by Steven Goodwin ★★★★☆

📝 Terence Eden’s Blog

🏷️ 硬件 🏷️ 其他

↗ 打开原文

📌 AI 摘要: 这是一篇关于复古计算机主题书籍《20 Goto 10》的书评，作者认为该书内容有趣、编排独特，但受限于出版问题难以购买。

💡 核心要点:

书籍采用非线性结构，章节结尾有“GOTO”选项引导阅读路径。
内容包含近200篇短文，涵盖轶事、技术概述和冷知识。
书中部分知识是只有特定爱好者才会感兴趣的晦涩技术细节。

🧠 深度分析:

该书独特的非线性阅读设计，模仿了早期计算机程序和文字冒险游戏，增强了阅读的互动性和趣味性。
书评指出该书因出版方问题难以购买，这反映了小众技术出版物在发行和获取上面临的现实挑战。
书籍内容对大众实用性有限，但精准满足了复古计算爱好者的怀旧与探索需求，定位清晰。

📖 站内阅读原文（RSS全文）

This is an excellent "dipping" book. There are nearly 200 articles ranging from short anecdotes, multi-page synopses of complex topics, and quirky little asides. Rather than a linear history of computing, each short chapter ends with a multiple-choice "GOTO".

From there, you take a meandering wander throughout retro-computing lore.

Some paths lead to dead-ends (a delightful little Game-Over experience) while others will send you round in loops (much like any text adventure). I've no idea if I actually read everything - although I did stumble onto some Easter Eggs!

Some of the knowledge in here is of the geeky arcane trivia which is of no use to man nor beast - yet strangely compelling to anyone who remembers POKE, CHAIN, and all the other esoteric commands. Some of the stories you'll undoubtedly heard before. Others are deliciously obscure.

Sadly, the book is caught up in the continuing Unbound drama so is rather hard to buy. There are signed copies available from The Centre for Computing History .

I'm grateful to the kind friend who lent me their copy.

19

Quoting Thoughtworks

📝 Simon Willison's Weblog

🏷️ 软件工程 🏷️ 职业发展

↗ 打开原文

📌 AI 摘要: 文章核心观点是，AI工具不会淘汰初级开发者，反而能加速其成长，而行业真正的挑战在于如何帮助大量中级工程师适应新环境。

💡 核心要点:

AI工具能帮助初级开发者更快度过初期产出为负的阶段。
初级开发者比高级工程师更擅长使用AI工具，因其没有旧习惯束缚。
行业主体是中级工程师，他们缺乏适应新环境的基础，再培训难度大。

🧠 深度分析:

这挑战了AI将取代初级岗位的流行叙事，可能影响企业的人才招聘与培养策略。
中级工程师的技能缺口是行业普遍难题，尚无成熟解决方案，这凸显了结构性转型的挑战。
学徒制、轮岗等传统方法被探讨但未证实有效，企业需探索新的持续学习模式以应对技术变革。

📖 站内阅读原文（RSS全文）

The retreat challenged the narrative that AI eliminates the need for junior developers. Juniors are more profitable than they have ever been. AI tools get them past the awkward initial net-negative phase faster. They serve as a call option on future productivity. And they are better at AI tools than senior engineers, having never developed the habits and assumptions that slow adoption.

The real concern is mid-level engineers who came up during the decade-long hiring boom and may not have developed the fundamentals needed to thrive in the new environment. This population represents the bulk of the industry by volume, and retraining them is genuinely difficult. The retreat discussed whether apprenticeship models, rotation programs and lifelong learning structures could address this gap, but acknowledged that no organization has solved it yet.

— Thoughtworks , findings from a retreat concerning "the future of software engineering", conducted under Chatham House rules

Tags: ai-assisted-programming , careers , ai

20

AI twitter's favourite lie: everyone wants to be a developer

📝 Westenberg.

🏷️ AI/机器学习 🏷️ 软件工程

↗ 打开原文

📌 AI 摘要: 文章驳斥了“AI将让每个人都成为软件开发者”的流行观点，指出绝大多数人根本不想自己构建软件，他们只想使用现成的、无摩擦的解决方案。

💡 核心要点:

大众偏好购买现成方案而非自行构建，SaaS经济即是明证。
即使技术壁垒消失，定义需求的概念化壁垒依然存在。
AI狂热者常混淆专业开发者与普通大众的根本需求差异。

🧠 深度分析:

这提醒AI产品设计应聚焦于增强现有工具易用性，而非假设用户想成为创造者。
对技术趋势的判断需警惕‘自我投射’偏差，避免将极客爱好误认为普遍需求。
企业应投资于让软件更智能地理解用户意图，而非仅提供低代码构建工具。

📖 站内阅读原文（RSS全文）

Twitter's latest consensus on inevitability: now that large language models can write code, everyone will become a software developer. People, you see, have problems, and software solves problems, and AI removes the barrier between people and software, therefore everyone will build their own software. It's a syllogism, after a fashion, but its premise = so wildly disconnected from how actual humans behave that it borders on fantasy. Because the average punter does not want to build software. They don't want to prompt software. They don’t want to describe software. They don't particularly want to think about software. They want to tap, swipe and scroll with zero friction and next-to-zero cognitive input. They want their problems to go away, and they would very much prefer if that happened without them having to open a terminal, a chat window, or anything else that reminds them of work. This is damn-near universally applicable. Most people are not tinkerers There's a deep assumption embedded in the "everyone will build" thesis, that most people are latent creators held back only by technical barriers. Remove the barriers, and creation floods forth. But we've run this before. Desktop publishing tools became accessible in the 1980s with the Macintosh and PageMaker. Did everyone start designing their own newsletters? A handful did, and the rest continued to hire designers or, more commonly, didn't make newsletters at all. WordPress has made it trivially easy to build a website for over twenty years now, and the vast majority of small business owners still pay someone else to do it, or they use a template and never touch it again. The people excited about vibe coding are, almost by definition, people who were already interested in building things, and they're projecting their own enthusiasm onto a general population that has repeatedley demonstrated a preference for buying solutions over building them. And why wouldn't they prefer that? Building things is cognitively expensive, whether or not it’s financially viable. And even when the technical barrier falls to zero, the conceptualisation barrier remains. You still have to know what you want, specify it clearly, evaluate whether what you got is what you wanted, and iterate if it’s not. That's work // effort and for most people it is accompanied by functionally zero dopamine. The spec problem nobody talks about An old joke: the hardest part of building software is figuring out what the software should do. This has been true for decades, and AI hasn't changed it. If anything, AI has made the problem more visible. When the bottleneck was writing code, you could blame the difficulty of ~programming for why your project never got off the ground. Now that an AI can write code in seconds, the bottleneck is clearly, embarrassingly, you // me // us. This is the part that the AI manics keep skating past. They demo an app built in ten minutes and declare that software development has been democratized. But the demo is always something with a clear spec: a to-do list, a calculator, a simple game with obvious rules. The rest of the world’s problems don't come pre-decomposed into clean specifications. The rest of the world may not even be able to fully articulate what’s broken and what they want fixed. What people actually want Most folks don't want to build a custom CRM. I do! I might! I couldn't be more excited about what this era unlocks. But I am not most people. They want to sign up for one that works. They don't want to create their own budgeting app. They want Mint or YNAB to do the job. The entire SaaS economy exists as proof that people will pay monthly fees to avoid having to build or even configure things themselves. And is there anything wrong with that preference? The division of labor exists for good reasons, and Adam Smith figured this out in 1776 and he was a good deal smarter than a good many of us. What people will actually do with AI is use AI-enhanced versions of existing products, with smarter search and better autocomplete inside the tools they already have. The revolution won't look like a hundred million people vibe coding custom apps. It'll look like existing software getting better at understanding what users want and doing it for them, which is what good software has always tried to do. The tech industry has a long history of confusing what power users want with what everyone wants. The folks on AI Twitter who are building apps every weekend with Claude and GPT are having a great time, and the tools they're using are the same ones I’m obsessing over most of my waking hours. But we are a self-selected sample of tinkerers and builders, and the conclusions they're drawing about the general population say more about their own relationship with technology than about anyone else's. Most people, given a magic wand, would not wish for the ability to write software. They'd wish for their sofware to work properly without them having to do fuck-all.

21

Package Management Namespaces

📝 Andrew Nesbitt

🏷️ 软件工程 🏷️ 开源项目

↗ 打开原文

📌 AI 摘要: 文章分析了包管理器的三种命名空间策略（扁平、作用域、层级）及其在命名稀缺性、安全性和治理方面的核心权衡。

💡 核心要点:

扁平命名空间（如PyPI）导致好名字被抢占和易受typosquatting攻击。
作用域命名空间（如npm）通过‘组织/包名’格式缓解冲突，但带来治理开销。
层级命名空间（如Maven）将命名权与DNS绑定，但存在域名过期导致的安全风险。

🧠 深度分析:

命名空间设计是包管理器的基石决策，深刻影响生态系统的可发现性、安全性和长期维护成本。
MavenGate事件揭示了依赖外部系统（如DNS）进行身份验证的持续风险，需要注册表进行主动监控。
对于新包管理器，强制作用域命名（如Packagist）能避免历史包袱，但会牺牲部分用户体验的简洁性。

📖 站内阅读原文（RSS全文）

Every package needs a name. The rules for how those names work is one of the most consequential decisions a package manager makes, and one of the hardest to change later. I categorized the approaches previously and touched on the tradeoffs briefly.

Flat namespaces

RubyGems, PyPI, crates.io, Hex, Hackage, CRAN, and LuaRocks all use flat namespaces: one global pool of names, first-come-first-served. You pick a name, and if nobody has it, it’s yours.

This gives you gem install rails , pip install requests , cargo add serde . The names are short, memorable, and greppable, with no punctuation to remember and no organization to look up.

At scale, though, good names run out. Someone registers database on day one and never publishes a real package. Or they publish something, abandon it, and the name sits there forever, pointing at a library last updated in 2013. PyPI has over 600,000 projects. Many of the short, obvious names were claimed years ago by packages with single-digit downloads.

Name scarcity creates pressure, and you end up with python-dateutil because dateutil was taken, beautifulsoup4 because beautifulsoup was the old version, or pillow because the original PIL package was abandoned and PyPI doesn’t recycle names. New developers have to learn not just what to install but which of several similar-sounding packages is the right one.

Flat namespaces also make typosquatting straightforward. Someone registers reqeusts next to requests and waits. The attack works because there’s nothing between the user’s keystrokes and the registry lookup, no organization to verify and no hierarchy to navigate, just a string match against a flat list.

Some registries add normalization rules to limit this. PyPI treats hyphens, underscores, and dots as equivalent, so my-package and my_package resolve to the same thing. crates.io does similar normalization. RubyGems doesn’t, which is why both stripe and stripe-ruby can coexist as unrelated packages.

Scoped namespaces

npm added scopes in 2014. Instead of just babel-core , you could publish @babel/core . Packagist has always used vendor/package format: symfony/console , laravel/framework . JSR, Ansible Galaxy, Puppet Forge, and others follow similar patterns.

Scopes split the package name into two parts: who published it, and what they called it. Different organizations can use the same package name without collision, so @types/node and @anthropic/node coexist without confusion.

npm’s implementation is interesting because scopes are optional. You can still publish unscoped packages to the flat namespace. So npm actually has two systems running in parallel: a flat namespace for legacy packages and a scoped namespace for newer ones.

Most of the ecosystem’s most-used packages ( express , lodash , react ) predate scopes and sit in the flat namespace. Scopes are most common for organizational packages (everything under @angular/ , for example) and type definitions ( @types/ ). And because so much of the ecosystem depends on unscoped names, npm can never require scopes without breaking the world.

Packagist required scopes from the start. Every Composer package is vendor/package , no exceptions. This avoided the split-namespace problem npm has, but it means you need to know the vendor name. Is it guzzlehttp/guzzle or guzzle/guzzle ? You have to look it up. And vendor names themselves are first-come-first-served, just pushing the squatting problem up one level. The stakes are higher, though, because squatting a vendor name locks out an entire family of package names rather than just one. Someone could register the google vendor on Packagist before Google gets there, and that blocks every google/* package at once.

Scopes also require governance. Who decides that @babel belongs to the Babel team? npm ties scopes to user accounts and organizations, which means you need account management, ownership transfer procedures, and dispute resolution. When a maintainer leaves a project, their scoped packages might need to move. This is solvable but adds operational overhead that flat registries avoid.

Hierarchical namespaces

Maven Central uses reverse-domain naming: org.apache.commons:commons-lang3 , com.google.guava:guava . The group ID is supposed to correspond to a domain you control.

The reverse-domain approach ties naming authority to DNS. If you own example.com , you can publish under com.example . This defers governance to the existing DNS system rather than requiring the registry to manage name ownership. Maven Central enforces this by requiring you to prove domain ownership, or for projects without their own domain, to use io.github.username as a fallback.

That fallback is interesting because it quietly undermines the premise: the whole point of reverse-domain naming is that you prove ownership of infrastructure you control, but io.github.username just defers to GitHub’s namespace. It’s URL-based naming wearing a reverse-domain costume.

Organizations with stable domains get clean namespaces out of this. Apache, Google, and Spring all have clear homes. The trade-off is verbose identifiers. org.springframework.boot:spring-boot-starter-web is a lot of characters. IDE autocompletion papers over this in Java, but the verbosity is real when reading build files or discussing dependencies.

Domain ownership is also less stable than it looks. Companies get acquired and change domains. Open source projects move between hosting organizations. A package published under com.sun.xml in 2005 might need to live under com.oracle.xml after the acquisition, except it can’t, because changing the group ID would break every project that depends on the old one. So old names persist as historical artifacts.

The hierarchy also doesn’t prevent all squatting. Someone could register a domain specifically to claim a Maven namespace. More concerning is domain resurrection: when a domain expires after its owner has already registered a Maven group ID, anyone can buy that domain and potentially claim the namespace. Maven Central verifies domain ownership when you first register a group ID, requiring a DNS TXT record, but that verification is a point-in-time check.

In January 2024, security firm Oversecured published MavenGate , an analysis of 33,938 domains associated with Maven group IDs. They found that 6,170 of them, roughly 18%, had expired or were available for purchase. The affected group IDs included widely-used libraries like co.fs2 , net.jpountz.lz4 , and com.opencsv . A new owner of any of those domains could publish new versions under the existing group ID. Existing artifacts on Maven Central are immutable so old versions wouldn’t change, but build files that pull the latest version would pick up the attacker’s release.

Sonatype responded by disabling accounts tied to expired domains and tightening their verification process, but they haven’t announced ongoing domain monitoring. PyPI, facing the same problem with account email domains, built automated daily checks in 2025 and found around 1,800 accounts to unverify.

Clojars shows what happens when a registry in the Maven ecosystem takes a different approach. Clojure libraries are distributed as Maven artifacts, but Clojars originally let you use any group ID without verification. You could publish under hiccup or ring with no domain proof. This was simpler for the Clojure community, where most libraries are small and maintained by individuals, but it meant Clojars had a much more relaxed namespace than Maven Central.

Since build tools can pull from both registries, the gap created a dependency confusion risk: an attacker could register an unverified group on Clojars that shadows a legitimate Maven Central library. In 2021, after dependency confusion attacks became widely understood, Clojars started requiring verified group names for new projects, adopting the same reverse-domain convention as Maven Central. Existing projects with unverified groups were grandfathered in, so the old flat names still exist alongside the new hierarchical ones.

URL-based identifiers

Go modules use import paths that are URLs: github.com/gorilla/mux , golang.org/x/crypto . There’s no registration step. The URL points to a repository, and the module system fetches code from there (or from the Go module proxy, which caches it).

This model sidesteps the registry as naming authority entirely. You publish code to a repository and the URL is the identifier, with no approval step required. Name collisions don’t arise because URLs are globally unique by construction, and owning the repo means owning the name.

Names become tied to hosting infrastructure, though. When github.com/user/repo is the package identity, a GitHub org rename breaks every downstream consumer. Go addressed this with the module proxy, which caches modules so they survive repo disappearance, but the name still reflects the original location even if the code has moved. Import paths like github.com/golang/lint that redirect to golang.org/x/lint create confusion about which is canonical. And your package identity depends on a third party either way: GitHub controls the github.com namespace, so if they ban your account or the organization renames, your package identity changes. You’ve traded one governance dependency for another, a hosting platform instead of a registry.

“No registration step” has its own consequences. Without a registry to mediate names, there’s no obvious place to check for existing packages, no search, no download counts, no centralized vulnerability database. Go built most of these features separately with pkg.go.dev and the module proxy. The URL-based naming stayed, but the surrounding infrastructure converged toward what registries provide anyway, just assembled differently.

Deno launched with raw URL imports and eventually built JSR , a scoped registry with semver resolution, because URL imports created problems they couldn’t solve at the URL layer: duplicated dependencies when the same package was imported from slightly different URLs, version management scattered across every import statement, and reliability issues when hosts went offline. You can start without a registry, but the things registries do (search, versioning, deduplication, availability) keep needing to be solved, and solving them piecemeal tends to reconverge on something registry-shaped.

Swift Package Manager

Apple hired Max Howell to build SwiftPM in 2015. He’d created Homebrew and used both CocoaPods and Carthage heavily, so he arrived with strong opinions about how a language package manager should work. As he told The Changelog : “I’d been involved with CocoaPods and Carthage and used them heavily, and obviously made Homebrew, so I had lots of opinions about how a package manager should be.” He was drawn to decentralization, something he wished Homebrew had from the start.

Carthage had already demonstrated the approach in the Apple ecosystem, launching in 2014 as a deliberate reaction against CocoaPods’ centralized registry, using bare Git URLs with no registry at all. SwiftPM followed the same path, using Git repository URLs as package identifiers with no central registry.

Go made the same choice but then spent years building infrastructure around it: a module proxy that caches source in immutable storage so deleted repos still resolve, a checksum database ( sum.golang.org ) that uses a transparency log to guarantee every user gets identical content for a given version, and pkg.go.dev for search and discovery.

SwiftPM doesn’t have any of this yet. Every swift package resolve clones directly from the Git host. If a repo disappears, resolution fails with no fallback. SwiftPM records a fingerprint per package version the first time it downloads it, but that fingerprint lives on your machine only. There’s no global database to verify that what you downloaded matches what everyone else got, no way to detect a targeted attack serving different content to different users.

A 2022 Checkmarx study found thousands of packages across Go and Swift vulnerable to repo-jacking, where an attacker registers an abandoned GitHub username and recreates a repo that existing packages still point to. Go’s proxy mitigates this because cached modules don’t re-fetch from the source, but SwiftPM has no such layer.

The pieces to fix this are partly in place. Apple defined a registry protocol (SE-0292, shipped in Swift 5.7) and built client support for it in SwiftPM, including package signing. The client tooling is ready, the protocol is specified, and the ecosystem is still small enough that introducing a namespace layer wouldn’t require the kind of painful migration that npm or PyPI face. The Swift Package Index , community-run and Apple-sponsored, already tracks around 12,000 packages. What’s missing is the public registry service itself and the integrity infrastructure around it, and the window for adding these before the ecosystem’s size makes it much harder is not open forever.

The migration problem

As I wrote about in Package Management is a Wicked Problem , once PyPI accepted namespace-less package names, that was permanent. If PyPI added mandatory namespaces tomorrow, every existing requirements.txt , every tutorial, every CI script would need updating. The new system would have to support both namespaced and un-namespaced packages indefinitely. You haven’t replaced the flat namespace, you’ve just added a layer on top of it.

npm’s experience shows what this looks like in practice. Scoped packages have been available since 2014, but most of the ecosystem still uses flat names. The existence of scopes didn’t make express become @expressjs/express because too much already depends on the existing name. Scopes ended up being used primarily for new packages and organizational groups rather than as a migration path for the existing namespace.

NuGet went through a partial migration. It added package I

内容较长，当前仅展示前 14000 字。可点击“打开原文”查看完整内容。

22

Justifying text-wrap: pretty

📝 matklad

🏷️ Web开发 🏷️ 产品设计

↗ 打开原文

📌 AI 摘要: 文章指出Safari率先实现了CSS的`text-wrap: pretty`特性以优化排版，但其与`text-align: justify`结合使用时，因算法目标冲突导致词间距过宽，破坏了美观性。

💡 核心要点:

年Safari率先提供了`text-wrap: pretty`的合理实现，用于智能换行。
智能换行算法（源于Knuth-Plass）旨在平衡行宽，但设定目标宽度略小于容器最大宽度。
当智能换行与两端对齐结合时，系统性的宽度预留导致词间距被过度拉伸，影响视觉效果。

🧠 深度分析:

这揭示了CSS高级排版特性组合使用时可能产生的意外副作用，提醒开发者在追求美观排版时需注意特性间的兼容性。
该问题可能推动浏览器引擎（如WebKit）进一步优化排版算法的协同工作方式，提升Web整体排版质量。
对于前端开发者而言，在当前阶段需谨慎同时使用这两个属性，或通过调整容器宽度等方式进行视觉补偿。

📖 站内阅读原文（RSS全文）

Justifying text-wrap: pretty

Feb 14, 2026 Something truly monumental happened in the world of software development in 2025. Safari shipped a reasonable implementation of text-wrap: pretty : https://webkit.org/blog/16547/better-typography-with-text-wrap-pretty/ . We are getting closer and closer to the cutting-edge XV-century technology. Beautiful paragraphs!

We are not quite there yet, hence the present bug report.

A naive way to break text into lines to form a paragraph of a given width is greediness: add the next word to the current line if it fits, otherwise start a new line. The result is unlikely to be pretty — sometimes it makes sense to try to squeeze one more word on a line to make the lines more balanced overall. Johannes Gutenberg did this sort of thing manually, to produce a beautiful page above. In 1981, Knuth and Plass figured out a way to teach computer to do this, using dynamic programming, for line breaking in TeX.

Inexplicably, until 2025, browsers stuck with the naive greedy algorithm, subjecting generations of web users to ugly typography. To be fair, the problem in a browser is harder version than the one solved by Gutenberg, Plass, and Knuth. In print, the size of the page is fixed, so you can compute optimal line breaking once, offline. In the web context, the window width is arbitrary and even changes dynamically, so the line-breaking has to be “online”. On the other hand, XXI century browsers have a bit more compute resources than we had in 1980 or even 1450!

Making lines approximately equal in terms of number of characters is only half-way through towards a beautiful paragraph. No matter how you try, the length won’t be exactly the same, so, if you want both the left and the right edges of the page to be aligned, you also need to fudge the spaces between the words a bit. In CSS, text-wrap: pretty asks the browser to select line breaks in an intelligent way to make lines roughly equal, and text-align: justify adjusts whitespace to make them equal exactly.

Although Safari is the first browser to ship a non-joke implementation of text-wrap , the combination with text-align looks ugly, as you can see in this very blog post. To pin the ugliness down, the whitespace between the words is blown out of proportion. Here’s the same justified paragraph with and without text-wrap: pretty :

The paragraph happens to look ok with greedy line-breaking. But the “smart” algorithm decides to add an entire line to it, which requires inflating all the white space proportionally. By itself, either of

p { text-wrap: pretty; text-align : justify; }

looks alright. It’s just the combination of the two that is broken.

This behavior is a natural consequence of implementation. My understanding is that the dynamic programming scoring function aims to get each line close to the target width, and is penalized for deviations. Crucially, the actual max width of a paragraph is fixed: while a line can be arbitrary shorter, it can’t be any longer, otherwise it’ll overflow. For this reason, the dynamic programming sets the target width to be a touch narrower than the paragraph. That way, it’s possible to both under and overshoot, leading to better balance overall. As per original article :

The browser aims to wrap each line sooner than the maximum limit of the text box. It wraps within the range, definitely after the magenta line, and definitely before the red line.

But if you subsequently justify all the way to the red line, the systematic overshoot will manifest itself as too wide inter-word space!

WebKit devs, you are awesome for shipping this feature ahead of everyone else, please fix this small wrinkle such that I can make my blog look the way I had intended all along ;-)

23

How Michael Abrash doubled Quake framerate

📝 Fabien Sanglard

🏷️ 性能优化 🏷️ 软件工程

↗ 打开原文

📌 AI 摘要: 文章讲述了Michael Abrash通过优化技术，将《雷神之锤》游戏的帧率提升了一倍。

💡 核心要点:

Michael Abrash是《雷神之锤》性能优化的关键人物。
他通过深入分析汇编代码和硬件特性进行优化。
优化工作使游戏帧率实现了翻倍的显著提升。

🧠 深度分析:

这展示了在硬件资源受限时代，底层代码优化对软件性能的决定性作用。
其优化思路和方法论对后续游戏及高性能软件开发产生了深远影响。

24

Anthropic's public benefit mission

📝 Simon Willison's Weblog

🏷️ AI/机器学习 🏷️ 技术趋势

↗ 打开原文

📌 AI 摘要: 文章揭示了Anthropic作为公益公司的具体使命声明，其措辞从2021年的“为人类文化、社会和技术进步”演变为“为人类的长期利益”。

💡 核心要点:

Anthropic是公益公司，但非非营利组织，无义务向IRS公开年度文件。
其公司注册文件显示，2021年使命声明强调AI对文化、社会、技术的改进。
年后至2024年的文件，使命声明更新为强调AI对人类的长期利益。

🧠 深度分析:

使命措辞的变化可能反映了公司战略聚焦点的演变，从宽泛的“进步”转向更具长期主义色彩的目标。
作为AI领域的重要参与者，其公开的公益使命是评估其技术发展伦理导向和长期承诺的关键参考。

📖 站内阅读原文（RSS全文）

Someone asked if there was an Anthropic equivalent to OpenAI's IRS mission statements over time .

Anthropic are a "public benefit corporation" but not a non-profit, so they don't have the same requirements to file public documents with the IRS every year.

But when I asked Claude it ran a search and dug up this Google Drive folder where Zach Stein-Perlman shared Certificate of Incorporation documents he obtained from the State of Delaware !

Anthropic's are much less interesting that OpenAI's. The earliest document from 2021 states:

The specific public benefit that the Corporation will promote is to responsibly develop and maintain advanced Al for the cultural, social and technological improvement of humanity.

Every subsequent document up to 2024 uses an updated version which says:

The specific public benefit that the Corporation will promote is to responsibly develop and maintain advanced AI for the long term benefit of humanity.

Tags: ai-ethics , anthropic , ai

25

Premium: The AI Data Center Financial Crisis

📝 Ed Zitron's Where's Your Ed At

🏷️ AI/机器学习 🏷️ 云计算

↗ 打开原文

📌 AI 摘要: 文章揭示了AI数据中心投资的财务危机，指出科技巨头为满足AI需求投入巨额资本支出，但生成式AI缺乏实质性收入，且AI实验室通过将训练成本排除在毛利率之外来粉饰财务状况，商业模式不可持续。

💡 核心要点:

自2023年以来，科技巨头资本支出超8140亿美元，大量用于满足OpenAI等公司的AI需求。
Anthropic等AI公司营收远低于亏损及融资额，例如2025年营收45亿亏损52亿，却计划再融资250亿。
AI实验室将模型训练成本排除在毛利率计算之外，若计入，Anthropic 2025年毛利率将由正转负至-53%。

🧠 深度分析:

这暴露了当前AI热潮背后的财务泡沫，巨额基础设施投资与微薄收入严重不匹配，可能引发行业调整或投资紧缩。
将训练成本视为一次性研发投入是会计误导，它实为持续运营成本，此做法扭曲了AI公司的真实盈利能力和商业模式可持续性。
投资者和行业观察者需警惕这种财务粉饰，应要求更透明的成本核算，以评估AI公司的长期生存能力。

📖 站内阅读原文（RSS全文）

Since the beginning of 2023, big tech has spent over $814 billion in capital expenditures, with a large portion of that going towards meeting the demands of AI companies like OpenAI and Anthropic. Big tech has spent big on GPUs, power infrastructure, and data center construction, using a variety of financing methods to do so, including (but not limited to) leasing. And the way they’re going about structuring these finance deals is growing increasingly bizarre. I’m not merely talking about Meta’s curious arrangement for its facility in Louisiana , though that certainly raised some eyebrows. Last year, Morgan Stanley published a report that claimed hyperscalers were increasingly relying on finance leases to obtain the “powered shell” of a data center, rather than the more common method of operating leases. The key difference here is that finance leases, unlike operating leases, are effectively long-term loans where the borrower is expected to retain ownership of the asset (whether that be a GPU or a building) at the end of the contract. Traditionally, these types of arrangements have been used to finance the bits of a data center that have a comparatively limited useful life — like computer hardware, which grows obsolete with time. The spending to date is, as I’ve written about again and again , an astronomical amount of spending considering the lack of meaningful revenue from generative AI. Even after a year straight of manufacturing consent for Claude Code as the be-all-end-all of software development resulted in putrid results for Anthropic — $4.5 billion of revenue and $5.2 billion of losses before interest, taxes, depreciation and amortization according to The Information — with ( per WIRED ) Claude Code only accounting for around $1.1 billion in annualized revenue in December, or around $92 million in monthly revenue. This was in a year where Anthropic raised a total of $16.5 billion (with $13 billion of that coming in September 2025), and it’s already working on raising another $25 billion . This might be because it promised to buy $21 billion of Google TPUs from Broadcom , or because Anthropic expects AI model training costs to cost over $100 billion in the next 3 years . And it just raised another $30 billion — albeit with the caveat that some of said $30 billion came from previously-announced funding agreements with Nvidia and Microsoft, though how much remains a mystery. According to Anthropic’s new funding announcement, Claude Code’s run rate has grown to “over $2.5 billion” as of February 12 2026 — or around $208 million. Based on literally every bit of reporting about Anthropic, costs have likely spiked along with revenue, which hit $14 billion annualized ($1.16 billion in a month) as of that date. I have my doubts, but let’s put them aside for now. Anthropic is also in the midst of one of the most aggressive and dishonest public relations campaigns in history. While its Chief Commercial Officer Paul Smith told CNBC that it was “focused on growing revenue” rather than “spending money,” it’s currently making massive promises — tens of billions on Google Cloud , “ $50 billion in American AI infrastructure ,” and $30 billion on Azure . And despite Smith saying that Anthropic was less interested in “flashy headlines,” Chief Executive Dario Amodei has said, in the last three weeks , that “ almost unimaginable power is potentially imminent ,” that AI could replace all software engineers in the next 6-12 months , that AI may (it’s always fucking may ) cause “ unusually painful disruption to jobs ,” and wrote a 19,000 word essay — I guess AI is coming for my job after all! — where he repeated his noxious line that “we will likely get a century of scientific and economic progress compressed in a decade.” Training Costs Should Be Part of AI Labs’ Gross Margins, And To Not Include Them Is Deceptive Yet arguably the most dishonest part is this word “training.” When you read “training,” you’re meant to think “oh, it’s training for something, this is an R&D cost,” when “training LLMs” is as consistent a cost as inference (the creation of the output) or any other kind of maintenance. While most people know about pretraining — the shoving of large amounts of data into a model (this is a simplification I realize) — in reality a lot of the current spate of models use post-training , which covers everything from small tweaks to model behavior to full-blown reinforcement learning where experts reward or punish particular responses to prompts. To be clear, all of this is well-known and documented, but the nomenclature of “training” suggests that it might stop one day, versus the truth: training costs are increasing dramatically, and “training” covers anything from training new models to bug fixes on existing ones. And, more fundamentally, it’s an ongoing cost — something that’s an essential and unavoidable cost of doing business. Training is, for an AI lab like OpenAI and Anthropic, as common (and necessary) a cost as those associated with creating outputs (inference), yet it’s kept entirely out of gross margins : Anthropic has previously projected gross margins above 70% by 2027, and OpenAI has projected gross margins of at least 70% by 2029, which would put them closer to the gross margins of publicly traded software and cloud firms. But both AI developers also spend a tremendous amount on renting servers to develop new models—training costs, which don’t factor into gross margins—making it more difficult to turn a net profit than it is for traditional software firms. This is inherently deceptive. While one would argue that R&D is not considered in gross margins, training isn’t gross margins — yet gross margins generally include the raw materials necessary to build something, and training is absolutely part of the raw costs of running an AI model. Direct labor and parts are considered part of the calculation of gross margin, and spending on training — both the data and the process of training itself — are absolutely meaningful, and to leave them out is an act of deception. Anthropic’s 2025 gross margins were 40% — or 38% if you include free users of Claude — on inference costs of $2.7 (or $2.79) billion, with training costs of around $4.1 billion . What happens if you add training costs into the equation? Let’s work it out! • If Anthropic’s gross margin was 38% in 2025, that means its COGS (cost of goods sold) was $2.79 billion.

• If we add training, this brings COGS to $6.89 billion, leaving us with -$2.39 billion after $4.5 billion in revenue.

• This results in a negative 53% gross margin. Training is not an up front cost , and considering it one only serves to help Anthropic cover for its wretched business model. Anthropic (like OpenAI) can never stop training, ever, and to pretend otherwise is misleading. This is not the cost just to “train new models” but to maintain current ones, build new products around them, and many other things that are direct, impossible-to-avoid components of COGS. They’re manufacturing costs, plain and simple. Anthropic projects to spend $100 billion on training in the next three years, which suggests it will spend — proportional to its current costs — around $32 billion on inference in the same period, on top of $21 billion of TPU purchases, on top of $30 billion on Azure (I assume in that period?), on top of “tens of billions” on Google Cloud. When you actually add these numbers together (assuming “tens of billions” is $15 billion), that’s $200 billion. Anthropic ( per The Information’s reporting ) tells investors it will make $18 billion in revenue in 2026 and $55 billion in 2027 — year-over-year increases of 400% and 305% respectively, and is already raising $25 billion after having just closed a $30bn deal. How does Anthropic pay its bills? Why does outlet after outlet print these fantastical numbers without doing the maths of “how does Anthropic actually get all this money?” Because even with their ridiculous revenue projections, this company is still burning cash, and when you start to actually do the maths around anything in the AI industry, things become genuinely worrying. You see, every single generative AI company is unprofitable, and appears to be getting less profitable over time. Both The Information and Wall Street Journal reported the same bizarre statement in November — that Anthropic would “turn a profit more quickly than OpenAI,” with The Information saying Anthropic would be cash flow positive in 2027 and the Journal putting the date at 2028, only for The Information to report in January that 2028 was the more-realistic figure. If you’re wondering how, the answer is “Anthropic will magically become cash flow positive in 2028”: This is also the exact same logic as OpenAI, which will, per The Information in September , also, somehow, magically turn cashflow positive in 2030: Oracle, which has a 5-year-long, $300 billion compute deal with OpenAI that it lacks the capacity to serve and that OpenAI lacks the cash to pay for, also appears to have the same magical plan to become cash flow positive in 2029 : Somehow, Oracle’s case is the most legit, in that theoretically at that time it would be done, I assume, paying the $38 billion it’s raising for Stargate Shackelford and Wisconsin, but said assumption also hinges on the idea that OpenAI finds $300 billion somehow . it also relies upon Oracle raising more debt than it currently has — which, even before the AI hype cycle swept over the company, was a lot. As I discussed a few weeks ago in the Hater’s Guide To Oracle , a megawatt of data center IT load generally costs (per Jerome Darling of TD Cowen) around $12-14m in construction (likely more due to skilled labor shortages, supply constraints and rising equipment prices) and $30m a megawatt in GPUs and associated hardware. In plain terms, Oracle (and its associated partners) need around $189 billion to build the 4.5GW of Stargate capacity to make the revenue from the OpenAI deal, meaning that it needs around another $100 billion once it raises $50 billion in combined debt, bonds, and printing new shares by the end of 2026. I will admit I feel a little crazy writing this all out, because it’s somehow a fringe belief to do the very basic maths and say “hey, Oracle doesn’t have the capacity and OpenAI doesn’t have the money.” In fact, nobody seems to want to really talk about the cost of AI, because it’s much easier to say “I’m not a numbers person” or “they’ll work it out.” This is why in today’s newsletter I am going to lay out the stark reality of the AI bubble, and debut a model I’ve created to measure the actual, real costs of an AI data center. While my methodology is complex, my conclusions are simple: running AI data centers is, even when you remove the debt required to stand up these data centers, a mediocre business that is vulnerable to basically any change in circumstances. Based on hours of discussions with data center professionals, analysts and economists, I have calculated that in most cases, the average AI data center has gross margins of somewhere between 30% and 40% — margins that decay rapidly for every day, week, or month that you take putting a data center into operation. This is why Oracle has negative 100% margins on NVIDIA’s GB200 chips — because the burdensome up-front cost of building AI data centers (as GPUs, servers, and other associated) leaves you billions of dollars in the hole before you even start serving compute, after which you’re left to contend with taxes, depreciation, financing, and the cost of actually powering the hardware. Yet things sour further when you face the actual financial realities of these deals — and the debt associated with them. Based on my current model of the 1GW Stargate Abilene data center, Oracle likely plans to make around $11 billion in revenue a year from the 1.2GW (or around 880MW of critical IT). While that sounds good, when you add things like depreciation, electricity, colocation costs of $1 billion a year from Crusoe, opex, and the myriad of other costs, its margins sit at a stinkerific 27.2% — and that’s assuming OpenAI actually pays, on time, in a reliable way. Things only get worse when you factor in the cost of debt. While Oracle has funded Abilene using a mixture of bonds and existing cashflow, it very clearly has yet to receive the majority of the $25 billion+ in GPUs and associated hardware (with only 96,000 GPUs “ delivered ”), meaning that it likely bought them out of its $18 billion bond sale from last September . If we assume that maths, this means that Oracle is paying a little less than $963 million a year ( per the terms of the bond sale ) whether or not a single GPU is even turned on, leaving us with a net margin of 22.19%... and this is assuming OpenAI pays every single bill, every single time, and there are absolutely no delays. These delays are also very, very expensive. Based on my model, if we assume that 100MW of critical IT load is operational (roughly two buildings and 100,000 GB200s) but has yet to start generating revenue, Oracle is burning, with depreciation (which starts once the chips are installed), around $4.69 million a day in cash . I have also confirmed with sources in Abilene that there is no chance that Stargate Abilene is fully operational in 2026. In simpler terms: • AI startups are all unprofitable, and do not appear to have a path to sustainability.

• AI data centers are being built in anticipation of demand that doesn’t exist, and will only exist if AI startups — which are all unprofitable — can afford to pay them.

• Oracle, which has committed to building 4.5GW of data centers, is burning cash every day that OpenAI takes to set up its GPUs, and when it starts making money, it does so from a starting position of billions and billions of dollars in debt.

• Margins are low throughout the entire stack of AI data center

内容较长，当前仅展示前 14000 字。可点击“打开原文”查看完整内容。

26

This Week on The Analog Antiquarian

📝 The Digital Antiquarian

🏷️ 其他

↗ 打开原文

📌 AI 摘要: 文章是《模拟古玩家》系列的第13章，标题为“地球的阴影”，内容未知。

💡 核心要点:

文章是系列连载的一部分。
本章标题暗示可能涉及科幻或奇幻主题。
材料仅为章节标题，无具体内容摘要。

🧠 深度分析:

由于提供的材料仅为章节标题，无法进行实质性内容分析。
读者需查阅完整文章才能了解其具体讨论的技术或文化议题。

📖 站内阅读原文（RSS全文）

Chapter 13: The Shades of the Earth

27

How can I distinguish between the numeric keypad 0 and the top-row 0 in the WM_CHAR message?

📝 The Old New Thing

🏷️ 软件工程 🏷️ 其他

↗ 打开原文

📌 AI 摘要: 文章核心指出，在WM_CHAR消息中，无法通过扩展键位直接区分小键盘0和主键盘0，但可以通过扫描码映射回虚拟键值来间接判断。

💡 核心要点:

WM_CHAR消息中，小键盘0（NumLock开）和主键盘0的wParam和扩展位完全相同。
关键区别在于扫描码，将其映射回虚拟键值（vk_from_scan）可区分是VK_INSERT（小键盘0）还是VK_0（主键盘0）。
存在其他输入方式（如Alt+数字码、输入法）产生字符‘0’，此时vk_from_scan可能对应其他键（如VK_MENU）。

🧠 深度分析:

此分析对需要精确识别物理按键来源的应用程序（如虚拟键盘、游戏、辅助工具）至关重要，能提升输入控制的准确性。
开发者需注意，依赖此方法时，对于非标准输入方式产生的字符，应设计合理的回退或通用处理逻辑，以保证兼容性。

📖 站内阅读原文（RSS全文）

Last time, we looked at how to distinguish the numeric keypad 0 and the top-row 0 in the WM_ KEYDOWN message . We may as well look at the analogous table for WM_ CHAR .

Event wParam Extended?

Numpad0 with NumLock on VK_0 0

Numpad0 with NumLock off (no WM_CHAR )

Ins key (no WM_CHAR )

0 on top row VK_0 0

I got the name VK_0 from this comment block in winuser.h .

/* * VK_0 - VK_9 are the same as ASCII '0' - '9' (0x30 - 0x39) * 0x3A - 0x40 : unassigned * VK_A - VK_Z are the same as ASCII 'A' - 'Z' (0x41 - 0x5A) */ Uh-oh. The extended bit doesn’t distinguish between the two. They both show up as VK_0 , non-extended.

What changes is something not in the above table: The scan code.

So let’s convert the scan code back to a virtual key.

auto vk_from_scan = MapVirtualKey((lParam >> 16) & 0xFF, MAPVK_VSC_TO_VK); Event wParam Extended? vk_from_scan

Numpad0 with NumLock on VK_0 0 VK_INSERT

Numpad0 with NumLock off (no WM_CHAR )

Ins key (no WM_CHAR )

0 on top row VK_0 0 VK_0

So we can infer which zero was pressed by taking the scan code, mapping it to a virtual key, and seeing whether it’s the Ins key (from the numeric keypad) or the 0 key (from the top row).

But wait, we’re not done yet.

There are ways to type the character 0 without using the numeric keypad or the top row. For example, you can hold the Alt key and then type 4 , 8 on the numeric keypad, and that will type a 0 . I tried it out, and the vk_from_scan was VK_ MENU , which is the virtual key code for the Alt key. Another way of entering a 0 is by using an input method editor (IME). Or there might be a custom keyboard layout that generates a 0 through some wacky chord sequence.

Therefore, if the vk_ from_ scan is neither VK_ INSERT nor VK_0 , you have to conclude that the 0 was entered by some means other than the numeric keypad or the top row.

The post How can I distinguish between the numeric keypad 0 and the top-row 0 in the <CODE>WM_<WBR>CHAR</CODE> message? appeared first on The Old New Thing .

28

Testing Reachy Mini - Hugging Face's Pi powered robot

📝 Jeff Geerling

🏷️ 硬件 🏷️ AI/机器学习

↗ 打开原文

📌 AI 摘要: 作者测试了Hugging Face与Pollen Robotics联合推出的Reachy Mini机器人，发现其实际部署与英伟达CEO在CES演示的流畅效果存在差距。

💡 核心要点:

作者最初认为CES上英伟达CEO演示的Reachy Mini是噱头。
实际测试发现，复现演示中的流畅交互并非“极其简单”。
该机器人由Hugging Face和Pollen Robotics合作开发，并由树莓派驱动。

🧠 深度分析:

这表明前沿AI硬件产品的演示效果与实际开发者体验可能存在显著差距，提醒开发者需理性看待技术宣传。
对于想探索具身智能或机器人应用的开发者，此案例强调了实际集成与部署的复杂性，需做好充分技术评估。

📖 站内阅读原文（RSS摘要）

When I saw Jensen Huang introduce the Reachy Mini at CES , I thought it was a gimmick. His keynote showed this little robot responding to human input, turning its head to look at a TODO list on the wall, sending emails, and turning drawings into architectural renderings with motion.

HuggingFace and Pollen robotics sent me a Reachy Mini to test, and, well, at least if you're looking to replicate that setup in the keynote, it's not, as Jensen put it, "utterly trivial now."

29

The Small Web is Tricky to Find

📝 matduggan.com

🏷️ Web开发 🏷️ 技术趋势

↗ 打开原文

📌 AI 摘要: 文章核心论述了在当今搜索引擎生态下，发现和分类非技术性的个人小网站（Small Web）存在巨大困难，这阻碍了相关工具的开发。

💡 核心要点:

作者尝试为浏览器扩展分类网站，但难以可靠识别非技术性小网站。
谷歌曾是这些小网站的唯一流量来源，其搜索功能变化后，发现机制基本失效。
WordPress和Ghost网站的非技术内容比例更高，但难以批量、可靠地获取。

🧠 深度分析:

这揭示了去中心化内容生态的一个关键瓶颈：缺乏有效的发现和分发机制，可能导致小众内容被埋没。
对开发者而言，构建依赖小网站数据的工具（如推荐引擎）面临根本性数据源挑战。
这或许会推动对替代性网站发现协议（如RSS聚合、Webring）或社区策展模式的重新关注。

📖 站内阅读原文（RSS全文）

One of the most common requests I've gotten from users of my little Firefox extension( https://timewasterpro.xyz ) has been more options around the categories of websites that you get returned. This required me to go through and parse the website information to attempt to put them into different categories. I tried a bunch of different approaches but ended up basically looking at the websites themselves seeing if there was anything that looked like a tag or a hint on each site. This is the end conclusion of my effort at putting stuff into categories. Unknown just means I wasn't able to get any sort of data about it. This is the result of me combining Ghost, Wordpress and Kagi Small Web data sources. Interestingly one of my most common requests is "I would like less technical content" which as it turns out is tricky to provide because it's pretty hard to find. They sorta exist but for less technical users they don't seem to have bought into the value of the small web own your own web domain (or if they have, I haven't been able to figure out a reliable way to find them). This is an interesting problem, especially because a lot of the tools I would have previously used to solve this problem are....basically broken. It's difficult for me to really use Google web search to find anything at this point even remotely like "give me all the small websites" because everything is weighted to steer me away from that towards Reddit. So anything that might be a little niche is tricky to figure out. Interesting findings So there's no point in building a web extension with a weighting algorithm to return less technical content if I cannot find a big enough pool of non-technical content to surface. It isn't that these sites don't exist its just that we never really figured out a way to reliably surface "what is a small website". So from a technical perspective I have a bunch of problems. • First I need to reliably sort websites into a genre, which can be a challenge when we're talking about small websites because people typically write about whatever moves them that day. Most of the content on a site might be technical, but some of it might not be. Big sites tend to be more precise with their SEO settings but small sites that don't care don't do that, so I have fewer reliable signals to work with.

• Then I need to come up with a lot of different feeding systems for independent websites. The Kagi Small Web was a good starting point, but Wordpress and Ghost websites have a much higher ratio of non-technical content. I need those sites, but it's hard to find a big batch of them reliably.

• Once I have the type of website as a general genre and I have a series of locations, then I can start to reliably distribute the types of content you get. I think I can solve....some of these, but the more I work on the problem the more I'm realizing that the entire concept of "the small web" had a series of pretty serious problems. • Google was the only place on Earth sending any traffic there

• Because Google was the only one who knew about it, there never needed to be another distribution system

• Now that Google is broken, it's almost impossible to recreate that magic of becoming the top of list for a specific subgenre without a ton more information than I can get from public records.

30

Gadget Review: Topdon TS004 Thermal Monocular ★★★★⯪

📝 Terence Eden’s Blog

🏷️ 硬件 🏷️ 工具

↗ 打开原文

📌 AI 摘要: 本文是对Topdon TS004热成像单目镜的评测，核心结论是：这是一款专为观察野生动物设计的优秀硬件，成像流畅、易于使用，但存在UI烧入图像、AI识别不精准等软件层面的小瑕疵。

💡 核心要点:

硬件坚固，人体工学设计良好，配备USB-C和标准三脚架接口。
热成像分辨率为256x192，视频流畅（42.187 FPS），内置约30GB存储空间。
配套App可实现Wi-Fi实时图传，但在Linux上仅识别为U盘，无法作为网络摄像头使用。

🧠 深度分析:

作为一款面向野生动物观察的消费级热成像设备，其高刷新率和良好的人体工学设计提升了户外使用的核心体验，但UI烧入图像、缺乏GPS和时区问题表明其软件与生态整合仍有不足。
制造商对部分缺点的回应（如单位制切换将通过固件更新解决）显示了其响应态度，但AI识别能力受硬件所限的回复，也暗示了消费级与专业测量设备在定位上的明确区分。
对于技术爱好者或户外探索者，该设备提供了便捷的热成像能力，但近400英镑的售价和软件细节的粗糙度，要求用户在购买前权衡其核心功能与对完美体验的期待。

📖 站内阅读原文（RSS全文）

I love thermal imaging cameras. They're great for spotting leaking pipes, inefficient appliances, and showing how full a septic tank is. The good folks at Topdon have sent me their latest thermal camera to review - it is specifically designed for spotting wildlife.

This is the TS004 Thermal Monocular :

Let's put it through its paces!

Hardware

This is a chunky bit of kit and fits nicely in the hand. It's well weighted and feels sturdy.

The rubber seal fits tightly around your eye and is excellent at keeping light out. The screen is set a little way back, so is easy to focus on. Taking a photo of the screen itself was a little tricky - here's what you can expect to see when using the settings menu:

The focus knob near the viewfinder is a little stiff, but it turns silently.

There's a rubber lens cover which is attached and can be easily tucked away next to the standard tripod mount. It comes with a lanyard strap, so you're unlikely to drop it. The buttons are well spaced and respond quickly.

The USB-C port has a rubber flap to keep out moisture.

OK, let's take some snaps!

Photos

Photo quality is pretty good - although limited by the technology behind the thermal sensor. The TS004 has a thermal resolution of 256x192 and images are upscaled to 640x480.

One thing to note, the user-interface is burned in to the photos. So if you want the battery display on screen, it will also appear on the photo. Similarly, things like the range-finder appear in the image.

There's a reasonable AI built in. It is designed to tell you what sort of wildlife you've spotted. In some cases, it is pretty accurate! A woman walked by me while I was looking for wildlife - here's her photo:

Nifty!

Here's a photo of a fox:

There are remarkably few wild boars in London!

Video

Video is also 640x480. It is a very smooth 42.187 FPS and a rather chunky 2,162 Kbps - leading to a file size of around 20MB per minute. With around 30GB of in-built storage, that shouldn't be a problem though. There's no audio available and, just like the photos, the UI is burned into the picture.

Here are a couple of sample videos I shot. In them, I cycle through the colour modes and zoom levels.

First, an urban fox foraging in London:

https://shkspr.mobi/blog/wp-content/uploads/2026/02/fox.mp4

Second, some parakeets flapping around a tree:

https://shkspr.mobi/blog/wp-content/uploads/2026/02/Birds-In-Flight.mp4

I'm impressed with the smoothness of the video and how well it picks up heat even from relatively far away.

Linux

Bizarrely, on Linux it shows up as 1d6b:0101 Linux Foundation Audio Gadget . It presents as a standard USB drive and you can easily copy files to and from it. 100% compatibility!

You can't use it as a WebCam - for anything more complicated than copying files, you need to use the official app.

App

The TopInfrared App for Android is reasonably good. It connects to the camera via WiFi and offers some useful features. Most impressively, it live-streams the camera's view to your phone.

From there you can take photos or videos and have them saved straight onto your device. Handy if you've set the camera up outside and want to view it from somewhere warmer.

Frustratingly, it isn't possible to set all the options on the camera using the app. For that you need to go back to the menu on the camera - which is slightly laborious.

The app isn't mandatory for most operations - thankfully - but it is the only way to set the time and date on the monocular. You will also need it if there are any firmware updates.

If you don't need the app, you can turn off the WiFi to save some battery life.

Drawbacks

The device works - and is great for wildlife spotting - but there are a few little niggles. I've fed these back to the manufacturer and have included their responses.

• There's no EXIF in the photos, or any way to get thermal data out of the images.

• "These products focus on image clarity, high sensitivity, and low latency. For example, temperature-measurement thermal cameras typically run at 25 Hz, while the TS004 operates at 50 Hz for smoother viewing. Devices that include EXIF temperature data, raw thermal export, and analytical tools are measurement-focused thermal cameras, which are based on a different design and use case."

• As mentioned, having the UI burned into the photos and videos is slightly annoying.

• You can turn off the UI elements on screen which stops them appearing in the photo.

• The range-finder only works in yards and, while seemingly accurate, isn't overly helpful to those of us who think in metric!

• "Unit switching will be available in the March firmware update"

• Once you sync the time with the monocular, all the filenames are timestamped like 2026_02_09_12345678 but it appears to be hardwired to Hong Kong Time (UTC+8) - so your dates and times might be a little out.

• "We will investigate it and see if it can be implemented in a future update"

• The AI detection feature doesn't seem particularly tuned for the UK.

• "Due to hardware limitations, the current recognition is relatively basic, so there is limited room for significant improvement"

In terms of hardware limitations, there's no GPS. I would expect a device in this price-range to have basic GPS functionality to allow you to easily tag photos.

None of these are show-stoppers, but for a device this expensive they are an annoyance.

Price

OK, so you want to spot birds in trees and wild boars foraging in the forest - what'll this cost you?

Close to £400 - you can use code TERENCE15 for a 15% discount until 16 February 2026.

The price of thermal imaging equipment is high and this is a fairly niche form-factor. It is easy to use, has a great range, and the rubber eyepiece is much nicer than staring at a bright phone screen. The battery life is excellent and you certainly can't complain about the generous storage space.

There are some minor irritations as discussed above, but it is an exceptional bit of kit if you like to explore the environment. Are you going to spot any cryptids with it? Who knows! But you'll have lots of fun discovering the natural world around you.

31

Factional Drift: We cluster into factions online

📝 iDiallo.com

🏷️ 软件工程 🏷️ 其他

↗ 打开原文

📌 AI 摘要: 文章核心揭示了在线讨论中，参与者会基于身份认同而非观点自发形成派系，导致讨论主题发生横向或纵向的偏移。

💡 核心要点:

作者通过三个案例（遥控器、网页报价、AI）展示了在线讨论如何分裂为基于身份（如技术爱好者、公寓住户）或知识框架（如经济学家、伦理学家）的派系。
派系形成过程是自发的，由用户回复共鸣评论及平台算法推动，而非主动选择。
讨论主题会因此发生偏移：遥控器案例是横向偏移（按生活经验），报价案例是纵向偏移（按知识框架深入）。

🧠 深度分析:

这种现象可能导致在线社区讨论失焦，加剧回声室效应，使建设性对话和共识达成变得困难。
对于社区管理者和内容平台，理解这种‘派系漂移’有助于设计更好的讨论引导机制，例如通过结构化提问或派系标签来管理对话流向。
内容创作者可以预判不同身份群体的关注点，从而更有效地参与或引导讨论，避免被单一派系的声音淹没。

📖 站内阅读原文（RSS全文）

Whenever one of my articles reaches some popularity, I tend not to participate in the discussion. A few weeks back, I told a story about me, my neighbor and an UHF remote . The story took on a life of its own on Hackernews before I could answer any questions. But reading through the comment section, I noticed a pattern on how comments form. People were not necessarily talking about my article. They had turned into factions.

This isn't a complaint about the community. Instead it's an observation that I've made many years ago but didn't have the words to describe it. Now I have the articles to explore the idea.

The article asked this question: is it okay to use a shared RF remote to silence a loud neighbor ? The comment section on hackernews split into two teams. Team Justice, who believed I was right to teach my neighbor a lesson. And then Team Boundaries, who believed I was “a real dick”. But within hours, the thread stopped being about that question. People self-sorted into tribes, not by opinion on the neighbor, but by identity.

The tinkerers joined the conversation. If you only looked through the comment section without reading the article, you'd think it was a DIY thread on how to create an UHF remote. They turned the story into one about gadget showcasing. TV-B-Gone, Flipper Zeros, IR blasters on old phones, a guy using an HP-48G calculator as a universal remote. They didn't care about the neighbor. They cared about the hack.

Then came the apartment warriors. They bonded over their shared suffering experienced when living in an apartment. Bad soundproofing, cheap landlords, one person even proposed a tool that doesn't exist yet, a "spirit level for soundproofing". The story was just a mirror for their own pain.

The diplomats quietly pushed back on the whole premise. They talked about having shared WhatsApp groups, politely asking, and collective norms. A minority voice, but a distinct one.

Why hack someone when you can have a conversation?

The Nostalgics drifted into memories of old tech. HAM radios, Magnavox TVs, the first time a remote replaced a channel dial. Generational gravity.

Back in my days...

Nobody decided to join these factions. They just replied to the comment that felt like their world, and the algorithm and thread structure did the rest. Give people any prompt, even a lighthearted one, and they will self-sort. Not into "right" and "wrong," but into identity clusters. Morning people find morning people. Hackers find hackers. The frustrated find the frustrated. You discover your faction. And once you're in one, the comments from your own tribe just feel more natural to upvote.

This pattern might be true for this article, but what about others? I have another article that has gone viral twice . On this one the question was: Is it ethical to bill $18k for a static HTML page?

Team Justice and Team Boundaries quickly showed up. "You pay for time, not lines of code." the defenders argued. "Silence while the clock runs is not transparent." the others criticized. But then the factions formed. People self-sorted into identity clusters, each cluster developed its own vocabulary and gravity, and the original question became irrelevant to most of the conversation.

Stories about money and professional life pull people downward into frameworks and philosophy.

The pricing philosophers exploded into a deep rabbit hole on Veblen goods, price discrimination, status signaling, and perceived value. Referenced books, studies, and the "I'm Rich" iPhone app. This was the longest thread.

The corporate cynics shared war stories about use-it-or-lose-it budgets, contractors paid to do nothing, and organizational dysfunction. Veered into a full government-vs-corporations debate that lasted dozens of comments.

The professional freelancers dispensed practical advice. Invoice periodically, set scope boundaries, charge what you're worth. They drew from personal contractor experience.

The ethicists genuinely wrestled with whether I did the right thing. Not just "was it legal" but "was it honest." They were ignored.

The psychology undergrads were fascinated by the story. Why do people Google during a repair job and get fired? Why does price change how you perceive quality? Referenced Cialdini's "Influence" and ran with it.

Long story short, a jeweler was trying to move some turquoise and told an assistant to sell them at half price while she was gone. The assistant accidentally doubled the price, but the stones still sold immediately.

The kind of drift between the two articles was different. The remote thread drifted laterally: people sorted by life experience and hobby (gadget lovers found gadget lovers, apartment sufferers found apartment sufferers). The $18k thread drifted deep: people sorted by intellectual framework (economists found economists, ethicists found ethicists, corporate cynics found corporate cynics). The $18k thread even spawned nested debates within subfactions. The Corporate Cynics thread turned into a full government-vs-corporations philosophical argument that had nothing to do with me or the article.

But was all this something that just happens with my articles? I needed an answer. So I picked a recent article I enjoyed by Mitchell Hashimoto . And it was about AI, so this was perfect to test if these patterns exist here as well.

Now here is a respected developer who went from AI skeptic to someone who runs agents constantly. Without hype, without declaring victory, just documenting what worked. The question becomes: Is AI useful for coding, or is it hype?

The result wasn't entirely binary. I spotted 3 groups at first. Those in favor said: "It's a tool. Learn to use it well." Those against it said: "It's slop. I'm not buying it." But then a third group. The fence-sitters (I'm in this group): "Show me the data. What does it cost?"

And then the factions appeared.

The workflow optimizers used the article as a premise to share their own agent strategy. Form an intuition on what the agent is good at, frame and scope the task so that it is hard for the AI to screw up, small diffs for faster human verification.

The defenders of the craft dropped full on manifestos. “AI weakens the mind” then references The Matrix. "I derive satisfaction from doing something hard." This group isn't arguing AI doesn't work. They're arguing it shouldn't work, because the work itself has intrinsic value.

The history buffs joined the conversation. There was a riff on early aircraft being unreliable until the DC-3, then the 747. Architects moving from paper to CAD. They were framing AI adoption as just another tool transition in a long history of tool transitions. They're making AI feel inevitable, normal, obvious.

The Appeal-to-Mitchell crowd stated that Mitchell is a better developer than you. If he gets value out of these tools you should think about why you can't. The flamewar kicked in! Someone joked:

"Why can't you be more like your brother Mitchell?"

The Vibe-code-haters added to the conversation. The term 'vibe coding' became a battleground. Some using it mockingly, some trying to redefine it. There was an argument that noted the split between this thread (pragmatic, honest) and LinkedIn (hyperbolic, unrealistic).

A new variable from this thread was the author's credibility, plus he was replying in the threads. Unlike with my articles, the readers came to this thread with preconceived notions. If I claimed that I am now a full time vibe-coder, the community wouldn't care much. But not so with Mitchell.

The quiet ones lose. The Accountants, the Fence-Sitters, they asked real questions and got minimal traction. "How much does it cost?" silence. "Which tool should I use?" minimal engagement. The thread's energy went to the factions that told a better story.

One thing to note is that the Workflow Optimizers weren't arguing with the Skeptics. The Craft Defenders weren't engaging with the Accountants. Each faction found its own angle and stayed there. Just like the previous threads.

Three threads. Three completely different subjects: a TV remote story, an invoice story, an AI adoption guide. Every single one produced the same underlying architecture. A binary forms. Sub-factions drift orthogonally. The quiet ones get ignored. The entertaining factions win.

The type of drift changes based on the article. Personal anecdotes (TV remote) pull people sideways into shared experience. Professional stories ($18k invoice) pull people down into frameworks. Prescriptive guides (AI adoption) pull people into tactics and philosophy. But the pattern, like the way people self-sort, the way factions ignore each other, the way the thread fractures, this remained the same.

The details of the articles are not entirely relevant. Give any open-ended prompt to a comment section and watch the factions emerge. They're not coordinated. They're not conscious. They just... happen. For example, the Vibe-Code Haters faction emerged around a single term "vibe coding." The semantic battle became its own sub-thread. Language itself became a faction trigger.

Now that you spotted the pattern, you can't unsee it. That's factional drift.

32

Pluralistic: Trump antitrust is dead (13 Feb 2026)

📝 Pluralistic: Daily links from Cory Doctorow

🏷️ 技术趋势 🏷️ 其他

↗ 打开原文

📌 AI 摘要: 文章核心结论是，特朗普时期的反垄断政策已名存实亡，其所谓的“民粹右翼”反垄断运动因本质是交易性的政治表演而注定失败，大型企业通过贿赂和谄媚即可规避监管。

💡 核心要点:

特朗普反垄断机构负责人盖尔·斯莱特被亲信帕姆·邦迪排挤并最终去职，标志其政策终结。
科技巨头通过向MAGA影响者行贿和公开支持特朗普，轻松化解了反垄断压力。
斯莱特任内批准了多起损害公众利益的重大并购，如HPE/瞻博网络、Discover/第一资本等。

🧠 深度分析:

这表明将反垄断建立在政治恩怨而非结构性权力批判上是脆弱的，为利益交换留下了空间，削弱了监管的公正性与效力。
大型科技公司通过政治献金和公开站队来俘获监管，这种模式可能加剧市场垄断，损害消费者权益与市场竞争。
文章警示，有效的反垄断需要超越派系 grievances，聚焦于权力集中本身，否则监管极易被资本腐蚀。

📖 站内阅读原文（RSS全文）

->->->->->->->->->->->->->->->->->->->->->->->->->->->->->

Top Sources: None

-->

Today's links

• Trump antitrust is dead : The "populist right" was doomed to fail.

• Hey look at this : Delights to delectate.

• Object permanence : Premature internet activists; Privacy Without Monopoly; "Broad Band"; Yazidi supersoldiers; I was a Jeopardy! clue.

• Upcoming appearances : Where to find me.

• Recent appearances : Where I've been.

• Latest books : You keep readin' em, I'll keep writin' 'em.

• Upcoming books : Like I said, I'll keep writin' 'em.

• Colophon : All the rest.

Trump antitrust is dead ( permalink )

Remember when the American right decided that it hated (some) big businesses, specifically Big Tech? A whole branch of the Trump coalition (including JD Vance, Matt Gaetz and Josh Hawley) declared themselves to be "Khanservatives," a cheering section for Biden's generationally important FTC commissioner Lina Khan:

https://www.fastcompany.com/91156980/trump-vp-pick-j-d-vance-supports-big-tech-antitrust-crackdown

Trump owes his power to his ability to bully and flatter a big, distrustful coalition of people who mostly hate each other into acting together, like the business lobby and the grievance-saturated conspiratorialists who hate Big Tech because they were momentarily prevented from calling for genocide or peddling election disinformation:

https://pluralistic.net/2025/07/18/winning-is-easy/#governing-is-harder

The best framing for the MAGA war on Big Tech comes from Trashfuture's Riley Quinn, who predicted that the whole thing could be settled by tech companies' boards agreeing to open every meeting with a solemn "stolen likes acknowledgment" that made repentance for all the shadowbanned culture warriors whose clout had been poached by soy content moderators.

And that's basically what happened. Trump's antitrust agencies practiced "boss politics antitrust" in which favored courtiers were given free passes to violate the law, while Trump's enemies were threatened with punitive antitrust investigations until they fell into line:

https://pluralistic.net/2025/07/29/bondi-and-domination/#superjove

Trump's antitrust boss Gail Slater talked a big game about "Trump Antitrust" but was thwarted at every turn by giant corporations who figured out that if they gave a million bucks to a MAGA podcaster, they could go over Slater's head and kill her enforcement actions. When Slater's deputy, Roger Alford, went public to denounce the sleazy backroom dealings that led to the approval of the HPE/Juniper merger, he was forced out of the agency altogether and replaced with a Pam Bondi loyalist who served as a kind of politburo political officer in Slater's agency:

https://abovethelaw.com/2025/08/former-maga-attorney-goes-scorched-earth-with-corruption-allegations-in-antitrust-division/

Bondi made no secret of her contempt for Slater, and frequently humiliated her in public. Now it seems that Bondi has gotten tired of this game and has forced Slater out altogether. As ever, Matt Stoller has the best analysis of how this happened and what it means:

https://www.thebignewsletter.com/p/trump-antitrust-chief-ousted-by-ticketmaster

Stoller's main thesis is that the "conservative populist" movement only gained relevance by complaining about "censorship of conservatives" on the Big Tech platforms. While it's true that the platforms constitute an existential risk to free expression thanks to their chokehold over speech forums, it was always categorically untrue that conservatives were singled out by tech moderators:

https://pluralistic.net/2022/12/10/e2e/#the-censors-pen

Conservative populists' grievance-based politics is in contrast with the progressive wing of the anti-monopoly movement, which was concerned with the idea of concentrated power itself, and sought to dismantle and neuter the power of the business lobby and the billionaires who ran it:

https://pluralistic.net/2022/02/20/we-should-not-endure-a-king/

The problem with conservative populism, then, is that its movement was propelled by the idea that Big Tech was soy and cucked and mean to conservatives. That meant that Big Tech bosses had an easy path out of its crosshairs: climb into the tank for MAGA.

That's just what they did: Musk bought Twitter; Zuck ordered his content moderators to censor the left and push MAGA influencers; Bezos neutered his newspaper in the run up to the 2024 elections; Tim Cook hand-assembled a gold participation trophy for Trump live on camera. These CEOs paid a million dollars each for seats on Trump's inauguration dais and their companies donated millions for Trump's Epstein Memorial Ballroom.

Slater's political assassination merely formalizes something that's been obvious for a year now: you can rip off the American people with impunity so long as you flatter and bribe Trump.

The HP/Juniper merger means that one company now supplies the majority of commercial-grade wifi routers, meaning that one company now controls all the public, commercial, and institutional internet you'll ever connect to. The merger was worth $14b, and Trump's trustbusters promised to kill it. So the companies paid MAGA influencer Mike Davis (who had publicly opposed the merger) a million bucks and he got Trump to overrule his own enforcers. Getting your $14b merger approved by slipping a podcaster a million bucks is a hell of a bargain.

HP/Juniper were first, but they weren't the last. There was the Discover/Capital One merger, which rolled up the two credit cards that low-waged people rely on the most, freeing the new company up for even more predatory practices, price-gouging, junk-fees, and strong-arm collections. When the bill collectors are at your door looking for thousands you owe from junk fees, remember that it was Gail Slater's weakness that sent them there:

https://www.nytimes.com/2025/04/03/business/dealbook/capital-one-discover-merger.html

Slater also waved through the rollup of a string of nursing homes by one of the world's most notoriously greedy and cruel private equity firms, KKR. When your grandma dies of dehydration in a dirty diaper, thank Gail Slater:

https://pluralistic.net/2023/05/09/dingo-babysitter/#maybe-the-dingos-ate-your-nan

Slater approved the merger of Unitedhealth – a company notorious for overbilling the government while underdelivering to patients – with Amedisys, who provide hospice care and home health help:

https://www.justice.gov/opa/pr/justice-department-requires-broad-divestitures-resolve-challenge-unitedhealths-acquisition

The hits keep coming. Want to know why your next vacation was so expensive? Thank Slater for greenlighting the merger of American Express Global Business Travel and CWT Holdings, which Slater challenged but then dropped, reportedly because MAGA influencer Mike Davis told her to.

Davis also got Slater to reverse her opposition to the Compass/Anywhere Real Estate merger, which will make America's dysfunctional housing market even worse:

https://www.wsj.com/us-news/law/real-estate-brokerages-avoided-merger-investigation-after-justice-department-rift-e846c797?gaa_at=eafs&gaa_n=AWEtsqdSXg4z1XPl2UpqdHR4V2-sNj9M7oDcWHscPIXuSU-5n0gtYEv8Q5XZG7qtzfY%3D&gaa_ts=698e44a6&gaa_sig=IO7tWGaHZSYER64YyUzyoiVtrOKR77ZsYMMOdwN1P7koRt9zXYRJ1hxw2oDU9cD40-aGgHHVfwMWg14olFwNaw%3D%3D

It's not just homebuyers whose lives are worse off because of Slater's failures, it's tenants, too. Slater settled the DoJ's case against Realpage, a price-fixing platform for landlords that is one of the most culpable villains in the affordability crisis. Realpage was facing an existential battle with the DoJ; instead, they got away with a wrist-slap and (crucially) are allowed to continue to make billions helping landlords rig the rental market against tenants.

So Slater's defenestration is really just a way of formalizing Trump's approach to antitrust: threaten and prosecute companies that don't bend the knee to the president, personally…and allow companies to rob the American people with impunity if they agree to kick up a percentage to the Oval Office.

But while Slater will barely rate a footnote in the history of the Trump administration, the precipitating event for her political execution is itself very interesting. Back in September, Trump posed with Kid Rock and announced that he was going after Ticketmaster/Live Nation, a combine with a long, exhaustively documented history of ripping off and defrauding every entertainer, fan and venue in America:

https://www.pbs.org/newshour/nation/ftc-sues-ticketmaster-saying-it-uses-illegal-tactics-to-make-fans-pay-more-for-live-events

At the time, it was clear that Trump had been prodded into action by two factors: the incredible success of the Mamdani campaign's focus on "affordability" (Ticketmaster's above-inflation price hikes are one of the most visible symptoms of the affordability crisis) and Kid Rock's personal grievances about Ticketmaster.

Kid Rock is the biggest-name entertainer in the Trump coalition, the guy Trump got to headline a MAGA halftime show that notably failed to dim Bad Bunny's star by a single milliwatt. Trump – a failed Broadway producer – is also notoriously susceptible to random pronouncements by celebrities (hence the Fox and Friends-to-Trump policy pipeline), so it's natural that Kid Rock's grousing got action after decades of documented abuses went nowhere.

Ticketmaster could have solved the problem by offering to exempt Trump-loyal entertainers from its predatory practices. They could have announced a touring Trumpapalooza festival headlined by Kid Rock, Christian rock acts, and AI-generated country singers, free from all junk fees. Instead, they got Gail Slater fired.

Mike Davis doesn't just represent HPE/Juniper, Amex travel, and Compass/Anywhere – he's also the fixer that Ticketmaster hired to get off the hook with the DoJ. He's boasting about getting Slater fired:

https://x.com/gekaminsky/status/2022076364279755066

And Ticketmaster is off the hook:

https://prospect.org/2026/02/12/trump-justice-department-ticketmaster-live-nation-monopoly/

What's interesting about all this is that there were elements of the Biden coalition that also hated antitrust (think of all the Biden billionaires who called for Lina Khan to be fired while serving as "proxies" for Kamala Harris). And yet, Biden's trustbusters did more in four short years than their predecessors managed over the preceding forty.

Stoller's theory is that the progressive anti-monopoly movement (the "Brandeisians") were able to best their coalitional rivals because they did the hard work of winning support for the idea of shattering corporate power itself – not just arguing that corporate power was bad when it was used against them.

This was a slower, harder road than dividing up the world into good monopolies and bad ones, but it paid off. Today the Brandeisians who made their bones under Biden are serving the like of Mamdani:

https://pluralistic.net/2025/11/15/unconscionability/#standalone-authority

And their ideas have spread far and wide – even to other countries:

https://lewisforleader.ca/ideas/public-options-full-plan/

They lit a fire that burns still. Who knows, maybe someday it'll even help Kid Rock scorch the Ticketmaster ticks that are draining his blood from a thousand tiny wounds. He probably won't have the good manners to say thank you.

Hey look at this ( permalink )

• PROPOSAL FOR A STUDY ON TYPES OF BUSINESS MODELS AND ECONOMIC OPPORTUNITIES CREATED BY AND THROUGH THE IMPLEMENTATION OF TECHNOLOGICAL PROTECTION MEASURES (TPMs) https://www.wipo.int/edocs/mdocs/copyright/en/sccr_47/sccr_47_12.pdf

• Wes Cook and the Centralia McDonald's Mural https://cabel.com/wes-cook-and-the-mcdonalds-mural/

• why this, why now, why not? https://backofmind.substack.com/p/why-this-why-now-why-not

• Peter Mandelson Invokes Press Harassment Protections To Dodge Questions About His Support Of Jeffrey Epstein https://www.techdirt.com/2026/02/11/peter-mandelson-invokes-press-harassment-protections-to-dodge-questions-about-his-support-of-jeffrey-epstein/

• The Philosophical Prospects of Large Language Models in the Future of Mathematics https://mxphi.com/wp-content/uploads/2026/02/FT.pdf

Object permanence ( permalink )

#20yrsago Google Video DRM: Why is Hollywood more important than users? https://memex.craphound.com/2006/02/13/google-video-drm-why-is-hollywood-more-important-than-users/

#20yrsago Phishers trick Internet “trust” companies https://web.archive.org/web/20060222232249/http://blog.washingtonpost.com/securityfix/2006/02/the_new_face_of_phishing_1.html

#15yrsago With a Little Help: first post-publication progress report https://www.publishersweekly.com/pw/by-topic/columns-and-blogs/cory-doctorow/article/46105-with-a-little-help-the-early-returns.html

#15yrsago Nokia’s radical CEO has a mercenary, checkered past https://web.archive.org/web/20100608100324/http://www.siliconbeat.com/2008/01/11/microsoft-beware-stephen-elop-is-a-flight-risk/

#15yrsago Scientology’s science fictional origins: thesis from 1981 https://web.archive.org/web/20110218045653/http://digitalcommons.mcmaster.ca/opendissertations/126/

#10yrsago I was a Jeopardy! clue https://memex.craphound.com/2016/02/13/i-was-a-jeopardy-clue/

#10yrsago Liberated Yazidi sex slaves become a vengeful, elite anti-ISIS fighting force https://www.independent.co.uk/news/world/middle-east/isis-yazidi-sex-slaves-take-up-arms-for-mosul-fight-to-bring-our-women-home-a6865056.html

#10yrsago Listen: a new podcast about science fiction and spectacular meals https://www.scottedelman.com/2016/02/10/the-first-episode-of-eating-the-fantastic-with-guest-sarah-pinsker-is-now-live/

#10yrsago Politician given green-light to name developer’s new streets with synonyms for greed and deceit https://web.archiv

内容较长，当前仅展示前 14000 字。可点击“打开原文”查看完整内容。

33

Members only: "Won't Fix" self help

📝 Westenberg.

🏷️ 软件工程 🏷️ 职业发展

↗ 打开原文

📌 AI 摘要: 文章提出一种名为“不予修复”的自我管理新思路，主张将个人难以改变的核心特质视为软件中的“不予修复”缺陷，通过构建“包装层”来适配外界，而非徒劳地彻底改造自我。

💡 核心要点:

当代自助产业主要分为斯多葛式接纳与无限优化两大对立阵营。
个人核心特质如同老代码库中的‘不予修复’缺陷，改变成本极高且效果甚微。
与其彻底重构自我，不如采用‘包装模式’，在旧有特质与外部需求间建立适配层。

🧠 深度分析:

该观点挑战了自助产业‘人皆可被完美改造’的核心商业逻辑，可能引导人们将精力从自我批判转向更务实的策略管理。
将软件工程概念应用于个人发展，为管理长期行为模式提供了具体、可操作的方法论，如为迟到倾向设置‘时间缓冲包装’。
它促使人们重新评估‘问题’的定义，许多‘缺陷’在另一套评价体系下可能是优势，这有助于减少不必要的自我消耗与焦虑。

📖 站内阅读原文（RSS全文）

Every major self-help framework of the last two decades falls into one of two camps. • Camp one is Stoic Acceptance: your problems are features, not bugs, and the path to contentment runs through radical non-resistance.

• Camp two is Relentless Optimization: your problems are solvable if you wake up at 4:30 AM, track your macros, journal with intention, and subscribe to the right Substack. You have Marcus Aurelius on one side, Tony Robbins on the other, and a $13.4 billion personal development industry filling the gap between them with courses and coaching programs and hardcover books that all say some version of the same thing: you can // should // must be fixed. I'd like to propose a third option: the reasonable // rational recognition that most of your personal flaws are "Won't Fix" bugs, and the single most productive thing you can do about them is stop trying to patch them. The "Won't Fix" resolution Tagging a bug "Won't Fix" doesn't mean it isn't real and it doesn't mean nobody noticed; it means the cost of fixing it exceeds the benefit, or the fix would introduce worse instabilities elsewhere, or the system has already built so many dependencies around the bug that it's become, functionally, a feature. Every codebase of sufficient age accumulates these. They're documented, acknowledged, and largely left alone so the engineers can go build something useful. Note (and this is important): you are a codebase of sufficient age. The self-help industry's entire business model depends on convincing you that every single bug in your system is fixable, that with the right framework, the right habits, the right coach, you'll finally refactor yourself into a clean, well-architected human being. But how many of your core personality traits have actually changed in the last decade? The honest answer, for most people, is...very fucking few. Why refactoring yourself is a terrible use of resources Fred Brooks observed that when engineers build the second version of a system, they tend to over-design it, cramming in every feature and fix they wished they'd included the first time. The result is almost always bloated, late, and worse than what it replaced. The lesson developers have extracted from fifty years of living with Brooks's observation is simple: don't rewrite from scratch, and work with what you have. Self-improvement culture is a perpetual second-system rewrite of the self. You're constantly trying to architect Human 2.0, the version of you that's disciplined and calm and focused and doesn't check their phone 96 times a day (which is, by the way, the actual average for American adults, according to Asurion's widely cited research). But Human 2.0 never ships. You keep accumulating half-finished refactors and abandoned meditation streaks alongside a growing sense that something is fundamentally wrong with your willpower. The alternative is the wrapper pattern. When you have a piece of legacy code that works but has an ugly interface, you don't rewrite it. You write a thin layer around it, a wrapper, that presents a clean interface to the rest of the system while leaving the messy internals untouched. The legacy code keeps doing what it always did, and the wrapper translates between the old system and the new requirements. What wrappers look like in practice The Acceptance camp says: release your attachment to punctuality. The Optimization camp says: build a 47-step morning routine with buffer time calculated to the minute. Won't Fix says: you're going to be late, so build a wrapper. Tell people 2 PM when you mean 2:30. Set your clocks ahead. Automate your calendar reminders to fire 15 minutes earlier than the defaults. You haven't changed yourself, but you've written an adapter layer between your actual personality and the world's expectations. Epictetus, who spent years as a slave in Rome before gaining his freedom and building one of the most influential schools of Stoic philosophy, would probably say that this approach surrenders moral agency. You're supposed to become virtuous, not fake it with systems. And there's something to that objection. But Epictetus also spent most of his philosophy drawing sharp lines between what you can and can't control. Is your core temperament something you control? The Big Five personality traits (openness, conscientiousness, extraversion, agreeableness, neuroticism) show remarkable stability across adult lifespans, according to decades of longitudinal research in personalty psychology. You can nudge them, but you can't overhaul them. If Epictetus were working in DevOps, I suspect he'd be a wrapper advocate. Giving up correctly is its own liberation In Kazuo Ishiguro's The Remains of the Day Stevens, the butler, reflects on the decades he spent perfecting his professional dignity at the expense of, well, everthing else. His entire life was a refactoring project: eliminate the personal, optimize for service, become the ideal version of what a butler should be. By the end of the novel he's technically excellent and profoundly diminished. He optimized the wrong thing for forty years because he never stopped to ask whether the specification itself was flawed. Won't Fix is the practice of questioning the specification. Most of the things you're trying to fix about yourself are only problems relative to some imagined ideal of a person you were never going to be. Your distractibility is a bug in the "focused knowledge worker" spec but might be a feature in the "person who notices interesting things and connects them unexpectedly" spec. Your sensitivity and your stubbornness, your tendency to monologue about niche topics at parties: all Won't Fix, and all load-bearing, and all probably okay in the big, heat-death-of-the-universe scheme of all things. Stop trying to ship Human 2.0. Tag the bugs, write the wrappers, and get back to building something worth building. The most productive version of you probably looks a lot like the current version of you, plus a few well-placed adapter patterns and minus about thirty self-help books worth of guilt about not being somone else.

34

Expressing a prime as the sum of two squares

📝 John D. Cook

🏷️ 软件工程 🏷️ 其他

↗ 打开原文

📌 AI 摘要: 文章讨论了费马平方和定理（奇素数可表为两平方和当且仅当模4余1），并对比了高斯公式与Stan Wagon算法在计算该表示时的效率差异。

💡 核心要点:

费马定理的‘仅当’方向证明简单，‘当’方向证明较难。
高斯公式虽优美，但计算复杂度为O(p)，实际计算价值低。
Wagon算法结合二次非剩余和欧几里得算法，效率远高于高斯公式。

🧠 深度分析:

该对比凸显了理论优美性与计算实用性之间的权衡，对算法设计有启发意义。
Wagon算法展示了将数论定理转化为高效计算程序的实际工程思路。
文章暗示了寻找类似威尔逊定理的恒等式可能优化计算，指出了潜在的改进方向。

📖 站内阅读原文（RSS全文）

I saw where Elon Musk posted Grok’s answer to the prompt “What are the most beautiful theorems.” I looked at the list, and there were no surprises, as you’d expect from a program that works by predicting the most likely sequence of words based on analyzing web pages.

There’s only one theorem on the list that hasn’t appeared on this blog, as far as I can recall, and that’s Fermat’s theorem that an odd prime p can be written as the sum of two squares if and only if p = 1 mod 4. The “only if” direction is easy [1] but the “if” direction takes more effort to prove.

If p is a prime and p = 1 mod 4, Fermat’s theorem guarantees the existence of x and y such that

Gauss’ formula

Stan Wagon [2] gave an algorithm for finding a pair ( x , y ) to satisfy the equation above [2]. He also presents “a beautiful formula due to Gauss” which “does not seem to be of any value for computation.” Gauss’ formula says that if p = 4 k + 1, then a solution is

For x and y we choose the residues mod p with | x | and | y | less than p /2.

Why would Wagon say Gauss’ formula is computationally useless? The number of multiplications required is apparently on the order of p and the size of the numbers involved grows like p !.

You can get around the problem of intermediate numbers getting too large by carrying out all calculations mod p , but I don’t see a way of implementing Gauss’ formula with less than O ( p ) modular multiplications [3].

Wagon’s algorithm

If we want to express a large prime p as a sum of two squares, an algorithm requiring O ( p ) multiplications is impractical. Wagon’s algorithm is much more efficient.

You can find the details of Wagon’s algorithm in [3], but the two key components are finding a quadratic non-residue mod p (a number c such that c ≠ x ² mod p for any x ) and the Euclidean algorithm. Since half the numbers between 1 and p − 1 are quadratic non-residues, you’re very likely to find a non-residue after a few attempts.

[1] The square of an integer is either equal to 0 or 1 mod 4, so the sum of two squares cannot equal 3 mod 4.

[2] Stan Wagon. The Euclidean Algorithm Strikes Again. The American Mathematical Monthly, Vol. 97, No. 2 (Feb., 1990), pp. 125-129.

[3] Wilson’s theorem gives a fast way to compute ( n − 1)! mod n . Maybe there’s some analogous identity that could speed up the calculation of the necessary factorials mod p , but I don’t know what it would be.

The post Expressing a prime as the sum of two squares first appeared on John D. Cook .

35

Respectful Open Source

📝 Andrew Nesbitt

🏷️ 开源项目 🏷️ 软件工程

↗ 打开原文

📌 AI 摘要: 文章核心揭示了当前开源贡献的“推送”模式给维护者带来了巨大的认知负担和心理健康压力，并探讨了转向更尊重维护者注意力的“拉取”模型的可能性。

💡 核心要点:

开源维护者被未经请求的PR、问题、审计等‘推送’信息淹没，承受巨大认知负荷。
Git原生的`request-pull`是尊重注意力的‘拉取’模型，但当前平台和发现机制使其失效。
AI代码助手加剧了问题，导致低质量提交激增，迫使一些知名项目采取激进措施。

🧠 深度分析:

这揭示了开源可持续性的核心矛盾：贡献便利性与维护者福祉的冲突。若无法解决，将加速核心维护者 burnout，损害项目长期健康。
文章提出的‘可发现的修复’与‘拉取模型’是重要方向，需要平台工具创新（如改进分支发现）来支持，而不仅仅是关闭PR通道。
维护者需要更主动地设定边界（如Ghostty、tldraw的做法），社区也应更重视‘尊重性贡献’文化，而不仅关注资金问题。

📖 站内阅读原文（RSS全文）

I found and fixed a bug in a popular open source project last week. Went to look at the repository and saw a maintainer drowning in issues and pull requests, clearly underwater, and I didn’t submit the fix.

I’ve been on both sides of this for a long time. I ran 24 Pull Requests for years, a project that actively encouraged people to send PRs to open source maintainers every December. The incoming was so overwhelming that I ended up building Octobox just to help maintainers manage the flood of GitHub notifications. I’ve spent a decade building tools to help maintainers cope with inbound, and I still couldn’t bring myself to add to someone else’s pile.

When I mentioned this on Mastodon, most people got it immediately. A couple said send it anyway, which I think misses something about what it’s like to be on the receiving end. A fix from a stranger still carries cognitive load beyond just merging: triage, review, checking for regressions, responding, managing expectations when you can’t get to it quickly. And once you merge someone’s code, you’re maintaining it. They move on, but you’re the one who gets the bug report a year later when something breaks in a way the original patch didn’t anticipate.

Even a perfect PR with a note saying “no rush” creates a low-grade obligation the moment it appears. The maintainer now knows it exists, unanswered. Someone in the thread suggested framing it as a gift with no expectations, and another person put it well: it doesn’t matter how carefully you word it, it still lands as a thing that needs a decision.

The fix exists on my fork. If discovery were good, anyone hitting the same bug could find it there, but nobody will because fork discovery is effectively broken.

Git was pull-based

The open source contribution model is almost entirely push-based. You do the work, then you push it at a maintainer and wait. Issues, PRs, @mentions, automated updates, audit findings, all of it puts something in front of a person who didn’t ask for it.

git request-pull generates a summary of changes in your repo and asks someone to pull from it, a genuine peer-to-peer request where the maintainer decides if and when to look. The contributor publishes their work and the maintainer pulls at their own pace, which is about as respectful of someone’s attention as a collaboration model gets. GitHub took that name and bolted it onto what is functionally a push-based review queue. GitLab is at least honest about it by calling them merge requests.

Nobody can really use the git request-pull workflow anymore because it depends on the other person being able to find and browse your repo, which is a discovery problem that doesn’t have good answers right now. If the default were flipped so that fixes exist publicly without requiring maintainer attention, the contributor’s job would be done when the fix is public rather than when it’s merged, and other users could find and benefit from fixes independently of upstream.

Fork discovery is broken

The best tools for fork discovery are a handful of browser extensions that filter GitHub’s fork list to show forks with commits ahead of upstream, and the most ambitious one I found clones all forks into a single repo and lets you grep across them locally.

GitHub made forking easy and fork discovery nearly impossible. The old fork graph rarely works for popular repos because so many people use the fork button as a bookmark, and Dependabot, CI bots, and AI agents all generate forks that are nothing but noise. Someone in the thread mentioned installing a browser plugin just to look at forks.

GitHub have said they’ll let maintainers turn off PRs on their repos, which makes sense as a pressure valve, but turning off PRs without an alternative channel doesn’t make fixes discoverable elsewhere.

It might be more interesting to pair that switch with better discovery. Imagine a maintainer triaging issue #347 and being able to see “three forks have patches touching this code” without anyone having submitted anything, because the signal is already there in git, just not surfaced anywhere.

Everything is push

PRs are just the most visible channel. Bug reports, feature requests, support questions, and bot-generated updates all land in the same inbox with the same zero friction and the same assumption that someone on the other end has time to look at them.

Compliance and audit requests add another layer, where someone runs a scanner, finds something, and opens an issue that reads like a demand. “Your project has a licensing problem.” “This code has a known vulnerability.” The maintainer didn’t ask for the audit, didn’t agree to the compliance framework, and is now expected to respond on someone else’s timeline. With the EU CRA pushing more software supply chain accountability, there’s a growing class of inbound that amounts to “prove to me that your free software meets my requirements,” which is a lot to push at a volunteer.

Private vulnerability disclosure is different because it needs a direct channel by nature, and that channel has its own AI spam crisis as anyone following curl’s experience with HackerOne can attest. But for everything else, the problem isn’t bad faith on anyone’s part, it’s that every one of these interactions assumes the maintainer has capacity to receive, and there’s no mechanism for them to control that.

Open source sustainability conversations tend to focus on money, and maintainers absolutely need more of it, but maintainer attention and mental health are at least as scarce a resource, and nobody’s trying to conserve them. Miranda Heath’s report on burnout in open source names six causes, and workload is only one of them: toxic community behaviour, hyper-responsibility, and the pressure to keep proving yourself all compound the problem. The communities around projects aren’t fungible either, built on years of shared context and ambient trust that can’t be rebuilt once the people holding them together burn out. Unsolicited PRs, drive-by issues, and automated audits are all withdrawals from a finite account. A pull model, where people log problems and publish fixes somewhere discoverable and the maintainer engages on their own schedule, would at least stop treating that account as bottomless.

AI slop accelerates the problem

All of this was already a problem before AI coding agents, but the past six months have made it noticeably worse. The volume of low-quality inbound to popular projects has exploded. Daniel Stenberg watched AI-generated reports grow to 20% of curl’s bug bounty submissions through 2025, added a checkbox requiring AI disclosure, then finally killed the bounty program entirely in January 2026 after receiving seven submissions in sixteen hours. Ghostty implemented a policy where submitting bad AI-generated code gets you permanently banned. tldraw stopped accepting external PRs altogether .

These are experienced maintainers who tried graduated responses and ended up at the nuclear option because nothing else worked. The pattern is the same every time: add disclosure requirements, then add friction, then restrict access, then close the door, with each step costing maintainer energy on policy rather than code. That might work for individual projects, but it’s hard to see it scaling when the number of potential contributors becomes effectively infinite and the tooling to generate plausible-looking code keeps getting better. And if GitHub’s answer is letting maintainers turn off PRs entirely, AI pressure is going to force that switch on more and more repos, which only widens the discovery gap. GitHub made forking a one-click operation a decade ago without ever investing in making the resulting graph navigable, and now that turning off PRs is becoming a reasonable response to the AI firehose, all those would-be contributions just pile up as diverging forks that nobody can find.

A pull-based model would sidestep most of this, because agents can fork and generate garbage all day without anything landing in anyone’s inbox. The maintainer never has to evaluate it, write a policy about it, or spend emotional energy closing it with a polite note. Generated code that happens to be good sits in a fork where someone might eventually find it useful, and the rest is invisible.

The empathy of not adding to the pile, the choice to fix something and walk away, is invisible in open source sustainability discussions, and I suspect the contributions people deliberately don’t make out of respect for maintainer capacity might matter just as much as the ones they do. The fix is on my fork, and for now that’s where it stays.

36

Attack of the SaaS clones

📝 Martin Alderson

🏷️ AI/机器学习 🏷️ 软件工程

↗ 打开原文

📌 AI 摘要: 作者仅用约20个提示词，就借助Claude Code克隆了Linear的UI和核心功能，这预示着SaaS产品的核心界面与功能正变得极易被AI复制。

💡 核心要点:

AI代码生成工具能快速复刻成熟SaaS产品的核心功能。
克隆一个复杂产品（如Linear）的UI和功能，所需提示词数量极少。
这一现象直接对SaaS公司的技术壁垒构成挑战。

🧠 深度分析:

AI降低了软件复制的门槛，SaaS公司需更注重构建数据、网络效应等非代码壁垒。
基于现有摘要推断，这可能加速市场竞争，迫使企业更关注创新与用户体验的独特性。

📖 站内阅读原文（RSS摘要）

I cloned Linear's UI and core functionality using Claude Code in about 20 prompts. Here's what that means for SaaS companies.

37

The Final Bottleneck

📝 Armin Ronacher's Thoughts and Writings

🏷️ 软件工程 🏷️ AI/机器学习

↗ 打开原文

📌 AI 摘要: 文章指出，AI辅助编程极大提升了代码生成速度，但导致代码审查成为新的、不可持续的瓶颈，并引发了对软件责任归属和工程实践可持续性的深刻反思。

💡 核心要点:

AI编程工具使代码生成速度远超人工审查能力，导致PR积压如山。
历史经验表明，解决一个瓶颈只会将压力转移到下游环节。
作者认为，只要人类仍需对软件负责，就始终是流程中的瓶颈。

🧠 深度分析:

这揭示了当前‘AI优先’团队面临的核心矛盾：生产效率的提升若无法被下游环节（如审查、测试）吸收，将导致系统崩溃。
文章暗示未来软件工程可能向‘一次性塑料软件’模式演变，即快速生成与替换，这将彻底改变软件质量、维护和责任的定义。
实践上，项目可能需要采取节流（如限制提交）、自动化审查或重构责任模型来应对，否则将陷入不可持续的状态。

📖 站内阅读原文（RSS全文）

Historically, writing code was slower than reviewing code.

It might not have felt that way, because code reviews sat in queues until someone got around to picking it up. But if you compare the actual acts themselves, creation was usually the more expensive part. In teams where people both wrote and reviewed code, it never felt like “we should probably program slower.”

So when more and more people tell me they no longer know what code is in their own codebase, I feel like something is very wrong here and it’s time to reflect.

You Are Here

Software engineers often believe that if we make the bathtub bigger , overflow disappears. It doesn’t. OpenClaw right now has north of 2,500 pull requests open. That’s a big bathtub.

Anyone who has worked with queues knows this: if input grows faster than throughput, you have an accumulating failure. At that point, backpressure and load shedding are the only things that retain a system that can still operate.

If you have ever been in a Starbucks overwhelmed by mobile orders, you know the feeling. The in-store experience breaks down. You no longer know how many orders are ahead of you. There is no clear line, no reliable wait estimate, and often no real cancellation path unless you escalate and make noise.

That is what many AI-adjacent open source projects feel like right now. And increasingly, that is what a lot of internal company projects feel like in “AI-first” engineering teams, and that’s not sustainable. You can’t triage, you can’t review, and many of the PRs cannot be merged after a certain point because they are too far out of date. And the creator might have lost the motivation to actually get it merged.

There is huge excitement about newfound delivery speed, but in private conversations, I keep hearing the same second sentence: people are also confused about how to keep up with the pace they themselves created.

We Have Been Here Before

Humanity has been here before. Many times over. We already talk about the Luddites a lot in the context of AI, but it’s interesting to see what led up to it. Mark Cartwright wrote a great article about the textile industry in Britain during the industrial revolution. At its core was a simple idea: whenever a bottleneck was removed, innovation happened downstream from that. Weaving sped up? Yarn became the constraint. Faster spinning? Fibre needed to be improved to support the new speeds until finally the demand for cotton went up and that had to be automated too. We saw the same thing in shipping that led to modern automated ports and containerization.

As software engineers we have been here too. Assembly did not scale to larger engineering teams, and we had to invent higher level languages. A lot of what programming languages and software development frameworks did was allow us to write code faster and to scale to larger code bases. What it did not do up to this point was take away the core skill of engineering.

While it’s definitely easier to write C than assembly, many of the core problems are the same. Memory latency still matters, physics are still our ultimate bottleneck, algorithmic complexity still makes or breaks software at scale.

Giving Up?

When one part of the pipeline becomes dramatically faster, you need to throttle input. Pi is a great example of this. PRs are auto closed unless people are trusted. It takes OSS vacations . That’s one option: you just throttle the inflow. You push against your newfound powers until you can handle them.

Or Giving In

But what if the speed continues to increase? What downstream of writing code do we have to speed up? Sure, the pull request review clearly turns into the bottleneck. But it cannot really be automated. If the machine writes the code, the machine better review the code at the same time. So what ultimately comes up for human review would already have passed the most critical possible review of the most capable machine. What else is in the way? If we continue with the fundamental belief that machines cannot be accountable, then humans need to be able to understand the output of the machine. And the machine will ship relentlessly. Support tickets of customers will go straight to machines to implement improvements and fixes, for other machines to review, for humans to rubber stamp in the morning.

A lot of this sounds both unappealing and reminiscent of the textile industry. The individual weaver no longer carried responsibility for a bad piece of cloth. If it was bad, it became the responsibility of the factory as a whole and it was just replaced outright. As we’re entering the phase of single-use plastic software, we might be moving the whole layer of responsibility elsewhere.

I Am The Bottleneck

But to me it still feels different. Maybe that’s because my lowly brain can’t comprehend the change we are going through, and future generations will just laugh about our challenges. It feels different to me, because what I see taking place in some Open Source projects, in some companies and teams feels deeply wrong and unsustainable. Even Steve Yegge himself now casts doubts about the sustainability of the ever-increasing pace of code creation.

So what if we need to give in? What if we need to pave the way for this new type of engineering to become the standard? What affordances will we have to create to make it work? I for one do not know. I’m looking at this with fascination and bewilderment and trying to make sense of it.

Because it is not the final bottleneck. We will find ways to take responsibility for what we ship, because society will demand it. Non-sentient machines will never be able to carry responsibility, and it looks like we will need to deal with this problem before machines achieve this status. Regardless of how bizarre they appear to act already.

I too am the bottleneck now . But you know what? Two years ago, I too was the bottleneck. I was the bottleneck all along. The machine did not really change that. And for as long as I carry responsibilities and am accountable, this will remain true. If we manage to push accountability upwards, it might change, but so far, how that would happen is not clear.

38

Introducing GPT‑5.3‑Codex‑Spark

📝 Simon Willison's Weblog

🏷️ AI/机器学习 🏷️ 性能优化

↗ 打开原文

📌 AI 摘要: OpenAI与Cerebras合作推出高速代码模型GPT-5.3-Codex-Spark，其核心优势在于极快的响应速度，能显著提升编程时的流畅度和迭代效率。

💡 核心要点:

模型为GPT-5.3-Codex的缩小版，仅支持文本，上下文窗口128k。
实测速度远超其他模型，OpenAI宣称可达1000 tokens/秒。
作者通过生成“鹈鹕骑自行车”SVG的示例直观对比了速度差异。

🧠 深度分析:

极快的响应速度有助于开发者保持“心流”状态，进行高效的交互式编程迭代。
该模型可能成为实时编码辅助和教学演示的强大工具，推动AI编程助手向低延迟体验发展。
目前定价未公布，其成本效益将直接影响开发者的广泛采用。

📖 站内阅读原文（RSS全文）

Introducing GPT‑5.3‑Codex‑Spark

OpenAI announced a partnership with Cerebras on January 14th . Four weeks later they're already launching the first integration, "an ultra-fast model for real-time coding in Codex".

Despite being named GPT-5.3-Codex-Spark it's not purely an accelerated alternative to GPT-5.3-Codex - the blog post calls it "a smaller version of GPT‑5.3-Codex" and clarifies that "at launch, Codex-Spark has a 128k context window and is text-only."

I had some preview access to this model and I can confirm that it's significantly faster than their other models.

Here's what that speed looks like running in Codex CLI:

That was the "Generate an SVG of a pelican riding a bicycle" prompt - here's the rendered result:

Compare that to the speed of regular GPT-5.3 Codex medium:

Significantly slower, but the pelican is a lot better:

What's interesting about this model isn't the quality though, it's the speed . When a model responds this fast you can stay in flow state and iterate with the model much more productively.

I showed a demo of Cerebras running Llama 3.1 70 B at 2,000 tokens/second against Val Town back in October 2024 . OpenAI claim 1,000 tokens/second for their new model, and I expect it will prove to be a ferociously useful partner for hands-on iterative coding sessions.

It's not yet clear what the pricing will look like for this new model.

Tags: ai , openai , generative-ai , llms , cerebras , pelican-riding-a-bicycle , llm-release , codex-cli

39

Gurman: New Siri Might Be Delayed Again

📝 Daring Fireball

🏷️ AI/机器学习 🏷️ 移动开发

↗ 打开原文

📌 AI 摘要: 据彭博社记者Mark Gurman报道，苹果新一代Siri的个性化功能可能再次延期，部分核心特性或推迟至iOS 26.5及iOS 27发布。

💡 核心要点:

苹果原计划在iOS 26.4中发布新Siri，现改为分批在后续版本中推出。
内部测试已转向iOS 26.5，表明功能至少推迟一个版本。
一项关键延迟功能是Siri访问个人数据（如搜索短信）以执行复杂任务。

🧠 深度分析:

Siri的再次延期可能削弱苹果在AI助手领域的竞争力，影响其与竞争对手的差异化优势。
苹果考虑为功能添加‘预览’开关，表明其可能采取更谨慎的发布策略，以管理用户预期。
若功能推迟至接近WWDC，可能打乱苹果的产品发布节奏，使焦点过早转向下一代操作系统。

📖 站内阅读原文（RSS全文）

Mark Gurman, reporting for Bloomberg:

After planning to include the new capabilities in iOS 26.4 — an operating system update slated for March — Apple is now working to spread them out over future versions, according to people familiar with the matter. That would mean possibly postponing some features until at least iOS 26.5, due in May, and iOS 27, which comes out in September. [...]

In recent days, Apple instructed engineers to use the upcoming iOS 26.5 in order to test new Siri features, implying that the functionality may have been moved back by at least one release. Internal versions of that update now include a notice describing the addition of some Siri enhancements. One feature is especially likely to slip: the expanded ability for Siri to tap into personal data. That technology would let users ask the assistant to, say, search old text messages to locate a podcast shared by a friend and immediately play it.

Internal iterations of iOS 26.5 also include a settings toggle allowing employees to enable a “preview” of that functionality. That suggests Apple is weighing the idea of warning users that the initial launch is incomplete or may not work reliably — similar to what it does with beta tests of new operating systems.

When Gurman began reporting about personalized Siri delays a year ago, his reporting turned out to be exactly right. If these features are going to drop in iOS 26.4, they should be in pretty good shape right now internally. If they’re in bad shape right now in internal builds, it’s really hard to see how they could drop in iOS 26.4. And once you start talking about iOS 26.5 (let alone 26.6), we’d be getting really close to WWDC, where Apple’s messaging will turn to the version 27 OSes.

Something still seems rotten .

★

40

I Told You So

📝 the singularity is nearer

🏷️ AI/机器学习 🏷️ 技术趋势

↗ 打开原文

📌 AI 摘要: 文章借George Hotz的预言，批判当前AI等技术的发展动机（追求权力）可能导致可怕的后果，并呼吁社会反思技术发展的方向。

💡 核心要点:

引用George Hotz在2019年对技术奇点临近及其潜在恐怖后果的警告。
质问当前技术发展（如奇点体验）的方向和目的，批评其由错误动机驱动。
呼吁社会集体反思，认为许多技术本不该被建造，并暗示需要革命性改变。

🧠 深度分析:

文章警示了技术发展若脱离为人类福祉服务的初衷，可能带来系统性风险，这对AI伦理和治理提出了紧迫课题。
其批判性观点在AI技术快速商业化的当下具有现实意义，促使从业者思考技术的社会责任。
由于材料为观点性短文，分析基于其批判逻辑进行合理推断，具体技术路径和解决方案需参考更详细资料。

📖 站内阅读原文（RSS全文）

My quote from 2019

“I don’t know how close you guys think the singularity, but I think it’s very close. Once we reach the singularity, If we have the same motivations we have now — primarily power over people — things are going to be horrific” – George Hotz

How is everyone enjoying their singularity? How far is this going to go? Why are we letting the minds that invented fastpass run things? Who are we doing this all for again?

We live in a society. It seems a lot of people have forgotten this. So much stuff that’s being built just shouldn’t be built. You know technology could be good, right? It could all be like this and not like this .

Is everyone individually too weak to defect? Sounds like we need a revolution.

41

How can I distinguish between the numeric keypad 0 and the top-row 0 in the WM_KEYDOWN message?

📝 The Old New Thing

🏷️ 编程语言 🏷️ 软件工程

↗ 打开原文

📌 AI 摘要: 文章核心讲解了在Windows的WM_KEYDOWN消息中，如何通过wParam和lParam的扩展键位来区分小键盘0、主键盘0以及Insert键。

💡 核心要点:

小键盘0在NumLock开启时发送VK_NUMPAD0，关闭时发送VK_INSERT。
通过lParam第24位（扩展键标志）可区分小键盘0（非扩展）与独立Ins键（扩展）。
扩展键机制源于IBM PS/2键盘为兼容旧键盘而引入的设计。

🧠 深度分析:

此区分对需要精确键盘输入的软件（如财务软件、游戏）至关重要，可避免按键功能混淆。
理解底层键盘扫描码与虚拟键码的映射历史，有助于处理更复杂的键盘兼容性问题。
开发者应优先使用扩展键标志进行判断，而非依赖NumLock状态，以提高代码健壮性。

📖 站内阅读原文（RSS全文）

A customer wanted to know how to distinguish between the numeric keypad 0 and the top-row 0 in the WM_ KEYDOWN message. And while we’re at it, let’s also distinguish between the numeric keypad 0 and the Ins key.

We start with this table of what you get in the WM_ KEYDOWN message when you press the numeric keypad 0.

Event wParam

Numpad0 with NumLock on VK_NUMPAD0

Numpad0 with NumLock off VK_INSERT

Okay, so when the wParam is VK_ NUMPAD0 , it seems pretty clear that we have the numeric keypad 0. But when it is VK_ INSERT , we aren’t sure whether it’s the numeric keypad 0 with NumLock off, or whether it’s the dedicated Ins key.

For that, we can look at the lParam , specifically, bit 24, which is documented as the “extended key” bit.

Rewind the clock to 1983. The IBM PC XT keyboard is introduced. To the left of the main keyboard is a set of numbered function keys, and to the right is a numeric keypad. But the keys on the numeric keypad do double-duty because arrows and other editing keys are overlaid onto them.

⏎ 7

Home

8

↑

9

PgUp

4

←

5

6

→

PrtSc

*

1

End

2

↓

3

PgDn

0

Ins

.

Del

You select whether you want numbers or arrows/editing keys by toggling NumLock .

The IBM PS/2 keyboard expanded the set of keys on the keyboard by inserting a block of keys between the main keyboard and the numeric keypad. This block contains the arrow keys and the editing keys. This keyboard layout closely resembles the keyboard layout used by most keyboards today, so I guess it held up okay.

For compatibility, the bonus keys on the keyboard reported themselves to be the same as the numeric keypad keys they shadowed, but with an extra flag byte to say that they are “extended” keys. They’re “extended” because they weren’t in the original keyboard.

This “extended” terminology has carried forward ever since. So we can distinguish between the dedicated Ins key and a numeric keypad 0 with NumLock off by seeing if we got an extended key. If so, then it came from the editing keys; if not, then it came from the numeric keypad.

Event wParam Extended?

Numpad0 with NumLock on VK_NUMPAD0 0

Numpad0 with NumLock off VK_INSERT 0

Ins key VK_INSERT 1

Next time, we’ll look at distinguishing the numeric keypad 0 from the top-row 0 in the WM_ CHAR message. It’s a little messier.

Bonus chatter : That PrtSc key was a major source of frustration because it sat right next to the shift key . If your finger was slightly misaligned and hit both the shift key and the PrtSc key, you accidentally asked for the screen contents to be sent to the printer. Your computer just hung until you turned on your printer so you could get a printout that you didn’t want. (And if you didn’t have a printer, you were just dead.)

The post How can I distinguish between the numeric keypad 0 and the top-row 0 in the <CODE>WM_<WBR>KEYDOWN</CODE> message? appeared first on The Old New Thing .

42

Trends in US Construction Productivity

📝 Construction Physics

🏷️ 技术趋势 🏷️ 软件工程

↗ 打开原文

📌 AI 摘要: 文章核心指出，美国建筑业生产率数十年来停滞甚至下降，远低于其他行业，并系统梳理了衡量该问题的不同层级指标及其结论。

💡 核心要点:

自1964至2004年，美国建筑业生产率年均下降0.59%，而同期非农产业年均增长1.77%。
衡量建筑业生产率的指标分为行业整体、细分领域、特定建筑和具体任务四个层级。
Goolsbee和Syverson的研究显示，自1960年代中期以来，建筑业生产率已下降约50%。

🧠 深度分析:

建筑业生产率停滞意味着住房、道路等基础设施成本难以下降，直接影响社会民生与经济发展。
该问题长期存在且被多方研究证实，表明其是系统性顽疾，需从技术、管理和政策等多维度寻求突破。
文章为技术编辑提供了深入分析复杂行业问题的框架，即通过不同颗粒度的指标交叉验证核心趋势。

📖 站内阅读原文（RSS全文）

(This is a chapter of a longer report I’m working on that summarizes and expands the last several years of my work on construction productivity. I plan on publishing one chapter a month on the newsletter, and aim to have the full report done by the end of the year.) For decades, American construction has fallen behind almost every other major sector in productivity growth. As far back as 1970 researchers noted that construction productivity improvement significantly lagged productivity improvement in the economy overall, and by 1985 economists were investigating what appeared to be declining construction productivity. Stanford civil engineering professor Paul Teicholz noted in a 2004 article in AECbytes that between 1964 and 2004, construction productivity declined by 0.59% per year on average, which was “particularly alarming when compared to the increasing labor productivity in all non-farm industries, which have experienced an increasing productivity of 1.77%/year over the same time period.” A 2017 article in The Economist noted that “construction holds the dubious honour of having the lowest productivity gains of any industry.” In a 2023 New York Times column , Ezra Klein wrote that “A construction worker in 2020 produced less than a construction worker in 1970, at least according to the official statistics.” The trend of construction productivity in the United States failing to improve over time is indeed concerning. “Productivity” means some measure of output, divided by some measure of input. When productivity is improving, we get more output for a given amount of input over time; if productivity is falling, we get less output for a given amount of input over time. If productivity doesn’t improve, we can’t expect construction costs to fall and things like houses, roads, and bridges to get any cheaper. Because of this, it’s worth looking deeply at what exactly the trends in US construction productivity are. Economists and researchers measure construction productivity in a variety of different ways. We can broadly categorize these metrics by their level of granularity: • At the lowest level of granularity, we have metrics that track productivity changes across the entire construction sector.

• Slightly more granular are metrics that look at productivity changes in a particular subsector, such as housing construction.

• Looking more specifically, we have metrics that look at productivity changes for constructing particular buildings.

• And finally we have metrics that track productivity changes for individual construction tasks.

Each category of metric gives a slightly different perspective on productivity trends, and each has its own measurement challenges that we must consider when interpreting the data. Sector-wide productivity metrics Sector-wide productivity metrics look at productivity trends across the entire construction industry. They answer if, overall, we’re getting more or less construction output for a given amount of input. The graph below, for instance, shows trends in US construction productivity by using total construction spending as a measure of output, and total hours worked in the construction sector as a measure of input. (Spending has been adjusted to 2025 dollars using the Consumer Price Index —we’ll talk more about whether this is a reasonable way to adjust for inflation later.) We can see that, per this metric, construction labor productivity — the amount of construction output we get for a given amount of labor — is virtually flat between 1964 and 2024, whereas labor productivity in the economy overall rose by a factor of three. Sector-wide metrics which look at productivity trends across the entire construction industry are very common. Paul Teicholz uses the same data we used above to look at trends in construction productivity in a 2013 article , and his 2004 article uses a very similar metric (rather than total spending, he uses US Department of Commerce construction spending data, a subset, as a measure of output). • •

In their 2025 paper “ The Strange and Awful Path of Construction Productivity in the US ”, economists Austan Goolsbee and Chad Syverson use a slightly different sector-wide productivity metric. For output they use real (inflation-adjusted) construction value-add data from the Bureau of Economic Analysis , and for input they use the number of full-time construction employees. (Unlike total construction spending, which just tracks the value of the outputs, value-add measures the value of construction outputs minus the value of the inputs used.) Goolsbee and Syverson also look at trends in construction total factor productivity (TFP), which measures productivity of both labor and capital (equipment, machinery, etc.) by comparing the growth rates of real construction value-add to the growth rates of construction labor and capital inputs. According to Goolsbee and Syverson’s productivity metrics, construction productivity looks even worse. Productivity increased from the 1950s until the mid-1960s, but since then it has declined by roughly 50%. Discussions of US construction productivity often reference this Goolsbee and Syverson paper, or the data behind it. An early version of Goolsbee and Syverson’s paper is what Ezra Klein is referring to in his 2023 New York Times column, and it’s referred to in a 2025 Federal Reserve Economic Brief examining productivity. The data is also used in a 2026 report from Goldman Sachs looking at the causes of low US construction productivity. Management consultancy McKinsey likewise uses BEA value-add data in a 2017 report to construct a similar productivity metric, gross value add per hour worked, to show that in the US construction productivity improvement had lagged virtually every other industry: • •

The Bureau of Labor Statistics also uses BEA data, combined with its own estimates of hours worked, to calculate trends in both labor productivity and total factor productivity for a variety of sectors , including construction. This metric likewise shows construction productivity as stagnant or declining. It’s not uncommon for discussions of productivity to also reference this BLS metric; for instance, it’s used by Federal Reserve economists Daniel Garcia and Raven Molloy in their 2025 paper “Reexamining Lackluster Productivity Growth in Construction”. Sector-wide measures of US construction productivity thus tell a consistent story of stagnant productivity growth, differing only in how bad the problem appears. By some measures, productivity is merely flat over the last several decades; by others, productivity has declined significantly. Sub-sector productivity metrics Subsector metrics are also commonly used to get a picture of national construction productivity trends, particularly metrics that look at trends in housing construction. In their 2023 NBER working paper, “ Why Has Construction Productivity Stagnated? ” Princeton economist Leonardo D’Amico and coauthors looked at productivity trends in US homebuilding by dividing the total number of housing units produced in the US by the total number of residential construction employees. They found that housing productivity had declined significantly since the 1960s — though, as we’ll see, there are issues with their choice of metric. Goolsbee and Syverson also looked at housing units per employee in their 2025 paper, along with another housing productivity metric, square footage of housing per employee. As with D’Amico et al., housing units per employee shows declining productivity over time, while square feet per employee shows slightly more complex trends: productivity appears to decline between the 1970s and the early 1990s, and decline since then for multifamily construction, but single-family construction shows an increase in productivity of close to 50% between 1990 and 2020. In their 2025 paper, Garcia and Molloy also look at productivity trends in single-family home construction using square footage of housing produced per employee, though they also try to include quality adjustments in this metric. (We’ll discuss quality adjustments more later.) • •

Via D’Amico et al. (2023) • •

Via Goolsbee and Syverson (2025) The Bureau of Labor Statistics also produces estimates for construction productivity trends for four sub-sectors : single-family home construction, multifamily home construction (i.e., apartment buildings), industrial building construction, and highway and bridge construction. These are based on individual subsector estimates of construction spending from the US Census, and BLS estimates of hours worked. Per the BLS, while single-family home productivity has been stagnant since 1987 and highway and bridge productivity has declined, productivity is up for both multifamily construction and for industrial building construction. Construction subsector productivity estimates thus generally show stagnant or declining construction productivity, though with significant variation. Some subsectors show increasing productivity, and some show different trends by different metrics. Single-family home construction shows increasing productivity when measured by square feet of home per employee, but unchanging productivity when measured by subsector spending per labor hour; for multifamily home construction, the reverse is true. Project and building productivity metrics Below the level of construction subsectors, we have productivity metrics that look at trends for individual building types, such as the amount of labor required to build a single-family home. These sorts of metrics are much less common, as it’s rare to get detailed project-level productivity data from builders, but are still seen occasionally. In 1964 and 1972 the Bureau of Labor Statistics conducted studies on the number of hours it took to build a single-family home, finding that the average annual percent change in labor hours per square foot was just -0.6% per year (ie: productivity increased, but slowly). The Construction Industry Institute has a “Benchmarking and Metrics Productivity Database” that tracks project-level productivity metrics for submitted projects. A NIST analysis of this database from 2000 to 2007 noted a decline in project-level productivity, measured in output in dollars per labor-hour. We can construct our own building-level productivity metric by using data from construction estimating guides. Estimating guides, produced by companies like RS Means and Craftsman, provide information on cost, labor, and material requirements for hundreds of different construction tasks, and are used to generate cost estimates for new construction projects. Some companies have also often been producing their estimating guides for many years, making them a valuable tool for analyzing productivity trends; both RS Means and Craftsman have been producing estimating guides since the 1950s. Starting in 1993, Craftsman’s National Construction Estimator included an estimate of the total number of hours required to build a “typical” single-family home. If we compare the estimated number of hours per square foot in 1993 and 2026, they’re almost identical. The only task that has changed is insulation installation, which took a single man six days in 1993 and now takes one man 3 days. It’s also worth noting that this hours per square foot figure is also virtually the same as the number of hours per square foot calculated by the BLS in their 1964 and 1972 studies. Thus, project-level measurements of US construction productivity also tend to show a stagnation or a decline in US construction productivity over time. Task-level productivity metrics Finally, below project-level productivity metrics, we have measures that look at productivity of individual construction tasks: laying bricks, framing walls, installing plumbing, and so on. These metrics are fairly commonly used, thanks to the existence of estimating guides. We can look at changes in task-level construction productivity by seeing how the time and labor required for various specific construction tasks has changed in estimating guides over time. Allmon et al (2000) looked at productivity changes for 20 different construction tasks from 1974 through 1996 using RS Means estimating guide data, and found that labor productivity increased for seven tasks, decreased for two tasks, and was unchanged for 11 tasks. Goodrum et al (2002) looked at productivity changes between 1976 and 1998 for 200 different construction tasks using data from several different estimating guides. They found that labor productivity declined for 30 tasks, was unchanged for 64 tasks, and improved for 107 tasks, with an average growth rate in labor productivity ranging from 0.8% to 1.8% depending on the estimating guide. A follow up study by Goodrum in 2009 that looked at productivity trends in 100 different construction tasks between 1977 and 2004 found a somewhat lower average productivity increase of just 0.47% per year, with significant variation between task categories. We can also use different versions of estimating guides to do our own analysis of productivity trends. The chart below shows the relative installation rates for 40 different construction tasks which are listed in the RS Means estimating guides from 1985 and 2023. 10 tasks got more productive over the period, 10 got less productive, and 20 tasks were unchanged. • •

We can also try to calculate installation rates directly, using the values RS Means lists for task labor cost and hourly wages. The chart below shows the installation rates calculated for 17 construction tasks performed by either carpenters or sheet metal workers that were listed in the 1954, 1985, and 2023 version of the RS Means estimating guide. 1 Effective installation rates for each task were calculated by dividing unit labor costs for the task by the average worker wage for that task type. By this analysi

内容较长，当前仅展示前 14000 字。可点击“打开原文”查看完整内容。

43

Book Review: On the Calculation of Volume - Solvej Balle ★★★★★

📝 Terence Eden’s Blog

🏷️ 其他

↗ 打开原文

📌 AI 摘要: 这是一篇关于小说《论体积的计算》的书评，作者高度评价了这部以时间循环为背景、探讨存在与关系的文学作品。

💡 核心要点:

小说设定在11月18日，与书评作者生日相同，带来了独特的阅读体验。
故事通过被困女性的日记展开，探讨了幽闭、怨恨与和解等主题。
书评作者被作品深度震撼，但因强度过高而犹豫是否阅读后续系列。

🧠 深度分析:

书评揭示了文学如何通过‘时间循环’等科幻设定，深刻探讨痴呆、离婚、环保等现实议题，展现了严肃文学的思辨价值。
作者强烈的个人共鸣与阅读后的情感消耗，说明了优秀文学作品能带来的深刻心理影响与沉浸式体验。

📖 站内阅读原文（RSS全文）

I had the most intense time reading this book. Do you ever see the date of a famous event and notice that it is also the date of your birthday? When I do, my brain gets a fun jolt of recognition. This book is set perennially on the 18th of November - my birthday. My poor little brain was exhausted and satiated from the repeated mentions. A most curious experience.

It would be easy to dismiss this as "Groundhog Day" but French. Like the movie Palm Springs , it revitalises the "time loop" concept. Told through the diary of a woman trapped, we get an intimate sense of her claustrophobia and resentment.

The novel is quiet and contemplative. Much like " In Search of Lost Time ", it revels in describing the mundane. Although the prose is much more captivating than Proust! It meanders in lovely an unhurried way as our protagonist attempts to first understand and then make peace with her predicament.

You could read it as a meditation on dementia - as her partner forgets every previous day. Or on divorce - as she attempts to hide in her own house. Perhaps it is an allegory for environmentalism as she tries to leave no mark on the world?

I got to the end stunned by the journey - and I completely understand why it has attracted such a passionate following. That said, it was so intense that I'm not sure I can handle reading the next six(!) in the series.

44

The Many Flavors of Ignore Files

📝 Andrew Nesbitt

🏷️ 工具 🏷️ 软件工程

↗ 打开原文

📌 AI 摘要: 文章通过作者修复go-git库中gitignore实现差异的经历，深入剖析了.gitignore语法的复杂性和众多工具对其语法的非标准实现。

💡 核心要点:

gitignore模式匹配包含四层优先级、锚定规则、通配符语义等复杂细节。
许多工具声称支持“gitignore语法”，但实现往往不完整或存在差异。
文章列举了从Docker到各类云平台、编辑器的超过15种其他ignore文件。

🧠 深度分析:

正确理解gitignore的复杂规则对调试文件忽略问题和编写精确规则至关重要，可避免‘幽灵差异’等问题。
工具间忽略文件语法的碎片化增加了开发者的认知负担，在跨工具协作时需注意兼容性问题。
对于需要实现类似功能的开发者，应参考git源码（如wildmatch.c）而非简单模仿，以确保行为一致。

📖 站内阅读原文（RSS全文）

A bug report in git-pkgs led me down a rabbit hole: files that git ignored were showing up as phantom diffs, and the cause turned out to be go-git’s gitignore implementation , which doesn’t match git’s actual behavior for unanchored patterns in nested directories. I went looking for a Go library that fully matched git’s pattern semantics and couldn’t find one, so I wrote git-pkgs/gitignore with a wildmatch engine modeled on git’s own wildmatch.c .

Building that made me appreciate how much complexity hides behind .gitignore , and got me thinking about all the other tools with their own ignore files. Most claim to use “gitignore syntax” without specifying which parts, and that phrase turns out to be doing a lot of work. Every tool wants to be git until it has to implement git’s edge cases.

gitignore

Most people know that *.log ignores log files and node_modules/ ignores the node_modules directory. But gitignore does far more than simple glob matching. I covered the basics in Git’s Magic Files , but getting a correct implementation working forced me to deal with all of it. The gitignore docs describe the behavior in prose; the real authority is the implementation in dir.c and wildmatch.c , with tests in t0008-ignores.sh and t3070-wildmatch.sh .

Four layers of patterns. Git doesn’t just read one .gitignore file. It checks patterns from four sources in order of increasing priority: the global excludes file ( core.excludesFile , defaulting to ~/.config/git/ignore ), then .git/info/exclude for repo-local patterns that aren’t committed, then the root .gitignore , then .gitignore files in each subdirectory. A pattern in src/.gitignore only applies to files under src/ . Patterns in deeper directories override patterns in parent directories, and the last matching pattern wins. If you’re debugging why a file isn’t being ignored (or why it is), git check-ignore -v <path> will tell you exactly which pattern in which file is responsible.

Anchored vs. unanchored patterns. A pattern with no slash in it, like *.log , is unanchored and matches at any depth because git effectively prepends **/ to it. But the moment a pattern contains a slash, including a leading / , it becomes anchored to its .gitignore ’s directory. This distinction is where go-git’s implementation broke down for us.

Pattern Matches Doesn’t match Why

debug.log debug.log , logs/debug.log Unanchored, matches at any depth

/debug.log debug.log at root only logs/debug.log Leading / anchors to root

doc/frotz doc/frotz a/doc/frotz Contains / , so anchored

build/ build/ (dir), src/build/ (dir) build (file) Trailing / restricts to directories

Wildcards. * matches any string within a single path segment but does not cross / boundaries. ? matches exactly one character, also not / . These follow the rules of git’s wildmatch.c , which is subtly different from shell globbing or Go’s filepath.Match .

Doublestar ** . Only special when it appears as a complete path segment between slashes: **/logs matches logs at any depth, logs/** matches everything under logs/ , and foo/**/bar matches foo/bar , foo/a/bar , foo/a/b/c/bar with zero or more intermediate directories. But foo**bar is not special because the stars aren’t a standalone segment; they’re just two regular * wildcards that won’t cross a / .

Bracket expressions. [abc] matches one character from the set, ranges like [a-z] and [0-9] work as expected, and both [!a-z] and [^a-z] negate the match. All 12 POSIX character classes are supported: [:alnum:] , [:alpha:] , [:blank:] , [:cntrl:] , [:digit:] , [:graph:] , [:lower:] , [:print:] , [:punct:] , [:space:] , [:upper:] , [:xdigit:] . You can mix classes with ranges in a single expression: [a-c[:digit:]x-z] . The edge cases are where it gets interesting: ] as the first character after [ is a literal member of the class, not the closing bracket. Ranges are byte-value ordered, so [B-a] matches bytes 66 through 97, which includes uppercase B through Z, several symbols, and lowercase a.

Directory-only patterns. A trailing / means the pattern only matches directories, so build/ matches the directory build but not a file named build , and it also matches everything inside that directory because once a directory is ignored git skips it entirely and never looks at its contents.

Negation. A leading ! re-includes something a previous pattern excluded. The subtlety is that you can’t re-include a file if its parent directory was already excluded, because git never descends into the excluded directory to check. To ignore everything except one nested path, you need to re-include each intermediate directory:

/* !/foo /foo/* !/foo/bar

This ignores everything except foo/bar . You have to re-include foo/ , then re-exclude foo/* , then re-include foo/bar . Skipping the middle step means foo/bar stays excluded.

Escaping. A backslash makes the next character literal, so \!important matches a file literally named !important rather than being a negation pattern, and \#comment matches a file named #comment rather than being treated as a comment line.

Trailing spaces. Unescaped trailing spaces on a pattern line are stripped, but trailing tabs are not. A backslash before a trailing space preserves it. Leading spaces are always significant: ` hello is a valid pattern matching a file named hello`.

Tracked files are immune. If a file is already tracked by git, adding it to .gitignore does nothing. You need git rm --cached first. This is probably the single most common source of confusion with gitignore. There’s also git update-index --assume-unchanged which tells git to pretend a tracked file hasn’t changed , useful for local config tweaks you don’t want showing up in git status .

Everything else

.gitignore is the original. Then the copies, roughly in order of how likely you are to encounter them:

• .dockerignore for Docker build context

• .npmignore for npm package publishing

• .prettierignore , .eslintignore , .stylelintignore for JavaScript linters and formatters

• .hgignore for Mercurial

• .containerignore for Podman and Buildah (the OCI alternative to .dockerignore )

• .gcloudignore for Google Cloud

• .vercelignore for Vercel ( .nowignore was the legacy name)

• .slugignore for Heroku

• .ebignore for AWS Elastic Beanstalk

• .cfignore for Cloud Foundry

• .helmignore for Helm charts

• .artifactignore for Azure DevOps

• .funcignore for Azure Functions

• .vscodeignore for VS Code extension packaging

• .chefignore for Chef

• .bzrignore for Bazaar

• .cvsignore for CVS

• .ignore , .rgignore , .agignore for ripgrep and the silver searcher

How others differ

Docker’s is probably the most consequential ignore file after git’s, because it affects build context size and therefore build speed and layer caching. But it’s still just one flat file with no cascading, no per-directory overrides, and no global config. The pattern matching differs in subtle ways too: gitignore automatically prepends **/ to unanchored patterns so they match at any depth, while Docker’s implementation (using Go’s filepath.Match under the hood) doesn’t do the same implicit anchoring. The @balena/dockerignore npm package has good documentation on these differences.

npm’s is interesting because of its inverted relationship with package.json . You can use a files array in package.json to allowlist instead of blocklist, and if you do, .npmignore is ignored. If there’s no .npmignore at all, npm falls back to .gitignore , which catches people out when they publish packages and find that their dist/ directory was excluded because gitignore told npm to skip it. Running npm pack --dry-run before publishing shows you exactly which files would be included, which would have saved me hours the first time I hit this.

Mercurial’s .hgignore is more powerful than gitignore. It lets you choose your syntax per section with syntax: glob or syntax: regexp , and you can combine both in the same file, switching between them as needed. Glob patterns for the simple stuff, a regex for that one weird build artifact naming scheme, all in one file. It’s the only ignore file I know of that gives you regex, and the ability to mix syntaxes is something git never adopted.

“Uses gitignore syntax”

Most tools say “uses gitignore syntax” in their docs. What they usually mean is: glob patterns, one per line, # for comments, maybe ! for negation. That’s a reasonable subset, but the differences bite you when you assume full compatibility.

Some don’t support negation at all, some don’t support comments, and some treat * as matching directory separators while others don’t. Doublestar ** is supported by most but not all, and trailing / for directory-only matching varies enough between tools that you can’t assume it works the same way everywhere.

The underlying cause is implementation diversity. Tools using Go’s filepath.Match get different behavior from tools using the ignore npm package, which get different behavior from tools using Python’s pathspec library, which get different behavior from tools calling out to git’s own matching code. Each reimplementation makes slightly different choices about edge cases, and the gitignore spec is informal enough that these choices are all defensible. This is exactly what I ran into with go-git: it’s a mature, widely-used library, and its gitignore implementation still doesn’t handle unanchored patterns correctly in nested directories.

A proper compatibility matrix across all these tools (supports negation? comments? doublestar? directory-only matching? cascading?) would be useful reference material. I haven’t found one, and writing it would mean empirically testing each tool rather than trusting their docs. Create a test fixture directory with files designed to probe each feature, write the ignore file, run the operation, and see what actually gets included. The tricky part is that each tool’s operation is different: npm pack --dry-run , docker build , git status , eslint . . You’d need per-tool test harnesses.

CommonIgnore

One corner of the ecosystem actually tried to consolidate rather than adding yet another format. ripgrep and the silver searcher (ag) both deprecated their tool-specific ignore files ( .rgignore and .agignore ) in favor of a shared .ignore file. ripgrep’s precedence chain is .gitignore then .ignore then .rgignore , with each layer overriding the previous. BurntSushi extracted the matching logic into the ignore crate (part of the ripgrep monorepo, 91M+ downloads), and other tools like fd picked it up too. It’s tool-agnostic by convention rather than by any formal standard, but it’s the closest anyone has come to sharing an ignore format across tools.

Markdown had a similar problem for years. Every tool claimed to support “Markdown” but each implemented a slightly different dialect, with different rules for edge cases around nesting, link parsing, and emphasis. CommonMark fixed this by writing an unambiguous formal spec with hundreds of examples that serve as a test suite. Now tools can test their parser against the spec rather than guessing at intent, and users can rely on consistent behavior across implementations.

It’s not hard to imagine something similar for ignore files. Git’s documentation describes the behavior in prose, which leaves room for interpretation on things like how * interacts with / , whether ** must be surrounded by separators, and what happens when bracket ranges span from uppercase to lowercase. A formal spec with a shared test suite could let tool authors say “we implement level 1” (basic globs and comments) or “level 2” (add negation and doublestar) rather than the current vague gesture at gitignore compatibility. The wildmatch test cases in git’s own test suite are a starting point, but they only cover pattern matching, not the layering, anchoring, and directory semantics that trip up most implementations.

45

Pluralistic: Doctors' union may yet save the NHS from Palantir (12 Feb 2026)

📝 Pluralistic: Daily links from Cory Doctorow

🏷️ 开源项目 🏷️ 网络安全

↗ 打开原文

📌 AI 摘要: 文章核心讲述了英国医生工会（BMA）正基于一个卓越的本土开源替代方案（OpenSAFELY），发起抵制将NHS患者数据移交给美国军事承包商Palantir的运动。

💡 核心要点:

英国工党政府将NHS患者数据合同授予有争议的美国军事承包商Palantir，而非本土开源方案OpenSAFELY。
OpenSAFELY是一个基于可信研究环境的隐私保护系统，在新冠疫情期间已成功产出大量高质量医学研究成果。
英国医学协会（BMA）作为强大工会，已建议其成员医生抵制使用Palantir产品，并可能迫使政府改变决定。

🧠 深度分析:

此事件凸显了在关键公共数据系统（如医疗）中，技术主权、隐私保护与商业游说力量之间的激烈冲突，选择关乎国家长期利益。
OpenSAFELY的成功案例为全球提供了如何在保护隐私前提下高效利用医疗数据进行科研的范本，其开源模式具有可扩展和持续改进的优势。
强大的专业工会（如BMA）介入技术采购决策，可能成为制衡不当商业行为、推动采用更优技术方案的重要社会力量。

📖 站内阅读原文（RSS全文）

->->->->->->->->->->->->->->->->->->->->->->->->->->->->->

Top Sources: None

-->

Today's links

• Doctors' union may yet save the NHS from Palantir : There is power in the union.

• Hey look at this : Delights to delectate.

• Object permanence : Premature internet activists; Privacy Without Monopoly; "Broad Band"; Yazidi supersoldiers; I was a Jeopardy! clue.

• Upcoming appearances : Where to find me.

• Recent appearances : Where I've been.

• Latest books : You keep readin' em, I'll keep writin' 'em.

• Upcoming books : Like I said, I'll keep writin' 'em.

• Colophon : All the rest.

Doctors' union may yet save the NHS from Palantir ( permalink )

If you weren't paying close attention, you might think that the most grotesque and indefensible aspect of Keir Starmer's Labour government turning over NHS patient records to the American military contractor Palantir is that Palantir are Trumpist war-criminals, "founded to kill communists":

https://www.thecanary.co/trending/2026/01/07/palantir-kill-communists/

And that is indeed grotesque and indefensible, and should have been grounds for Starmer being forced to resign as PM long before it became apparent that he stuffed his government with Epstein's enablers and chums:

https://www.thenational.scot/news/25451640.streeting-defends-peter-mandelsons-relationship-jeffrey-epstein/

But it's actually much worse than that! It's not just that Labour hand over Britain's crown jewels to rapacious international criminals who are deeply embedded in a regime that has directly threatened the sovereignty of the UK. They also passed up a proven, advanced, open, safe, British alternative: the OpenSAFELY initiative, developed by Ben Goldacre and his team at Jesus College Oxford:

https://www.opensafely.org/

OpenSAFELY is the latest iteration of Goldacre's Trusted Research Environment (TRE), arguably the most successful patient record research tool ever conceived. It's built atop a special server that can send queries to each NHS trust, without ever directly accessing any patient data. Researchers formulate a research question – say, an inquiry into the demographics of the comorbidities of a given disease – and publish it using a modified MySQL syntax on a public git server. Other researchers peer-review the query, assessing it for rigour, and then the TRE farms that query out to each NHS trust, then aggregates all the responses and publishes it, either immediately or after a set period.

This is a fully privacy-preserving, extremely low-cost, rapid way for researchers to run queries against the full load of NHS patient records, and holy shit does it ever work. By coincidence, it went online just prior to the pandemic, and it enabled an absolute string of blockbuster papers on covid, dozens of them, including several in leading journals like Nature :

https://www.digitalhealth.net/2022/04/goldacre-trusted-research-environments/

This led HMG to commission Goldacre to produce a report on the use of TREs as the permanent, principal way for medical researchers to mine NHS data (disclosure: I was interviewed for this report):

https://www.gov.uk/government/publications/better-broader-safer-using-health-data-for-research-and-analysis

This is a near-miraculous system: an ultra-effective, ultra- cost -effective, Made-in-Britain, open, transparent, privacy-preserving, rigorous way to produce medical research insights at scale, which could be perfected in the UK and then exported to the world, getting better every time a new partner signs on and helps shoulder the work of maintaining and improving the free/open source software that powers it.

OpenSAFELY was the obvious contender for NHS research. But it wasn't the only one: in the other corner was Palantir, a shady American company best known for helping cops and spies victimise people on the basis of dodgy statistics. Palantir blitzed Westminster with expensive PR and lobbying, and embarked on a strategy to "hoover up" every small NHS contractor until Palantir was the last company standing. Palantir UK boss Louis Moseley called it "Buying our way in":

https://pluralistic.net/2022/10/01/the-palantir-will-see-you-now/#public-private-partnership

It worked. First, Palantir got £60m worth of no-bid contracts during the acute phase of the pandemic, and then it bootstrapped that into a £330m contract to handle all the NHS England data:

https://www.theregister.com/2023/11/22/palantir_wins_nhs_contract/

It was a huge win for corruption over excellence and corporate surveillance over privacy. At the same time, it was a terrible blow to UK technological sovereignty, and long-term trust in the NHS.

But that's not where it ended. Palantir continued its wildly profitable, highly public programme of collaborating with fascists – especially Trump's ICE kill/snatch-squads – further trashing its reputation around the world. It's now got so bad that the British Medical Association (BMA) – a union representing more than 200,000 UK doctors – has told its members that they should not use the Palantir products that the NHS has forced onto their practices:

https://www.bmj.com/content/392/bmj.s168/rr-2

In response, an anonymous Palantir spokesperson told The Register that Britons should trust its software because the company is also working with British police forces:

https://www.theregister.com/2026/02/11/bma_palantir_nhs/

The BMA is a very powerful, militant union, and it has already run successful campaigns against Starmer's government that forced Labour to shore up its support for the NHS. The fact that there's a better, cheaper, more effective, technologically sovereign tool that HMG has already recognised only bolsters the union's case for jettisoning Palantir's products altogether.

( Image: Gage Skidmore , CC BY 2.0 , modified )

Hey look at this ( permalink )

• Open Letter to Tech Companies: Protect Your Users From Lawless DHS Subpoenas https://www.eff.org/deeplinks/2026/02/open-letter-tech-companies-protect-your-users-lawless-dhs-subpoenas

• Auspicious Omens and Excellent Insubordination https://www.meditationsinanemergency.com/auspicious-omens-and-excellent-insubordination/

• Olympic Spirits on ICE https://prospect.org/2026/02/11/feb-2026-magazine-sports-olympic-spirits-on-ice-los-angeles/

• Bracing for the Enshittification of Embodied AI and Robotics https://sites.google.com/view/bracing-for-enshittification

• Joshua Idehen – Once in a lifetime (Talking Heads/Angélique Kidjo) https://www.youtube.com/watch?v=xQG5zN8QOAs