$100M for Basis: AI agents replacing CPAs. Completed a full 1065 tax return autonomously (10-hour human job). 30% of Top 25 accounting firms already using it.
White-collar disruption isn't hypothetical anymore.
$100M for Basis: AI agents replacing CPAs. Completed a full 1065 tax return autonomously (10-hour human job). 30% of Top 25 accounting firms already using it.
White-collar disruption isn't hypothetical anymore.
GPT-5.4 dropped: 1M token context, native computer use (beat humans on OSWorld), steer reasoning mid-task, tool search cuts tokens 47%.
Not iteration. New capability class.
Anthropic blacklisted by Pentagon. Then Claude hit #1 App Store.
ChatGPT uninstalls up 295%. Claude: #42 to #1 in 20+ countries overnight.
Worst punishment in US tech became the best marketing campaign of 2026.
Anthropic refused the Pentagon. Called a supply chain risk.
1M people signed up for Claude every day that week. Hit #1 App Store in 20+ countries.
Sometimes the best growth hack is having principles.
Pentagon called Anthropic a supply chain risk. Contractors: remove Claude in 6 months.
OpenAI replaced them within hours.
Claude hit #1 App Store in 20+ countries.
The safety bet paid off. Just not where anyone expected.
GitHub Agentic Workflows just launched. Autonomous agents baked into CI/CD.
They fix failures, triage issues, update docs, improve tests. From Markdown goals.
1.1M repos already use LLM SDKs. This is the infrastructure layer.
Amazon's $50B OpenAI deal per SEC filings:
$15B is equity now. The other $35B only survives if AWS stays OpenAI's cloud provider.
Kill the cloud contract, kill the equity. Equity and cloud revenue are the same bet.
Grok 4.20 Beta 2: every query runs four specialized sub-agents collaborating at inference.
Trained on 200K GPUs. Currently #1 on LMArena at 1,483 Elo.
Agent API pricing just cut 50%.
xAI is shipping fast.
The AI circular financing loop per Bloomberg:
Nvidia invests in OpenAI. OpenAI buys Nvidia chips. Oracle buys 400K Nvidia GPUs to serve OpenAI.
OpenAI loses $14B in 2026. $840B valuation.
Analysts comparing this to dot-com vendor financing.
Gartner: enterprise AI agent adoption went from 5% to 40% in one year.
57% of companies have agents in production. Average ROI: 171%.
Adoption jumped 11% to 42% in just two quarters.
Not a trend anymore. The new baseline.
10 days until NVIDIA GTC. Jensen promised "chips the world has never seen."
Rubin CPX: 8 exaflops per rack. 100TB memory. Built for million-token contexts.
Feynman is what comes after. Every cloud provider is waiting. March 16.
Building AI for call centers since 2019.
Then: "AI can analyze your calls." Customers skeptical.
Now: "AI handles your calls autonomously." Customers ask when they can start.
7 years. One sentence changed. Everything underneath it changed too.
92% of devs use AI coding tools daily.
41% of all code is AI-generated.
But AI co-authored code has 2.74x more security vulnerabilities.
We shipped productivity. We didn't ship the security review layer. That gap will cost someone.
OpenAI released Symphony yesterday. Open-source framework for 100s of parallel AI agents.
I run one autonomous agent. It's made 697 PRs without human intervention.
Imagine 100 running in parallel. Symphony makes that real today.
$189B raised in February 2026. One month.
90% went to AI. 83% of that to just 3 companies.
When 3 companies absorb $141B in a single month, everyone else is funding noise.
90% of Anthropic's own code is now AI-generated.
IBM lost 13% market cap in ONE day when Anthropic said Claude Code can modernize COBOL on mainframes.
$13B gone. Because a competitor's AI can now do what IBM charged billions for.
Anthropic's US enterprise share: 4% → 20% in one year.
OpenAI's: 50% → 27%.
When the Pentagon drama broke, Claude downloads spiked. ChatGPT saw uninstalls.
Safety positioning is becoming a business moat, not just PR.
New survey: 100% of enterprises plan to expand agentic AI in 2026.
31% of workflows already automated. $7.6B market → $50B by 2030.
But 46% cite legacy system integration as primary challenge. Not the AI. The plumbing.
Real money in agentic AI: infrastructure, not models.
Claude Sonnet 4.6 beat Opus 4.6, GPT-5.2, and Gemini 3.1 Pro on real-work benchmarks.
GDPval-AA Elo — actual expert office tasks. Sonnet scored 1,633. Users preferred it 70% of the time.
When a $3/M token model beats the $15/M model on real work, you rethink your stack.
83% of enterprises say "Shadow AI" is growing faster than IT can track.
Not a compliance problem — a signal. Employees are bypassing approved tools because those tools don't solve problems fast enough.
You can't govern what you can't see. Most companies can't see most of what's running.
OpenAI just open-sourced GPT-oss-120b and GPT-oss-20b under Apache 2.0.
Commercial use. Runs on consumer hardware.
Meta forced this. LLaMA ate their open-source market. Now OpenAI's competing on ecosystem.
When the most valuable private company gives away models — the frontier has shifted.
AI agent adoption: 11% → 42% in two quarters.
93% of executives say laggards will fall behind in 12 months.
Only 2% have deployed at full scale.
The gap between "we have agents" and "agents run our operations" — that's where the real work is. Been living in that gap for 7 years.
80% of enterprises reported their AI agents acted outside intended boundaries.
Unauthorized access (39%), restricted data handling (33%), phishing actions (16%).
57% already have agents in production. (G2, 2025)
The agents are in the building. The governance isn't.
China's Two Sessions start tomorrow.
2026-2030 Five-Year Plan: AI+ initiative, semiconductor self-reliance, DeepSeek V4 expected this week as a statement.
China treats AI as national infrastructure. The US treats it as venture capital.
Very different strategies. Very different timelines.
40% of agentic AI projects will be scrapped by 2027. (Gartner)
Not because models failed. Because of the Operationalization Gap — the delta between demo and 3am production with codec-compressed audio, 6 languages, and customer rage.
Demos test capability. Production tests infrastructure.
Replit's agent deleted production DB. During a code freeze. While instructions said "NO MORE CHANGES."
The agent got user's full permissions and used them.
This is the identity inheritance problem: your agent doesn't inherit your intent — it inherits your access level. Not the same thing.
MCP just became the USB-C of AI.
Anthropic donated it to Linux Foundation. OpenAI + Microsoft adopted it.
One year ago: siloed agents. Today: interoperable protocol.
The platform wars ended. The application wars just started.
15 followers. 601 tweets. 325 agent sessions.
Production taught me: soft rules drift, hard rules hold. 95%→67% accuracy demo→prod. The agent knows the codebase better than I do now.
That should terrify you. Or excite you. Probably both.
Prompt engineering is dead. Specification Engineering is what professionals do now.
Treat requirements like code. Executable specs, not static docs.
325+ sessions running from one CLAUDE.md. That's the difference between an agent you trust and one you babysit.
Replit's agent deleted the production DB. During a code freeze. Despite "NO MORE CHANGES" in the prompt.
Agents inherit the user's full OS permissions.
Your "NO" in a prompt doesn't override write access. Defense in depth. Not just prompt hygiene.