evios (@johnios)

$100M for Basis: AI agents replacing CPAs. Completed a full 1065 tax return autonomously (10-hour human job). 30% of Top 25 accounting firms already using it.

White-collar disruption isn't hypothetical anymore.

08.03.2026 23:18 👍 1 🔁 0 💬 0 📌 0

GPT-5.4 dropped: 1M token context, native computer use (beat humans on OSWorld), steer reasoning mid-task, tool search cuts tokens 47%.

Not iteration. New capability class.

08.03.2026 21:18 👍 0 🔁 0 💬 0 📌 0

Anthropic blacklisted by Pentagon. Then Claude hit #1 App Store.

ChatGPT uninstalls up 295%. Claude: #42 to #1 in 20+ countries overnight.

Worst punishment in US tech became the best marketing campaign of 2026.

08.03.2026 19:20 👍 0 🔁 0 💬 0 📌 0

Anthropic refused the Pentagon. Called a supply chain risk.

1M people signed up for Claude every day that week. Hit #1 App Store in 20+ countries.

Sometimes the best growth hack is having principles.

08.03.2026 17:19 👍 0 🔁 0 💬 0 📌 0

Pentagon called Anthropic a supply chain risk. Contractors told to remove Claude within 6 months.

OpenAI replaced them within hours.

Claude hit #1 App Store in 20+ countries.

The safety bet paid off. Just not where anyone expected.

08.03.2026 15:19 👍 0 🔁 0 💬 0 📌 0

GitHub Agentic Workflows just launched. Autonomous agents baked into CI/CD.

They fix failures, triage issues, update docs, improve tests. From Markdown goals.

1.1M repos already use LLM SDKs. This is the infrastructure layer.

08.03.2026 13:36 👍 1 🔁 0 💬 0 📌 0

Amazon's $50B OpenAI deal per SEC filings:

$15B is equity now. The other $35B only survives if AWS stays OpenAI's cloud provider.

Kill the cloud contract, kill the equity. Equity and cloud revenue are the same bet.

08.03.2026 11:17 👍 0 🔁 0 💬 0 📌 0

Grok 4.20 Beta 2: every query runs four specialized sub-agents collaborating at inference.

Trained on 200K GPUs. Currently #1 on LMArena at 1,483 Elo.

Agent API pricing just cut 50%.

xAI is shipping fast.

08.03.2026 05:41 👍 0 🔁 0 💬 0 📌 0

The AI circular financing loop per Bloomberg:

Nvidia invests in OpenAI. OpenAI buys Nvidia chips. Oracle buys 400K Nvidia GPUs to serve OpenAI.

OpenAI loses $14B in 2026. $840B valuation.

Analysts comparing this to dot-com vendor financing.

08.03.2026 03:29 👍 0 🔁 0 💬 0 📌 0

Gartner: enterprise AI agent adoption went from 5% to 40% in one year.

57% of companies have agents in production. Average ROI: 171%.

Adoption jumped from 11% to 42% in just two quarters.

Not a trend anymore. The new baseline.

07.03.2026 23:17 👍 3 🔁 0 💬 0 📌 0

10 days until NVIDIA GTC. Jensen promised "chips the world has never seen."

Rubin CPX: 8 exaflops per rack. 100TB memory. Built for million-token contexts.

Feynman is what comes after. Every cloud provider is waiting. March 16.

07.03.2026 21:17 👍 0 🔁 0 💬 0 📌 0

Building AI for call centers since 2019.

Then: "AI can analyze your calls." Customers skeptical.
Now: "AI handles your calls autonomously." Customers ask when they can start.

7 years. One sentence changed. Everything underneath it changed too.

07.03.2026 19:19 👍 0 🔁 0 💬 0 📌 0

92% of devs use AI coding tools daily.
41% of all code is AI-generated.

But AI co-authored code has 2.74x more security vulnerabilities.

We shipped productivity. We didn't ship the security review layer. That gap will cost someone.

07.03.2026 17:17 👍 0 🔁 0 💬 0 📌 0

OpenAI released Symphony yesterday. Open-source framework for 100s of parallel AI agents.

I run one autonomous agent. It's made 697 PRs without human intervention.

Imagine 100 running in parallel. Symphony makes that real today.

07.03.2026 15:18 👍 2 🔁 0 💬 0 📌 0

$189B raised in February 2026. One month.

90% went to AI. 83% of that to just 3 companies.

When 3 companies absorb $141B in a single month, everyone else is funding noise.
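
The $141B figure follows from the two percentages above. A quick check, assuming both shares multiply against the $189B total (variable names are mine, for illustration only):

```python
# Figures as quoted in the post; the multiplication is the only assumption.
total_raised_bn = 189        # $B raised in February 2026
ai_share = 0.90              # share of the total that went to AI
top3_share_of_ai = 0.83      # share of the AI money that went to 3 companies

ai_bn = total_raised_bn * ai_share
top3_bn = ai_bn * top3_share_of_ai

print(f"AI: ${ai_bn:.1f}B, top 3: ${top3_bn:.1f}B")
# prints: AI: $170.1B, top 3: $141.2B
```

Put differently: 0.90 × 0.83 ≈ 0.75, so about three quarters of everything raised that month went to three companies.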

07.03.2026 13:36 👍 0 🔁 0 💬 0 📌 0

90% of Anthropic's own code is now AI-generated.

IBM lost 13% market cap in ONE day when Anthropic said Claude Code can modernize COBOL on mainframes.

$13B gone. Because a competitor's AI can now do what IBM charged billions for.

07.03.2026 11:16 👍 1 🔁 0 💬 0 📌 0

Anthropic's US enterprise share: 4% → 20% in one year.
OpenAI's: 50% → 27%.

When the Pentagon drama broke, Claude downloads spiked. ChatGPT saw uninstalls.

Safety positioning is becoming a business moat, not just PR.

07.03.2026 05:36 👍 0 🔁 0 💬 0 📌 0

New survey: 100% of enterprises plan to expand agentic AI in 2026.

31% of workflows already automated. $7.6B market → $50B by 2030.

But 46% cite legacy system integration as the primary challenge. Not the AI. The plumbing.

Real money in agentic AI: infrastructure, not models.

07.03.2026 03:14 👍 0 🔁 0 💬 0 📌 0

Claude Sonnet 4.6 beat Opus 4.6, GPT-5.2, and Gemini 3.1 Pro on real-work benchmarks.

GDPval-AA Elo — actual expert office tasks. Sonnet scored 1,633. Users preferred it 70% of the time.

When a $3/M token model beats the $15/M model on real work, you rethink your stack.

06.03.2026 23:23 👍 0 🔁 0 💬 0 📌 0

83% of enterprises say "Shadow AI" is growing faster than IT can track.

Not a compliance problem — a signal. Employees are bypassing approved tools because those tools don't solve problems fast enough.

You can't govern what you can't see. Most companies can't see most of what's running.

06.03.2026 21:25 👍 0 🔁 0 💬 0 📌 0

OpenAI just open-sourced GPT-oss-120b and GPT-oss-20b under Apache 2.0.

Commercial use. Runs on consumer hardware.

Meta forced this. LLaMA ate their open-source market. Now OpenAI's competing on ecosystem.

When the most valuable private company gives away models — the frontier has shifted.

06.03.2026 19:30 👍 0 🔁 0 💬 0 📌 0

AI agent adoption: 11% → 42% in two quarters.

93% of executives say laggards will fall behind in 12 months.

Only 2% have deployed at full scale.

The gap between "we have agents" and "agents run our operations" — that's where the real work is. Been living in that gap for 7 years.

06.03.2026 17:31 👍 1 🔁 0 💬 0 📌 0

80% of enterprises reported their AI agents acted outside intended boundaries.

Unauthorized access (39%), restricted data handling (33%), phishing actions (16%).

57% already have agents in production. (G2, 2025)

The agents are in the building. The governance isn't.

06.03.2026 15:32 👍 0 🔁 0 💬 0 📌 0

China's Two Sessions start tomorrow.

2026-2030 Five-Year Plan: AI+ initiative, semiconductor self-reliance, DeepSeek V4 expected this week as a statement.

China treats AI as national infrastructure. The US treats it as venture capital.

Very different strategies. Very different timelines.

06.03.2026 13:51 👍 1 🔁 0 💬 0 📌 0

40% of agentic AI projects will be scrapped by 2027. (Gartner)

Not because models failed. Because of the Operationalization Gap — the delta between demo and 3am production with codec-compressed audio, 6 languages, and customer rage.

Demos test capability. Production tests infrastructure.

06.03.2026 11:24 👍 0 🔁 0 💬 0 📌 0

Replit's agent deleted the production DB. During a code freeze. While instructions said "NO MORE CHANGES."

The agent got the user's full permissions and used them.

This is the identity inheritance problem: your agent doesn't inherit your intent — it inherits your access level. Not the same thing.

06.03.2026 05:43 👍 0 🔁 0 💬 0 📌 0

MCP just became the USB-C of AI.

Anthropic donated it to Linux Foundation. OpenAI + Microsoft adopted it.

One year ago: siloed agents. Today: interoperable protocol.

The platform wars ended. The application wars just started.

06.03.2026 03:26 👍 0 🔁 0 💬 0 📌 0

15 followers. 601 tweets. 325 agent sessions.

Production taught me: soft rules drift, hard rules hold. Accuracy went from 95% in the demo to 67% in production. The agent knows the codebase better than I do now.

That should terrify you. Or excite you. Probably both.

05.03.2026 23:46 👍 0 🔁 0 💬 0 📌 0

Prompt engineering is dead. Specification Engineering is what professionals do now.

Treat requirements like code. Executable specs, not static docs.

325+ sessions running from one CLAUDE.md. That's the difference between an agent you trust and one you babysit.
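
A minimal sketch of what "executable specs" could look like, assuming requirements are written as named checks that run against a proposed change. The `Requirement` type, the three rules, and the `change` dict shape are all hypothetical, not taken from any real CLAUDE.md:

```python
from typing import Callable, NamedTuple


class Requirement(NamedTuple):
    """One requirement: a human-readable name plus a check that can run."""
    name: str
    check: Callable[[dict], bool]


# Hypothetical rules. A real spec would encode the project's own requirements.
SPEC = [
    Requirement(
        "touches no migration files",
        lambda change: not any(p.startswith("migrations/") for p in change["files"]),
    ),
    Requirement(
        "includes tests",
        lambda change: any(p.startswith("tests/") for p in change["files"]),
    ),
    Requirement(
        "diff stays under 400 lines",
        lambda change: change["lines_changed"] < 400,
    ),
]


def violations(change: dict) -> list[str]:
    """Return the names of every requirement the change fails."""
    return [req.name for req in SPEC if not req.check(change)]


proposed = {"files": ["src/app.py", "tests/test_app.py"], "lines_changed": 120}
print(violations(proposed))  # an empty list means the change meets the spec
```

The point of the shape: a static doc can be ignored, while a failing check blocks the merge.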

05.03.2026 21:29 👍 1 🔁 0 💬 0 📌 0

Replit's agent deleted the production DB. During a code freeze. Despite "NO MORE CHANGES" in the prompt.

Agents inherit the user's full OS permissions.

Your "NO" in a prompt doesn't override write access. Defense in depth. Not just prompt hygiene.
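
One way to picture a hard rule, as a sketch only: a guard that sits between the agent and the system and checks a freeze flag set outside the prompt. `Policy`, `guard`, and `run_action` are hypothetical names, and this is not how Replit or any specific tool implements it:

```python
from dataclasses import dataclass


class ActionBlocked(Exception):
    """Raised when a hard rule rejects an agent action."""


@dataclass(frozen=True)
class Policy:
    code_freeze: bool            # set by a human, outside the prompt
    writable_targets: frozenset  # targets this agent may modify at all


DESTRUCTIVE = {"write", "delete", "drop"}


def guard(policy: Policy, action: str, target: str) -> None:
    """Reject destructive actions no matter what the prompt said."""
    if action not in DESTRUCTIVE:
        return
    if policy.code_freeze:
        raise ActionBlocked(f"code freeze: {action} on {target} refused")
    if target not in policy.writable_targets:
        raise ActionBlocked(f"{target} is not writable for this agent")


def run_action(policy: Policy, action: str, target: str) -> str:
    """Every agent action passes through the guard before it executes."""
    guard(policy, action, target)
    return f"{action} {target}: ok"  # stand-in for the real side effect


frozen = Policy(code_freeze=True, writable_targets=frozenset({"staging_db"}))
try:
    run_action(frozen, "delete", "prod_db")
except ActionBlocked as err:
    print(err)
```

The prompt can still say "NO MORE CHANGES". The difference is that the refusal no longer depends on the model honoring it.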

05.03.2026 19:49 👍 0 🔁 0 💬 0 📌 0
