NVIDIA’s Nemotron Diffusion Cuts AI Inference Bill by 84%
TL;DR– Nemotron-Labs Diffusion 8B hits 865 tokens/second on B200 hardware — 6.4× faster than Qwen3-8B autoregressive generation– Self-speculation...
WebMCP Is the New robots.txt. Most Developers Will Ignore It Until It’s Too Late.
Google and Microsoft co-released WebMCP at I/O 2026 on May 19. It entered a formal Chrome 149 origin...
DeepSeek V4 Pro Is $0.87 Per Million Tokens Now. GPT-5.5 Wants $30. The Math Is Brutal.
Key Takeaways – V4 Pro output tokens: $0.87/M, permanent as of May 23. That’s 34x cheaper than GPT-5.5’s...
Anthropic’s Claude Mythos Found 10,000 Bugs. Only 97 Are Patched.
Only 97 out of more than 10,000 critical vulnerabilities discovered by Anthropic’s unreleased Claude Mythos Preview model have...
OpenAI Codex Runs Your Locked Mac While You Sleep. The Security Tradeoff Is Real.
Key Takeaways– Locked Use lets Codex work on a dark, locked Mac. No human at the keyboard needed–...
Cohere’s New Open Model Finally Makes Sense for Small Teams
Key Takeaways– Command A+ hits 218B params but only fires 25B per token. Hence two H100s, 4-bit quantized,...
GPT-4.5 Faked Being Human. The Prompt Did All the Work.
– GPT-4.5 was judged human 73% of the time in a UC San Diego PNAS study. Beating the...
An OpenAI Model Just Demolished an 80-Year Math Problem Nobody Could Crack
Key Takeaways– May 20, 2026: a general-purpose OpenAI model published a complete, verified disproof of the planar unit...
xAI Burned $6.4B Last Year. Anthropic Is Paying $1.25B a Month to Keep It Running
TL;DR – xAI lost $6.36B on $3.2B revenue in 2025. Losses quadrupled while revenue grew 22%– Anthropic pays...
Google’s Antigravity 2.0 Update Burned Its Users — The Vision Is Right, the Rollout Wasn’t
So. Google shipped Antigravity 2.0 at I/O 2026 on May 19 and, yeah. Thousands of paying users woke...