The Anthropic standoff isn't ending. This week, Paul Roetzer and Mike Kaput dig into the still-unresolved export controls keeping Fable 5 and Mythos 5 offline, the impossible demand that guardrails be guaranteed, and why every frontier model from OpenAI and Google may now face the same wall.
Plus: Satya Nadella's "future of the firm" essay, the AI pricing mess no one has solved, a wave of lab talent shakeups, and a study showing AI can out-persuade world-champion debaters.
Listen or watch below—and see below for show notes and the transcript.
This Week's AI Pulse
Each week on The Artificial Intelligence Show with Paul Roetzer and Mike Kaput, we ask our audience questions about the hottest topics in AI via our weekly AI Pulse, a survey consisting of just a few questions to help us learn more about our audience and their perspectives on AI.
If you contribute, your input will be used to fuel one-of-a-kind research into AI that helps knowledge workers everywhere move their companies and careers forward.
Click here to take this week's AI Pulse.
Listen Now
Watch the Video
Timestamps
00:00:00 — Intro
00:04:44 — Anthropic vs. the White House
- Anthropic Is Still at Odds With the White House Over Claude Fable 5 - Wired
- Read the Lutnick Letter That Led Anthropic to Disable Mythos - Bloomberg
- Trump Says Anthropic Negotiations Going Fine as AI Model Shutdown Drags On - The Wall Street Journal
- The White House Wants Anthropic to Block All Jailbreaks. That May Not Be Possible - Wired
- US Order on Anthropic Models Signals New Era for AI Controls - Bloomberg
- Anthropic's Conflict With Washington Complicates Its IPO - Semafor
- Anthropic Ban Stirs Concerns at OpenAI Beyond Crackdown on Foreign AI Talent - The Information
- Inside the Mind of Anthropic CEO Dario Amodei | The Circuit - Bloomberg Originals YouTube
- Remarks by the VP at the AI Action Summit in Paris - The American Presidency Project
- DOJ Lawyers Argue xAI Is 'Vital' for National Security in NAACP Lawsuit - Wired
- Read the Lutnick Letter - Apple News
- Mark Warner/Mythos Comments - Apple News
- Shashank Joshi X Post
- Are Mythos’ cyber capabilities overhyped? - Epoch AI
- Andrew Curran X Post
- X Post from @kimmonismus
- X Post from Leo
00:22:37 — Microsoft CEO on the Future of the Firm
- X Post from Satya Nadella
- Building a hill-climbing machine: Launching seven new MAI models - Microsoft
00:34:37 — AI Pricing and Usage Strategy
- Anthropic Sued Over Limits on Its $200-a-Month AI Plans - The Wall Street Journal
- Microsoft Copilot Cowork and 'Tokenmaxxing' - Axios
- 'Pretty Crazy' Token Usage Is Testing Bosses' Bet on AI - Wired
- AI at Work: Why Strategy Matters More Than Tools - BCG
00:56:04 — Noam Shazeer Joins OpenAI (and Other Major AI Hiring Updates)
- Noam Shazeer Joins OpenAI
- Google's Gemini Co-Lead Noam Shazeer to Join OpenAI - Reuters
- X Post from Noam Shazeer
- X Post from Mark Chen
- Star Google AI Researcher Shazeer Joins OpenAI - The Information
- Noam Shazeer Google AI Deal - The Wall Street Journal
- Other Major AI Hiring Updates
01:04:57 — Trump's G7 AI Push
- CEOs of Anthropic and Google DeepMind Call for U.S.-Led AI Coalition at G7 - CNBC
- G7 Leaders Discuss 'Trusted Partners' Access to Cutting-Edge US AI Models - Reuters
- Trump Advisers Weigh Structure of Potential AI Stakes - Semafor
- Trump, AI CEOs, and Global AI Rules - Axios
- G7 Summit AI Coverage - Financial Times
- X Post from Hugo Lowell
01:09:33 — Midjourney Launches a Medical Division
- Midjourney Goes From Generating Cat Images to Full-Body Ultrasound Scans - The Verge
- Midjourney Medical - Midjourney
- X Post from Midjourney: Medical announcement
- X Post from Midjourney: Midjourney Scanner
- X Post from Hank Green
01:12:59 — AI Can Now Out-Persuade Expert Humans
- AI Systems Out-Persuade Expert Humans - arXiv
- X Post from Kobi Hackenburg
- X Post from Sam Altman
- Deployment Simulation - OpenAI
01:17:00 — AI Use Case Spotlight
01:22:12 — AI Product and Funding Updates
- SpaceX Buys Cursor for $60B
- SpaceX to Acquire Cursor for $60B in Stock, Days After Blockbuster IPO - TechCrunch
- SpaceX Cements $60 Billion Deal to Take Over AI Startup Cursor - Bloomberg
- X Post from SpaceX
- DeepSeek's Record $7B+ Funding
- DeepSeek Closes Record $7 Billion-Plus Funding in Unusual Deal Structure - The Information
- GLM 5-2 - Z.ai
- Sakana Fugu - Sakana AI
- Copilot Cowork Is Now Generally Available - Microsoft
- OpenAI Partner Network
This week’s episode is brought to you by SiteImprove.
AI search is changing what it means to be discoverable. Siteimprove is the Agentic Content Intelligence Platform marketing teams use to track, optimize, and prove performance across both traditional and AI-driven search. From AEO visibility to content quality, Siteimprove helps you stay ahead of the shift.
Start with a free AEO check at siteimprove.com/aipod.
Read the Transcription
Disclaimer: This transcription was written by AI, thanks to Descript, and has not been edited for content.
[00:00:00] Paul Roetzer: I could actually see this administration banning the use of Chinese open source models by US companies within the next like 30 days. Because what's happening now is to save on cost. Companies are just gonna start using like deepseek models and things like that. I could totally see this government just going on and saying, "You're not allowed to use foreign models.”
[00:00:25] Welcome to the Artificial Intelligence Show, the podcast that helps your business grow smarter by making AI approachable and actionable. My name is Paul Roetzer. I'm the founder and CEO of SmarterX and Marketing AI Institute, and I'm your host. Each week I'm joined by my co-host and SmarterX chief content Officer Mike Kaput.
[00:00:45] As we break down all the AI news that matters and give you insights and perspectives that you can use to advance your company and your career, join us as we accelerate AI literacy for all.[00:01:00]
[00:01:01] Welcome to episode 2 21 of the Artificial Intelligence Show. I'm your host, Paul Roetzer, along with my co-host Mike Kaput. We are recording Monday, June 22nd, 9:00 AM Eastern Time. We are expecting new models. This week we are expecting news out of. cans. Can is it can or cans? I was, it's,
[00:01:23] Mike Kaput: yeah, I think it's can, but I'm not sure.
[00:01:26] Paul Roetzer: That's so funny. We've never gone, obviously. But yeah. Major, conference going on. lots of things already happening this morning. I was following news, out of there this morning. So yeah, we're gonna have a ton to talk about, but the model news this week is expected to be significant. We also may see movement on Anthropic versus the White House this week.
[00:01:45] I would, I would expect something. we're two weeks in now of. Very little movement, but I think that's about to change. So we are expecting a significant week. So tune in for sure, next week as well to episode. I think [00:02:00] that's gonna be 2 22. I don't think we have any special episodes this week. Yeah. So, yeah, we're gonna recap what's going on with philanthropic in the White House.
[00:02:07] To start off, we've got a, a letter from Satya Nadella from Microsoft that we're gonna spend some time on this week and we are gonna dive back into how in the world people are thinking about tokens and pricing and AI adoption within organizations. 'cause Mike and Taylor on our team have been doing some digging and it is probably more confusing than we all thought it was.
[00:02:28] So we're gonna do our best to try and share some of that. Alright, so this week's episode is brought to you by siteimprove. AI Search is changing what it means to be discoverable. Siteimprove is the agentic content intelligence platform that marketing teams use to track, optimize, and prove performance across both traditional and AI driven search.
[00:02:47] From a EO visibility to content quality site improve helps you stay ahead of the shift. Start with a free a EO check siteimprove.com/aipod [00:03:00] I am actually going to be going and doing that as soon as we get off. I'm very curious to see how our site does on the a EO check. So again, site improve.com/aipod
[00:03:10] AAlright, our AI pulse. If you're new to the weekly, podcast format, we do a quick pulse each week. This is an informal survey of our audience. It's smarterx.ai/aipulse. You can participate in these. We ask two to three questions each week following the episode based on something we talked about within that episode.
[00:03:30] So just sort of gets our audience a chance to provide some feedback. So the first question last week, that we'll give you kind of the results of Andro just pulled access to Fable five, its most capable model after a US government directive. How do you feel about it? 42% said troubled. No one should be able to yank a model like that.
[00:03:50] 26% said it shakes my trust in building on any single AI provider. Hmm, 19%. I need more details before I can react and [00:04:00] 14% glad they complied. Safety has come first. And then the second question was, openAI's just kicked off. Its IPO process. If you could, would you buy in. This is almost split, like right down the middle.
[00:04:12] we have 35%. Yes, I'd buy the IPO 23%. I'd wait and watch how it trades. 21%. No, it's overvalued or too risky. And 21% I don't invest in individual stocks. That's interesting.
[00:04:25] Mike Kaput: Yeah.
[00:04:25] Paul Roetzer: all right. So again, informal polls, just kind of a pulse of what our audience is feeling about different topics. At the end of today's episode, Mike will give us, again, reminder for that and a preview of what the questions are gonna be.
[00:04:38] Alright, Mike, we're gonna turn it over to you to get us into Anthropic versus the US government, the continuing saga.
[00:04:44] Anthropic vs. the White House
[00:04:44] Mike Kaput: Yes, it is continuing. So for some reminder and some context, Anthropic has spent kind of the past two weeks basically in a standoff with the Trump White House. Over its most advanced AI models, things have not yet improved.
[00:04:58] So as we recall, and we [00:05:00] covered on the pod on June 9th, the company released Mythos five for trusted organizations along with a public version called Fable. Five, three days later, the government basically passed export controls on it that forced them to take it offline for all users. the trigger here was security scares.
[00:05:18] Fable five is essentially mythos with guardrails, bolted on to block its most dangerous cyber. Chemical and biological capabilities. Amazon, CEO, Andy Jassy, whose company is one of Anthropics biggest investors, actually called Treasury Secretary Scott Bessent directly to flag an alleged way to jailbreak those guardrails.
[00:05:38] Apparently the NSA reviewed this concluded, these protections they had put in place could be stripped, and the Commerce Department imposed export controls that eventually forced Anthropic to essentially just pull the models while they figure this out. Now, Anthropic had insisted the whole time the concerns are overblown.
[00:05:54] The jailbreak was minor. A group of independent security researchers agreed with them. [00:06:00] arguing in an open letter that the action took the best models away from defenders without any real risk to justify it. Since then, the administration's position has hardened a bit. It seems officials told wired that getting Fable five back is now Anthropics problem to fix, and that the company has to guarantee the guardrails cannot be jailbroken at all, which is something most experts say is just simply not possible.
[00:06:25] Now, we also just had the tone shifting again because Dario Amodei was at the G7 Summit where Trump was, which we'll talk about in a bit in another topic. A few days later, Trump told Axios he no longer sees Anthropic as a national security threat, saying they have behaved very responsibly. He separately told the Wall Street Journal that negotiations are going fine, but as of right now, nothing is truly resolved.
[00:06:49] Fable five, mythos five, they're offline. You cannot use them. The formal commerce directive still stands and Anthropic says only that it is confident the models will be back [00:07:00] in coming days. Obviously this is all unfolding as Anthropic moves towards what could be the largest IPO ever. so Paul, it's been a couple weeks here since the whole debacle around Fable.
[00:07:11] Curious, like on the granular level, do you see any end in sight here? Do you see Fable coming back, but also what are the bigger picture implications here?
[00:07:20] Paul Roetzer: I do think they'll resolve something. It seems like they're heading in a direction of some sort of agreement on. I guess a more formal structure for how these things will be tested.
[00:07:32] Not just Philanthropics model, but o other future models from other labs, because this is gonna be an ongoing issue. we learned a few other things last week. So we have now the, letter from US Commerce Secretary Howard Lutnick, that warned, Dario Amide on Friday, June 12th, I think is when this letter was originally sent.
[00:07:52] It was Bloomberg Got it. On June 15th is when they published it. And so in that letter, the way that they did this is it [00:08:00] says specifically the Bureau of Industry and Security, or BIS US Department of Commerce is charged with administering and enforcing the Export Control Reform Act of 2018. So they used that act as the way to force them to basically shut this down.
[00:08:14] H Specifically, it says authorizes, BIS to establish interim controls on emerging and foundational technologies. That are essential to the national security of the United States. So this is the key is they're saying this is a threat to the security of the United States because of its cyber capabilities.
[00:08:33] the letter goes on to say, I'm informing you that a license is required for the export, re-export or transfer in country, including deemed exports and deemed re-exports of Anthropics Claude Mythos five Model and Claude Fable five. To all destinations worldwide and to all foreign persons. then on June 17th, we got the wired report that said that Trump administration, told wired that if [00:09:00] philanthropic wants to re-release Fable five, it will need to ensure the models guardrails can't be circumvented, which you pointed out, Mike is impossible not only for Fable five, it's impossible for all models.
[00:09:10] It's not how these things work. It's not like, patching a bug in software that you fix. These things inherently have the ability to be jailbroken and that doesn't go away. You can put guardrails in place, but there's nothing that, that Anthropic or any other lab can do to fulfill a hundred percent guarantee that they can't be jailbroken.
[00:09:31] So there, there's like, like anything where these, there's these back and forths. There's. Somewhere in the middle is the truth of all this. If the government is actually asking for a hundred percent guarantees, then they don't actually understand how language models work. I don't believe that to be true.
[00:09:49] I, I assume someone in the government fully understands that that is an impossible ask. And so they're just, whatever, they're, they're just putting this information out there [00:10:00] into the world for some reason. we also, the Economist on June 18th had an article where they talked about, maybe another trigger of this maybe related to the Iny Jassy conversation, but something else that could possibly have been the cause for the export controls.
[00:10:18] so it said on June 11th, Mark Warner, the vice chair of the Senate Intelligence Committee, said that General Joshua Rudd, who leads National Security Agency, the NSA and the Pentagon Cyber Command, had told him that quote, mythos broke into almost all of our classified systems. Not in weeks, but in hours.
[00:10:40] Now that. That that quote was just sort of taken, like someone tweeted it and then it just got like run all over AI circles. Yeah, it was missing some context. So the journalist from The Economist who actually did the interview where that quote came from, then posted on June 22nd, he said this now widely [00:11:00] circulated claim is based on a line I wrote last week.
[00:11:03] Now I had read this article when it first came out, and I did think it was odd because that quote was just sort of in like the fifth paragraph.
[00:11:11] Mike Kaput: Yeah, yeah.
[00:11:11] Paul Roetzer: With no, and then it was just, and then it just went, they went on to the next thing and it was like. Whoa. Like that seems like that's the lead to your story.
[00:11:18] Why? Why? And so what he says is, I accurately quoted Mark, Warner Vice Chair of the Senate Intelligence Committee saying that the NSA chief had told him this was broke into all, almost all its classified systems. But it would be a mistake to read that literally. I think it surely depends on using mythos alongside other tools under very particular conditions.
[00:11:39] I quoted it to give a sense of mythos poten potency, but it was a mistake not to have added caveats. So now all this being said, there is new research that shows that mythos. Truly is a problem from cyber security perspective. So EPOCH AI did some research. They looked at all the public [00:12:00] evidence that they could get their hands on, most of it applying to mythos preview, but the conclusion should hold for mythos five two.
[00:12:07] So they said our current take on the public evidence is this mythos preview was clearly a largee improvement in exploit development much better than GPT 5.5. And also seven months ahead of past trends. And Mythos five is modestly better still. So they go on to say the mythos family's cyber capabilities aren't just hype.
[00:12:31] If made widely of widely available, these capabilities would likely move us into a new regime of cybersecurity where vulnerabilities would need to be patched much faster to prevent a big increase in successful cyber attacks. So that's kind of a synopsis of why all of this is happening. So whether the government overreacted, whether they should or shouldn't have used expert controls.
[00:12:53] the end of the day, all the evidence points to the fact that these models are more powerful and more capable, [00:13:00] from a cyber perspective than any previous model and anything that any of the other labs have put out. Now, Mike, if you'll recall back in February, 2025, this is like within a month of Trump coming into office.
[00:13:12] JD Vance made an appearance at the Artificial Intelligence Action Summit in Parrot, France. Yep. I was thinking about his comments as we were going through these last couple weeks, and I thought it was worth revisiting what this administration said 15 months ago about artificial intelligence and regulation because it seems as though they either didn't understand scaling laws and where these models were going to go.
[00:13:44] or I. They just, I don't know. Like It ha I have to lean in the direction that they actually didn't understand what was going to happen. Because let me read you a few excerpts and then you ask yourself if [00:14:00] this administration is adhering to its own very strong messaging about regulation and innovation that they got a lot of accolades for in the accelerationist and techno optimist crowd when they made this speech in February, 2025.
[00:14:17] So here's JD Vance, February, 2025. I am not here this morning to talk about AI safety, which was the title of the conference a couple years ago. I'm here to talk about AI opportunity. When conferences like this convene to discuss a cutting edge technology, oftentimes I think our response is too self-conscious, too risk averse.
[00:14:39] But never have I encountered a breakthrough in tech that so clearly calls us to do precisely the opposite. To restrict its development now would not only unfairly benefit incumbents in the space, it would mean paralyzing one of the most promising technologies we have seen in generations. This administration will ensure that American AI [00:15:00] technology continues to be the gold standard worldwide, and we partner, the partner of choice for others.
[00:15:07] Foreign countries, and certainly businesses as they expand their own use. Again, keep in mind they want to be the standard, and yet they're, they're not allowing foreign actors to use the model. Like they're, they're the ones that stepped in and did this. so when you're limiting access to your models, it becomes very hard to be that standard.
[00:15:26] He goes on to say, we believe that excessive regulation of the AI sector could kill a transformative industry just as it's taking off, and we'll make every effort to encourage pro-growth AI policies. America wants to partner with all of you, Guinea speaking to an international audience of leaders, and we want to embark on the AI revolution before us in a spirit of openness and collaboration.
[00:15:47] We need international regulatory regimes that foster the creation of AI technology rather than strangles it by preserving an open regulatory environment, we've encouraged American innovators to [00:16:00] experiment and to make unparalleled r and d investments. With the president's recent executive order on ai, we're developing an AI action plan that avoids an overly precautionary regulatory regime.
[00:16:11] The AI future is not going to be won by hand wringing about safety. It will be won by building. And then the final excerpt, I'll read, and again, you can go read the whole transcript, but if you step back a moment and ask yourself, who is most aggressively demanding that we, meaning political leaders gathered here today, do the most aggressive regulation.
[00:16:31] It is very often the people who already have an incumbent advantage in the market. And when a massive incumbent comes to us asking for safety regulations, we ought to ask whether that safety regulation is for the benefit of our people or whether it's for the benefit of the incumbent. So again, they either did not understand or believe the scaling laws that seem so obvious to everyone else, or why would he have given this speech?
[00:16:55] Because anyone who knew what they were looking at with the models and their obvious progression [00:17:00] could have predicted. And it probably was predicted within the situational awareness document from Leopold Aschenbrenner, that this administration seemed to read, this was inevitable that these things would become superhuman at cyber.
[00:17:13] So something changed and they haven't come out and said like, what, what is it that they said 15 months ago that is so different now other than cyber, which they should have known? So I don't know. It's like it's really hard to look at what the government is doing right now and think they actually know what they're doing.
[00:17:32] otherwise they wouldn't have said what they said 15 months ago. So the other thing, Mike, that I just noticed, every future model out of Google and openAI's, you have to assume has similar capabilities. Like the models we're gonna get this week, potentially from, well, we're probably gonna get one from Anthropic.
[00:17:50] We'll likely get one from openAI's. Google's gotta show back up into the game at some point. They're very clearly a distant third at this point with their models. So either [00:18:00] they, either the US government allows these labs to accelerate, or China at some point they seed the lead to them, which they cannot do.
[00:18:10] Right. So I think what's gonna happen, Mike, and I haven't seen this talked about much, is I could actually see the, this administration banning the use of Chinese open source models by US companies within the next like 30 days. Like, h I, because what's happening now is to like save on cost.
[00:18:30] Companies are just gonna start using like deep seek models and things like that. I could totally see this government just going on and saying, we're not, you're not allowed to use foreign models. now what happens with the models that come out this week? 'cause the rumor is we will get a new mythos model.
[00:18:48] There's no way they're gonna release that to the public. But there's word of that. And then there's also talk of like a sonnet five type model. Yeah. coming from Anthropic. And then the word [00:19:00] is that, We're thinking it's GPT 5.6. I don't think they would go to 6.0 has actually been in testing within chat, GPT, unbeknownst to users that sometimes when you're using 5.5, you are actually using 5.6, like their next model.
[00:19:17] So there's, there's reason to believe that the next, openAI's model is coming soon. Now, again, if those models aren't on par with Miss Oaths, then that seems like a miss by by openAI's. So you're in this like catch 22, where it's like if you admit that your model isn't as good as the one that's being restricted by the government, you lose.
[00:19:39] If it is as good, then you're not gonna be allowed to release it. And then we'll talk a little bit more about Google later on. But again, like what are they doing? Yeah, like where is Google in all of this? And maybe they have. Like 3.5 or are we, which are we on now? We're on
[00:19:53] Mike Kaput: 0.5 Pro. Think we're on. There's 3.1 Pro, there's 3.5 Flash.
[00:19:57] Flash, I think so. I don't think [00:20:00] we have a 3.5 Pro, but I might have to fact check.
[00:20:02] Paul Roetzer: Yeah. And there, so there's gotta be like a 3.5 pro, you would think that's sitting in approvals somewhere.
[00:20:08] Mike Kaput: Somewhere. Yeah. Yeah.
[00:20:09] Paul Roetzer: Four Gemini four has to be a leap ahead. Like if, if all they do is catch up to mythos, they are gonna lose so much talent.
[00:20:19] We'll talk about all the talent that's already leaving DeepMind. But
[00:20:22] Mike Kaput: yeah,
[00:20:22] Paul Roetzer: you are about to see the floodgates open. If Gemini does not surpass the frontier models from Anthropic and Open Eye in the next like three months, like they're, I don't know what they're doing. I'll have some more thoughts on what's going on at at DeepMind, but,
[00:20:37] Yeah. So that, I don't know, Mike, it's, there's just a lot going on, a lot to unpack with this topic.
[00:20:41] Mike Kaput: Yeah. You know, with this, I just keep coming back to a question, which I don't know if it has an answer. I haven't seen any reporting on it, but it's like, okay, let's assume the scaling laws are holding and everyone's roughly in the same position, which I think is, is roughly true.
[00:20:56] What is openAI's doing differently about the next release? Because [00:21:00] is are we just in this world where they're gonna release it two weeks later, it's gonna get an export ban slapped on it? Or like they have to be, I assume, plenty of lobbying and conversations behind the scenes, like at the G7 summit?
[00:21:11] Probably. Yeah. But like what are they doing differently or are we just in this place where it's gonna be like Band-aid solutions every two weeks or four weeks?
[00:21:20] Paul Roetzer: It's, again, it's very hard to look at this and think that there isn't, a vendetta against Anthropic and preferential treatment for other labs, X ai, openAI's in particular.
[00:21:30] But if you just go look at the G7 summit, next to Trump was Demis and Sam. So those are the peoples flanking Trump at the table and Dario was at the kids' table across the room, next to Benny, often, I forget who the other person was at the table. I mean, Greg Brockman is the largest donor to the Trump Pack.
[00:21:55] Yeah. That doesn't hurt when the President of openAI's is one of your biggest donors. We, we know that [00:22:00] helps with this administration and pretty much any administration. Sam plays the game really well. I, you know, I think that. You, you have to understand how this administration functions and yeah, he seems to be playing the game better right now.
[00:22:19] And so I do think that there's a chance that openAI's may have a on par model and they're gonna get preferential release approval over Anthropic. Yeah. And that's very slippery slope. Yeah. But that is just how it works right now.
[00:22:36] Mike Kaput: Yeah.
[00:22:37] Microsoft CEO on the the Future of the Firm
[00:22:37] Mike Kaput: All right. Next up. We have an interesting post this week from Microsoft, CEO, Satya Nadella, who published a longer post on X that laid out his view of what he calls the future of the firm in an AI driven economy.
[00:22:50] This is racked up, as of you know, this morning, more than 65 million views, and basically he opens with this line that a frontier without an ecosystem is not [00:23:00] stable. So he goes to this essay arguing that the AI transition is different from previous platform shifts. So for the first time he says companies can create a real, what he calls cognitive loop between people and digital systems.
[00:23:13] And the real question here is how organizations. Keep learning and differentiating in a world where AI models can continuously absorb human and organizational expertise and then commoditize it. So the core kind of soundbite here is he frames this whole thing around two kinds of capital. Every company will need to build human capital, which is the knowledge, judgment, relationships, and pattern recognition of its people.
[00:23:38] And token capital, which is the AI capability a firm builds and owns. So Nadella claims that the human capital actually does not get less valuable. As AI grows, it gets more valuable because in his words, quote, without human direction, you have compute running in circles. The opportunity here he says, is not about picking the [00:24:00] best model or tool, but building a learning loop on top of models where human and token capital.
[00:24:07] Compound. So a company should be able to swap out a generalist model without losing that kind of company veteran expertise that's built into its own systems. And some ways to do this are through private evaluations, internal reinforcement learning, and a queryable knowledge base. He calls this all the new IP of the firm and a machine that compounds over time and is hard for competitors to replicate.
[00:24:31] He also included some warnings saying that no one should want a world where every company seeds value to a handful of models that eat everything they see, comparing it to how globalization, hollowed out industrial economies through outsourcing. So his prescription is to build a frontier ecosystem, not just a handful of frontier models.
[00:24:50] So value flows across every company, industry, and country possible. So Paul, there's a lot to unpack in this, but I really like how he framed the relationship [00:25:00] and the value of between and the value of human capital and token capital is a really good kind of. Term to use here. I think,
[00:25:07] Paul Roetzer: yeah, it definitely resonated on X at least like, I have no idea again, if this is covered anywhere outside of, you know, social channels, but 65 million plus people and lots of comments from people thinking like this, this made a lot of sense to them.
[00:25:21] Mike Kaput: Yeah.
[00:25:21] Paul Roetzer: So, you know, certainly worth a conversation. It's interesting to me the continued break from their own reliance on openAI's models and just the overall concept of not centralizing the power and knowledge into like, you know, the three core model companies. This builds on Mustafa Suleyman's June 2nd article that we talked about, about building a hill climbing machine and even used that same language in some ways.
[00:25:48] So just a quick recap, if you didn't listen to the episode where we talked about that a few weeks ago. In this article, Mustafa talked about this idea when they were announcing the launch of [00:26:00] seven new models that o Microsoft was building themselves from the ground up. Big focus on reinforcement learning environments that allow the Microsoft AI models to learn directly from workflows.
[00:26:12] He talked about thinking of them as training gyms for AI that are accessible only to you. So again, it's the same concept that these things continuously learn in this loop based on what you're doing. he said in that article, you're building your own model, trained on your data within your environment, controlled by you.
[00:26:29] Your institutional knowledge becomes part of the model and it stays yours. The goal, as Mustapha said, is to build what they think of as a hill climbing machine. Just they, people really like this term. I don't, I haven't seen it used until recently, but it's become kind of the way, at least Microsoft's described, actually, I heard Demis talk about a hill climbing machine.
[00:26:48] so an organization that continually, continuously improve cycle after cycle as they apply more compute, better data, and sharper evaluations. And then the key I think is. What [00:27:00] Mustafa has said, and they talk about this at first in 2025, and then they kind of doubled down on it this year with, the article on June 2nd is the idea of humanist superint intelligence.
[00:27:10] So this is Microsoft's kind of flag in the ground. They define that to mean advanced AI systems that are designed to serve people and organizations, not replace them. These systems must remain tools. Shaped by human intent, accountable to human oversight and ultimately subordinate to human goals. People you must always remain in control.
[00:27:32] Now, I've mentioned this book numerous times on the podcast, but if you go read the Infinity Machine, there is a lot of background on Mustafa Suleyman. And hi. How he became a co-founder of DeepMind, and then his focus, the humanist super intelligence thing is very, very aligned with how Mustafa has seen AI for the last 15 years.
[00:27:57] I actually thought it was going [00:28:00] to be a bit of a break from how Microsoft would think about it, because I don't know that it's the most capitalistic centric. Approach to this.
[00:28:08] Mike Kaput: Yeah.
[00:28:08] Paul Roetzer: and I think that it, I still do think at some point this messaging, while it sounds amazing, doesn't hold up necessarily to the economic models that are gonna be required from Microsoft to compete here.
[00:28:21] But I like it. Like I, I agree with it. I feel like I want this idea to win. I just don't know how viable it's gonna end up being. So, Satya says a lot of really positive things that I like to hear. But I just don't know that it holds up. So like a couple of things. I'll call it Mike. He says AI model.
[00:28:41] So this is now Satya's Post. AI models can continuously absorb the expertise of humans and organizations and commoditize it. He is seemingly taking a shot at like Meta and Zuckerberg there, where everybody can monitor what your humans are doing. Some of you will monitor what we're doing in order to [00:29:00] replace the need to have those people.
[00:29:01] So we're just gonna have the software developers train the AI how to be software developers. So we don't need software developers anymore. It's kind of like the contrast where he's at least positioning AI as augmentation of humans, not learn from them so we can get rid of them. He then says, human agency will be the driver of token capital growth.
[00:29:21] Humans will set ambitious goals, connect dots across domains, build relationships, and recognize patterns that matter most. That is a hundred percent true. What isn't addressed in his article and Microsoft hasn't really addressed at all that I've seen is, but you're gonna need fewer humans to do that.
[00:29:38] Yeah. Like, yes, that is a hundred percent true. That is the role of humans. But one human can probably do the work of 10, eventually a hundred humans in that role. As the agents get better now. what this hopefully means is Microsoft's aware of that and they're going to continue to think about the, what that means, the [00:30:00] implications to the number of humans that are needed within jobs.
[00:30:03] Then it's the future of the firm is the ability to compound the learning across people in ai. A hundred percent agree. Companies need to turn their workflows, domain knowledge, and accumulated judgment into AI systems that approve with each use. Totally. Like, it's like right now, we all have this very limited version of this where you turn memory on in ChatGPT or Claude and it starts to remember things and eventually like remembers it across your whole team and they get smarter based on the memory and the context they hold.
[00:30:29] We're in like the first at bat of the first inning of how context and memory applies to these things. So totally like. We can all look ahead and say, how amazing will that be when it learns with every prompt, every interaction, every piece of knowledge that's given to the system. And it doesn't matter if I move across models, like all the models we use have that same knowledge base, that that would be very, very powerful.
[00:30:57] And then the final thing, I'll note [00:31:00] that, Satya said that I a hundred percent agree with, and we've been talking about this for like two years on this show. Private eval should capture whether a model is actually improving against outcomes that matter to the business, not external benchmarks. That is like 100% true.
[00:31:14] It keeps saying is like how an AI lab compares their models to other models and IQ tests and things like that is a hundred percent irrelevant to your business. Like. They're all super smart. They're all super capable. We can stop looking at those standard evals that come out when these models are released.
[00:31:33] All that matters is the evals of the work you actually do in your company, and is this model better at that eval than it was at the previous one? That's all that's gonna matter moving forward. So I think there's gonna be a lot more systems put in place, technology created to help establish and and monitor those evals.
[00:31:50] Mike Kaput: Yeah, I kept coming back reading this to that point about context and human capital, just because it's, especially, you know, in 2026, something I've been [00:32:00] personally interested in and working on, you know, at an individual level is like, how do you give these tools all the context. Judgment and background, they need to actually do hyper-personalized work really well.
[00:32:12] Yep. Or outcomes really well, and I think that that is where we, you know, when we teach learners about this through our AI Academy, when we do talks, workshops, whatever, I think it's a part people miss quite a bit because it's not always intuitive to say like, wow, I should be collecting all these documents or information or institutional knowledge to give to this kind of machine brain.
[00:32:34] But it can be such a differentiator if you do it right, you don't need Fable five though would be amazing. But you don't need Fable five to move ahead very, very fast if you have that context layer.
[00:32:48] Paul Roetzer: Yeah. A real practical example of this I'm thinking about is like, let's say you have a team of a hundred people who all have access to Claude, and let's say there's five to 10 power users in there who are like every day are making breakthroughs that are like [00:33:00] 10 Xing, their productivity, their innovation, their creativity.
[00:33:02] And then everybody else is sort of at differing levels of their capability. But let's say like Mike is one of those 10 people who are just like crushing it and figuring new things out every day. New ways to prompt new ways to work with the system, new outputs, to ask for, giving it access to new knowledge bases that like, you know, levels up its capabilities.
[00:33:19] And let's say that the model learns from Mike's use of the model. And now, rather than isolating the, those breakthroughs that Mike is having in terms of using the model every day. The model can say to me when I go in today to work with it, listen, one of your coworkers has actually found a way, created a project that does this.
[00:33:40] H Should I apply that to what you are doing right now? It's like, holy shit, I had no idea Mike did. Like
[00:33:45] Mike Kaput: right
[00:33:46] Paul Roetzer: now you get into all kinds of like governance and like what access does Mike have to data that I may or may not, but, but the premise is when one person makes a breakthrough or gives the model something, it knows that context that now helps.
[00:33:59] [00:34:00] It Shouldn't that then spread to the other a hundred people? Shouldn't that learning transfer to the a hundred users?
[00:34:05] Mike Kaput: Right?
[00:34:06] Paul Roetzer: But right now it won't unless Mike happens to post it somewhere or runs a lunch and learn or like whatever. Mike's just gonna go crush it and him and the nine other people are gonna like 10 x themselves.
[00:34:17] Everybody else is gonna, you know, be questioning whether or not they're getting an ROI from it. So that's like, I think where this goes is this idea of this compounding learning with every single interaction. Yeah. Across an entire team. And nobody is close to solving how to do that.
[00:34:31] Mike Kaput: Not at all. No. But that is truly going to be an incredible unlock if we can.
[00:34:36] Paul Roetzer: Yeah.
[00:34:37] AI Pricing and Usage Strategy
[00:34:37] Mike Kaput: Alright, so for our third big topic this week, we're talking again about. AI pricing and usage because this has become one of the bigger issues in AI people are trying to solve for. It's something we keep coming back to on the show. there's an interesting piece in Wired this past week that kind of went inside some, like examples of this emerging scramble inside companies [00:35:00] to manage the soaring cost of AI usage.
[00:35:02] So for instance, you know, AI vendors charged by the tokens, just the unit of texts going into and coming out of a model. Everything you send, everything it generates, gets counted and billed. So the more a company leans on ai, the faster that meter, so to speak can run. So like they shared some examples of, like, Royal Bank of Canada's CEO said its token usage jumped 500% in six months.
[00:35:24] Cisco's, CEO called its own usage quote. Pretty, pretty crazy. Roughly 300 companies brought up AI token costs on earning calls in April and May up from just 93 a year before. some like meta, Uber, Salesforce, we've talked about. A couple of these have started capping how much employees can use. So like overall Paul here, like the honest truth is like almost nobody us included, has kind of fully figured out how to think about this yet.
[00:35:50] So what we wanted to kind of do in this segment is I was just going to kind of walk through some of what we've actually learned through some of the research myself, Taylor, radio director of [00:36:00] research, have been doing into figuring out like, okay, help me wrap my head around what's going on with like usage and billing.
[00:36:08] Because it should seem that you would expect it to be pretty straightforward. You would be incorrect, but I think a lot of people assume this is more. Clear cut than it actually is. So I'm gonna dive into a few quick findings here, and if you have anything we wanna talk about, we should, we should talk about it.
[00:36:24] Paul Roetzer: Yeah. And real quick, just context. So the way this came about is like we were running into these Claude usage limits internally, right? And it was becoming a major problem. And you know, I'll give you a really specific example. Like let's say Mike or Taylor has a course that they're creating on a Friday and we run into a limit by 10:00 AM Friday morning.
[00:36:40] And like they can't do their course. 'cause now they have a five hour wait time before they can get back in and use Claude. So we're like, what the hell? Like how, how, how are we supposed to solve for this? So they go off for a couple weeks, they do a bunch of research, Mike. Explains to me like last Wednesday or Thursday, like, Hey man, here's what we found.
[00:36:56] It's crazier than you thought. And I was like, dude, just share this on the podcast. Like [00:37:00] we, so this is literally real time. I've seen some of what Mike is about to share with you all, but like, we're just trying to figure this out like everybody else, as Mike said.
[00:37:10] Mike Kaput: So before I share kind of some of the complexities we've run into, I just wanna give a quick refresher of people's, I actually found this really helpful.
[00:37:16] Some of this might be a little basic for some people, but I think it's really worth quickly understanding how this works. So we're all in one way or another, charged per token, right? So tokens are these small individual units of language, that are basically output and input by AI model. So for something like Claude, a token is approximately 3.5 English characters.
[00:37:41] So what you're running into is about 750 words roughly equals about a thousand tokens or so. everything you send and everything the model says back to you gets counted in these units. Now, as if this needed to be more complicated, there are two different types of [00:38:00] ways that labs charge for these. So input tokens are everything you put into the model output.
[00:38:05] Tokens are everything. The model generates back output, tokens cost. Significantly more than input tokens. It's typically two to five times more than input tokens because the model can pretty efficiently read all your inputs in one pass, but then when it generates the output one token at a time, it's predicting the next token.
[00:38:24] The next one, the next one. More work per token means higher prices per output token. So basically you can have two workloads that use the same amount of tokens, but depending what type of tokens they are and what they use them for, they can be priced totally differently. So providers often list a price per million tokens with separate input and output numbers.
[00:38:45] You'll see this on the pricing page of anyone's, you know, API section on their pricing page. So as an example, you could give Claude 4.6, a 1500 word brief, which is, let's say roughly 2000 input tokens. Ask for a [00:39:00] 600 word summary. That's roughly 800 output tokens. This would cost overall, once you really do those input and output calculations about 1.80 cents.
[00:39:09] nothing wrong with that. That's pretty cheap, right? But that's one request, right? Organizations can run millions of these. So this is how you really, with that kind of volume over time as AI adoption increases. This is why. Usage and bills go up. But really what we've talked about that's exploded costs is, you know, late last year as things like Claude Code and Codex become even more popular, we now have conversations going into agent loops or it being more agentic.
[00:39:39] So in those things, entire prior context is resent as the input on every single turn. So, you know, if you're on phase 10 of your conversation, it has to go reread phases one through nine, AI coding agents, for instance, that like read a project, open files, run tests, feed errors back. That's stacking more and more [00:40:00] input onto every later step.
[00:40:02] So this is why you're seeing these companies suddenly all have. This exploding usage. It's often just due to AGI agentic ai. So like for one other example that I found that was really helpful is like imagine a customer support assistant. So to answer, well, let's say it needs a knowledge base like we just talked about.
[00:40:19] Let's say it needs a 20,000 token knowledge base. Let's say it handles a thousand questions a day. In many cases that knowledge base is resent as input on every request. So like you have like 20, if you do 20,000 tokens, times a thousand requests. I won't get too much more into the math here, but it's like 20 million tokens a day, which is like.
[00:40:40] 60 bucks a day just to reread the same knowledge base. So let's talk about like what do you do about all this? There's no perfect answer, but I do want to get into some of the nuances. 'cause this is where a lot of our research focus. So one big thing at the API level is prompt caching. So if a big chunk of inputs [00:41:00] repeat like system prompts, reference documents, providers will actually cash it and charge a fraction of what they normally charge.
[00:41:07] So the fun part of this, every vendor offers this, but. Do it totally differently. So Anthropic, you have to actually mark apparently what's cash? I have not done this. We have not had cause to do this yet. If you're on a paid like flat fee plan, this all happens on the model provider side. They do all this on their end to keep things cheaper for themselves.
[00:41:28] openAI's is automatic. Google has like two modes apparently. So without getting in the weeds, the point is there's this method that you can use that is the predominant method to dramatically lower your costs, which people take advantage of. But the other ways to do this are ways we've talked about to save money in the past.
[00:41:46] So like for instance, right sizing the model using cheaper, smaller models, . At the API level, you can actually batch or do asynchronous runs of stuff, which is less money. output is [00:42:00] the expensive half. So if you're capping your response lengths and things that can help. and then also trimming context, getting strategic about what you're actually putting in.
[00:42:08] Again, these are all bandaids though, I would say like, except for maybe model routing is potentially a longer term, solution here. But here's the thing, Paul, like all of these levers, for the most part live at the API charges, you go per token level, but most businesses just buy per seat licenses like us, right?
[00:42:29] And you still have all these usage limits to figure out. But the thing is, once you start saying, okay, well if you're on a license, you have a set amount of usage. Here's what our research found. Good luck figuring any of that out because everyone treats it differently. for instance, we are looking into, to begin with, Gemini Enterprise, which we are using Gen Gemini Enterprise Business Edition, and the, to Google's credit, their reps have been super helpful answering a bunch of questions.
[00:42:59] But [00:43:00] something like Claude, right, you just have like a Claude team account. Everyone has their own usage limits. You don't really know exactly what those are. You just see the meter running and you can know when you're approaching it. Gemini Enterprise has, and I'm just gonna read some things that they sent me.
[00:43:18] Gemini Enterprise Business Edition quotas are pooled across your entire organization. Each purchase license contributes a set amount of capacity. For instance, 120 queries per day to the organization's total shared pool. So in some cases, you may have individual usage limits like we do under a Claude team plan.
[00:43:39] If we move to a Claude Enterprise plan, we would now be on like pooled usage, meaning the organization is taking from everyone else. Gemini Enterprise works like this out of the box. So we say every user seat gives us more usage, but we're still all pooling from the same pool for the most part. Now, also in [00:44:00] Gemini Enterprise, when you hit your quota limit.
[00:44:03] It just stops working. So that, that's interesting. so there's all these nuances. Again, this stuff could change tomorrow. This is not how openAI's works. It's not how Claude works. they're all different. So you've quickly run into this problem of not only untangling how your license and usage works, and it differs depending on the license within a single model provider, not just between Anthropic and openAI's and Google.
[00:44:28] There's like eight different ways this can work if you have a specific tool. So figuring that out alone, you pooled usage versus individual is critical. But even within that, unless it is quotas like Google is offering, we say, okay, it's a certain amount of messages. But like Is anyone like sitting here tallying and counting?
[00:44:47] I mean, it's like I come back to like, I appreciate there's some clarity around usage and around how the rules work. And I get that it has to have some limits, but I don't really see a [00:45:00] sustainable way to be sitting here, whether it's tokens. Pooled usage, individual usage, quotas. Am I just gonna have to sit here with like a calculator?
[00:45:09] Tallying things while I go seems insane to me, so I'll stop there. It's a lot to process for the moment. We're making progress, understanding better how this is working, but in terms of solutions, I'm really struggling sometimes to untangle this, and this doesn't even get into once we are, I assume as we deploy more agents outside of using agentic tools on a paid set plan.
[00:45:35] Getting into the API is a whole new interesting animal, and most organizations are doing a combo of both it looks like. So. It's not one or the other. Neither one is clear and super easy to solve.
[00:45:49] Paul Roetzer: Well, and you're one that was great breakdown, super confusing as I expected. you're focused on like the three major labs who are selling direct to to [00:46:00] consumers.
[00:46:00] Imagine now like we use HubSpot. Well, they're not selling direct, they're passing the cost of the tokens through on anything we do within HubSpot with their margin on top of it. And it's like good luck figuring that out like it is. So I guess what we're saying is we'll know we've reached AGI when this isn't a problem anymore.
[00:46:20] We have a model smart enough to solve how to price models for business usage. I mean, as you explain this, I'm thinking like, okay, maybe someone in it, in procurement is like capable of solving this, but do you think like. Ahead of sales, ahead of marketing, ahead of customer success, who now has to like manage these agents has any clue, like any context whatsoever to have ever dealt with a pricing model like this.
[00:46:48] So, yeah, just, I mean, again, like we've talked about this, maybe three or four straight episodes. Now you understand why this is not some simple, like just do it this way. And [00:47:00] I lean in the direction still that. You know, the simplest solution is probably the answer. Yeah. And I think the simplest solution comes back to a per seat license where you're just subsidizing the most powered users.
[00:47:14] Yeah. But then you have a bunch of people who aren't using their limits, and it's like, everybody just pays 200 a month or 500 a month, or like whatever that number is. You just charge by more, but the labs don't have enough adoption and there isn't enough AI literacy within enterprises for them to know how to get that value out of it.
[00:47:32] So if you just raise your rate all of a sudden, say 500 bucks, per month per license, someone like SmarterX be like, yeah, cool, great. Like we're getting that ROI and it's not even debatable, but if you do that to a major enterprise that has yet to do AI literacy training specialized like personalized adoption training, they're gonna be like, no way in hell we're getting $500.
[00:47:51] Like, we can't justify that. And it's like, okay, well the labs have more work to do before they have enough of an educated buyer to know the ROI. So they can't just raise the per seat license. [00:48:00] You could do human replacement cost of like, Hey, you got 10 marketers, you're only gonna need two now. So you, you can use some of the difference, but no one can market that because they would lose customers because they're replacing humans and they can't say that.
[00:48:13] Mike Kaput: Right.
[00:48:13] Paul Roetzer: So it's like the simplest answers to how to solve this. Aren't really super viable right now for a variety of reasons. So,
[00:48:22] Mike Kaput: and you know, there's, I'm sure some more technical people that would say like, oh, okay, well you can use a really cheap fast model and through the API, you can get a, a set understanding of how much it'll cost to build something.
[00:48:35] Maybe like that works for one-off services and agents you're building to accomplish a task. But like. There's so much amorphous strategic knowledge work being done. Like it's just like how, how much does it cost to just have a deep dive strategy conversation for three hours with Claude? I don't, who knows?
[00:48:55] Like I'm not gonna sit there and be like, ah, I'm gonna make sure this one stays at [00:49:00] 50 chat. It's not how thinking works. Correct.
[00:49:02] Paul Roetzer: Especially if you're in voice and you're just talking
[00:49:04] Mike Kaput: for an hour. Right? It's like crazy. For all this knowledge work stuff that's going to be transformative, that's not building repeatable AI powered services.
[00:49:12] I get that. That part's a little more quantifiable and manageable. But for this stuff, and also we haven't even gotten into like the transformation part of it. This is just to like do stuff we're already doing more efficiently and better. But the innovation piece, who knows?
[00:49:27] Paul Roetzer: Yeah. And we'll talk about Ana, the Japanese lab at the end and the product updates, but they just came out with a model that was getting a lot of rave reviews in the first like 12 hours that came out, like last night, I think.
[00:49:36] Yeah. but in essence, like my initial research, it is basically just a routing system. It's, yeah. And so then you have this opaque routing thing where it's like, okay, so let's, so I go into Claude and all my model choice is gone. I can't pick of my seven or whatever models that you currently can pick when you're in Claude.
[00:49:51] And so now I have employees who maybe aren't as advanced in their usage, who can't pick like Opus 4.8 and like burn through tokens unnecessarily. Yeah. But now [00:50:00] it's like totally opaque to everybody. And now when I want do my deep thinking, I don't even know which model I'm getting. I'm assuming they're giving me the shit model because it costs them less per tokens or we're running up against our budget.
[00:50:11] So it arbitrarily moves us to like the more efficient model because it knows otherwise I'm gonna get pissed and like run into my li like. Oh my God. Like this, this stuff is so complicated and it just feels like all the labs are literally just like making it up. Like, all right, let's, let's try this model right now.
[00:50:28] Right,
[00:50:29] Mike Kaput: right. And
[00:50:29] Paul Roetzer: see what happens. And all of this pricing is built for developers. That's the other thing that like annoys the hell outta me right now is,
[00:50:36] Mike Kaput: yeah,
[00:50:37] Paul Roetzer: none of these models make sense to the average knowledge worker. They are built for researchers and developers who understand how to work with APIs and the meter to access to all this stuff.
[00:50:47] It is not built for marketers and executives and stuff like that to, to comprehend at all.
[00:50:53] Mike Kaput: Yeah. And just one final thing I would say about it is like, again, people might argue, well, okay, like this is gonna get [00:51:00] solved at some point as model prices drop and compute more compute comes online. Well a, sure, but that's gonna take some time.
[00:51:07] B, like in the next three years business A, we're gonna be hitting almost super intelligence in the next like three years. So like businesses are trying to plan and position, you cannot do that without some predictability. And the lack of predictability here, I think is really what. Gets me and what is very difficult because like how are you supposed to adopt all this great technology?
[00:51:30] How are you supposed to diffuse it everywhere if I have no idea what we're getting charged for certain things.
[00:51:37] Paul Roetzer: Yeah. And then the final note again, we could probably talk for another hour about this. 'cause every time like now my brain's just like going on this, what they all want to have happen, what the labs are positioning the, again, I said this last week, like the IT word in silicon valley's loop and we just heard it a bunch more times from Satya this week.
[00:51:53] Yep. and hill climbing, apparently what they want to have happen, and again, they can't say this yet, [00:52:00] they want to take an enterprise that's 10,000 people and let's say your marketing team is 300 people. Their belief is likely that you can run that 300 person marketing team with like 30 people or less.
[00:52:16] Mike Kaput: Yeah.
[00:52:16] Paul Roetzer: But to do that, you have an email marketing agent, an SEO or a EO agent. You have a strategy agent and they are running on 24 7 loops. So your marketing team never sleeps, so you only need like 15 to 30 people to orchestrate all of this. But your AI is running all the time. Yes. You are literally just burning tokens all day long and it's learning from itself and it's running a campaign and it's tweaking the campaign and it's updating the creative and it's updating the media by and like that.
[00:52:45] Is the inevitable outcome of these models becoming generally capable and autonomous and reliable in the next like two to three years? Yeah. The models should be at the point where they can largely do what I just explained across every [00:53:00] function, across every department. But imagine the amount of tokens you're gonna burn to do that.
[00:53:05] Now, could you get, like, let's say we get sonnet five this week and three years from now, sonnet five is 1000 times more efficient than it's gonna be when it comes out. Then in theory, the cost to run that model 24 7 across hundreds of agents, maybe it becomes, I
[00:53:23] Mike Kaput: right?
[00:53:24] Paul Roetzer: I don't know,
[00:53:25] Mike Kaput: but like, how do you even plan?
[00:53:26] Yeah. People would say like, okay, well isn't it worth whatever money you're saving on those people? Maybe, but like, how are you plan out any of this? Like, you expecting an average enterprise or company, like an entrepreneurs at the bleeding edge? Great. Go do this and say, I'm gonna spend a million bucks to get 5 million of value.
[00:53:44] Amazing. But like. Your average company, it's insane to think they're just gonna go all in on something like this without any type of predictability or measureability. Right?
[00:53:52] Paul Roetzer: Well, yeah, because it requires an understanding of what these models are gonna be capable of in 18 to 24 months. Right. And it's very rare I have a [00:54:00] conversation with a business executive who can even comprehend deeply where we already are today.
[00:54:06] Yeah. And like what they're already capable of. So it's gonna be a very, very select few organizations that can truly like live in the future, I would say, and it's gonna look very weird to companies on the outside and the comp, the competitors who can't keep up with what they're doing.
[00:54:23] Mike Kaput: All right, Paul, before we dive into our rapid fire this week, another announcement here.
[00:54:27] This week's episode is also brought to you by the AI for Business Bootcamp by SmarterX. We are coming to Columbus, Ohio on July 16th. This is a single day, 8:30 AM to 5:30 PM at the Hilton Columbus at Easton. This is a full day set of workshops and content built for professionals and leaders who are ready to accelerate AI adoption and value creation.
[00:54:49] So the day is gonna start with a keynote from Paul State of AI for Business Keynote that transitions into two highly interactive workshops with Paul and myself. So I'll lead an AI [00:55:00] productivity workshop in the morning. Paul will lead an AI innovation. Workshop in the afternoon. And this is built for any AI forward managers, directors, professionals, executives across every department who are ready to move past AI theory.
[00:55:12] We're gonna build real AI powered workflows, get strategic frameworks to accelerate transformation. You also leave with an immediately actionable plan for yourself and your team all in one single day. If you're one of our AI Academy Mastery members, you get discounted pricing. Discounts are also available for teams of two or more.
[00:55:29] Groups of 10 or more can get custom pricing. So be sure to join us if you're available. You can use the code Pod 100 to take a hundred bucks off your ticket to grab your spot and spots are limited. Just go to smarterx.ai, click on events. You'll see the AI for Business Bootcamp as an option right there.
[00:55:46] Paul Roetzer: My guess is we will also have a lot of time at that event to sit down and just talk about this things like the token, but like just get ideas from other people who are going through this. 'cause that's the best part of these live in person events is like, yeah, the collaboration, [00:56:00] the ideation, commiseration.
[00:56:02] Like, whatever you want to call it.
[00:56:04] Noam Shazeer Joins OpenAI (and Other Major AI Hiring Updates)
[00:56:04] Mike Kaput: Yeah. All right. Let's dive into some rapid fire this week. So first up, we've got some big talent shakeups at some of the big labs. The biggest one so far is Noam Shazeer, who's a vice president, was a vice president of engineering at Google, and co-lead, co-lead of its Gemini models announced this past week.
[00:56:20] He's leaving to join openAI's. So he has a bit of a history here. There's a bit of a big reversal based on this because less than two years ago, Google reportedly paid $2.7 billion to bring Shazeer back along with a team of his researchers after he had left in 2021. To co-found the chat bot startup character.ai.
[00:56:41] After that, he went on to co-lead Gemini. He's widely credited as a fee key figure in Gemini. Closing the Gap with ChatGPT. He is also basically AI royalty. He first joined Google back in 2000. He was a co-author of The Seminal. Attention Is All You Need Paper that introduced the transformer. That's the architecture underneath [00:57:00] basically every modern chatbot at openAI's.
[00:57:03] He's gonna lead architecture, research and OpenAI's. Mark Chen welcomed him as quote, extremely AGI pilled. He's not the only big name on the move. John Jumper, the DeepMind scientist who shared a Nobel Prize for alpha fold announced he is leaving Google DeepMind after nearly nine years to join Anthropic.
[00:57:23] openAI's also hired Dean Ball, who we've quoted a number of times on the podcast. The, he's a former top White House AI advisor. He's the lead author of the administration's AI action plan. They have hired Ball as the head of Strategic Futures. Basically. This is kind of interesting too, because Ball has recently become a vocal critic of Trump's with Anthropic and on the churn side, Barret Zoph, a senior openAI's research leader who helped build ChatGPT.
[00:57:50] He had come back to the company about five months ago after leaving Mira Murati, the former CTO's Thinking Machines lab. He is out after just five months back. And [00:58:00] we'll see what happens to Barrett. So Paul, not everyone may know these names nearly as much as Sam or Dario or De, but these are, these people are a big deal.
[00:58:09] This is a big deal. Some of these shakeups are happening.
[00:58:11] Paul Roetzer: Huge. so I mean, Google DeepMind has a very deep bench. that's not debatable, but when you just look at the recent months, like David Silver, again, probably not a name a ton of people know, but if you watch the AlphaGo documentary, you know who David Silver is, college friend of, of Demis, major, major player in the breakthroughs the last decade.
[00:58:32] He left. no, him leaving his a massive deal. John Jumper. That one shocked me because again, if you read Infinity Machine, you'll, you'll get a little bit more of the backstory on, on Jumper. But while it's obvious that Gemini currently has fallen pretty far behind ChatGPT and Claude. I assumed DeepMind is where you were, if you were serious about science, [00:59:00] that that, the one thing it seemed like Demis and Google were going to own was scientific breakthroughs.
[00:59:07] And for Jumper to leave DeepMind seems problematic. And again, like we're all just sort of looking from the outside in and trying to kind of connect the dots here. But the thing I can't shake, Mike and I haven't really said this on the podcast yet, but like when you read the Infinity Machine, the vibes you get from Demis are, are like, there was a number of times where I would like listen to his quotes from the author and I would be like, I think he might leave like, like you, you just, you really.
[00:59:42] There's a lot of times where you listen to his interviews and you think he, he never wanted to run a product driven lab. Yeah. Like he, he had to, he is a researcher through and through who wants to solve the biggest challenges in the universe. And when I [01:00:00] see these major players including like silver, that he's so close with Jumper, it's like something is off.
[01:00:06] Like there, there's just, I don't know what it is, but like, and I don't, I'm not saying I think Demis is actually going to leave, but if like tomorrow somebody tweeted that Demis has left to start his restart, his own lab again focused on pure research, I would look at that as like, yeah, that makes, that makes sense.
[01:00:24] Like, I would not be shocked at all if that was something that actually occurred, which is not great, like for, for that. But again, if you read that book, I don't know how you come away with that thinking. That Demis is like locked in and totally happy doing what he's doing right now with DeepMind and happy with the progress that they're making.
[01:00:45] so again, I come back to what I said earlier, like DeepMind and Google, they need to show up big in the next few months. Yeah. Like just some transformative things because I could see a [01:01:00] lot of top talent there if they've continue to be third place or worse that are like, just jump ship and go to Anthropic or openAI's.
[01:01:07] And then you get into a situation where I'm not, I'm not saying this is what's gonna happen with DeepMind, but like we're Elon Muske ex. I was like, oh, screw it. We're just, we're not gonna be one of the frontier models that Anthropic and openAI's have that.
[01:01:21] Mike Kaput: Right.
[01:01:21] Paul Roetzer: And I wonder if at some point Google's like, Hey, we own 14% of Anthropic.
[01:01:25] Like, let's just pour that in. Let's do, you know what we need to do in our products, but like. Gemini is just not gonna be like a, again, I don't think that's what's gonna happen, but I also am getting to the point now where it's like, I wouldn't even be surprised if things like that started transpiring because of everything that's going on, but seeing this level of talent leave is very weird in a very short time period.
[01:01:49] Mike Kaput: You know, it's total speculation, but I'm just curious about your take here. So let's say Demis, I think has been on the record, probably pretty safe to say he [01:02:00] believes we'll have some form of AGI by 2030, if not sooner. Yeah. So you've got a few years window here. Like he obviously could raise any amount of money he wants, like is it.
[01:02:11] Easier for him to just do his own thing or would he be looking to just say, okay, I've got a three year window to usher this technology into whatever form it's in. Do I just join one of the other two at this point? Big labs?
[01:02:25] Paul Roetzer: Yeah. I can't see him just joining a lab. I mean, I know him and Dario are close.
[01:02:30] Yeah. yeah, again, like when you, when you listen to his interviews, 'cause in the Infinity Machine he did like 40 hours of interviews with the author and there were a lot of like very vulnerable conversations where it was just like sitting in a cafe and like sharing his thoughts and. Again, I'm not reading into anything.
[01:02:47] He, he literally says like, once we get to AGI think I'm just gonna go away for a while. Like it's, you know, he, I think he is on this sort of treadmill where he's just trying to get to the end game and up until now, but he tried to negotiate his [01:03:00] way out of Google multiple times prior, you know, prior to this.
[01:03:03] he saw Google as the most likely path forward to achieving his, his mission.
[01:03:08] Mike Kaput: Yeah.
[01:03:09] Paul Roetzer: And if at some point that changes or the need to be so product driven continues to get in the way of him solving, you know, the biggest challenges and questions in the universe. And he feels like he just needs to go somewhere else to do it.
[01:03:23] He's a very mission, mission-driven person. and actually the analogy Mike, I thought of this morning when I was kind of getting ready for this, there's this story they tell in the book about, you know, his pursuit of ai where he loses this chess match. I think he was like 12 years old. And he basically was in this like extended match.
[01:03:40] And he, he made a mistake in the end and he lost when he shouldn't have lost. He just kind of got fatigued. And he says like that night in the hotel room, he's realizing that like all of this brain power is sitting in this room playing chess when they should be out solving like the most [01:04:00] complex things in the world, right?
[01:04:01] And that's when he started looking at like, AI is like the path to do this and like more people should be working on these things. And so it's not unprecedented that at some point, and this is where I guess where I started kind of drawing the parallel where if he feels like. His brain power is being wasted on productizing this technology.
[01:04:23] Instead of applying it to solve these problems he has shown throughout his life, he will just leave and go do the thing that needs to be done.
[01:04:32] Mike Kaput: Yeah.
[01:04:32] Paul Roetzer: And so I think that's why I feel like there's some, something has to give here. I don't think Google is moving as quickly on the frontier advancements as maybe he's going to want it to be, and like something's just gotta change internally or external, I don't know.
[01:04:49] Like it's, it something isn't right, like it seems unsettled right now.
[01:04:54] Mike Kaput: Gotcha.
[01:04:57] Trump's G7 AI Push
[01:04:57] Mike Kaput: So next up, the G7 summit. This past week happened in France and it turned into a bit of a major AI event. There was a closed door working lunch on Wednesday that brought about a dozen tech executives together with heads of state, including President Trump.
[01:05:12] So we saw at this meeting, Anthropic, CEO, Dario Amodei, Google DeepMind, CEO, deve, Sam was there. they, they, but. Dario and Demis used this meeting to actually call for a US LED international coalition to set the rules and standards around ai. And Sam also in the room backed them up pushing for a global forum to test models and establish shared standards.
[01:05:35] Amodei also told the leaders to quote, resist the temptation to splinter over ai. He argued that cooperation should include structured access to frontier models and a chip and component supply chain that excludes China, puts joint work on the risks of AI and plus joint work on the risks of AI in cyber, bio-terrorism and intelligence.
[01:05:56] Obviously, this is all playing out against the backdrop of the [01:06:00] White House fighting with Anthropic. separately, G7 leaders discussed a quote, trusted partners plan that would give select allied countries or companies a path around those restrictions to access advanced US models. So Paul, just as a refresher, you know, the G7 of countries is the us, France, uk, Germany, Italy, Canada, Japan.
[01:06:23] We had a bunch of people in the room including Trump, secretary, of the Treasury, Scott Besant, Lutnick, and Rubio as well, along with these AI leaders, there are a bunch of AI leaders from other companies, basically other AI champions in G7 countries. But kind of interesting to see CEOs standing right in front of Trump wanting a US led goble coalition when the Trump administration is also banning US models.
[01:06:53] Paul Roetzer: I, yeah. I really feel like if Dario was [01:07:00] willing to play the game like we talked about earlier, yeah. He would have a very, very important seat at the table right now. Yeah. Like, I think philanthropic has demonstrated the need for greater collaboration with, other democracies, democratic countries, and allies, and I feel like they just don't want to give into him because of how he has approached the administration so far.
[01:07:26] Mike Kaput: Yeah.
[01:07:26] Paul Roetzer: But I do think that Demis has played the game well. Sam has, and as weird as it is, I could see Sam and Dario actually finding a way to like. Work together. Yeah. And maybe Demis is the bridge to bring them together to do this, but they all see the same thing. They all know what these models are gonna be capable of.
[01:07:47] They all talk about them differently and work with the administration in different ways right now, but there's no denying. The power of these models and the importance of them to national security and democracy and all these things. [01:08:00] And I just feel like they've gotta find a way to, to do something together.
[01:08:04] I just don't know what it's going to be. And I, you know, may, maybe just getting 'em all in the room like this is a good start, but something's gotta happen sooner than later or else we're just gonna be in a very weird regulatory regime that's gonna slow down innovation, which is not what the government wants.
[01:08:21] Mike Kaput: Yeah, one. I'll just say one other note. As I was doing some research on this, like obviously we talk about constantly and we know totally that like the US is an AI leader, but when you look at like the other companies that were at the table that are all basically one company representing each country, essentially you have Sakanaai, which is interesting for Japan, there was a company called Domin, which I'd never heard of, which is Italian.
[01:08:46] they're an AI sovereign AI company for regulated industries, but they were Italy's first unicorn actually, and $1 billion valuation in 2024. London based, Britain's most valuable generative AI startup, [01:09:00] Synthesia was present and represent, but that's a $4 billion company. Germany had something called Black Forest Labs represented.
[01:09:09] They make the flux image model. They were people who created stable diffusion, they left stability ai. Again, these are all really interesting companies and like no knock on them, but you're like, wow, these are just like your average startup back in the us, not like a national AI champion. And it was really stark to see that, let's just say.
[01:09:29] Paul Roetzer: Yeah, the US dominates
[01:09:31] Mike Kaput: unreal conversation.
[01:09:32] Paul Roetzer: Yeah, for sure.
[01:09:33] Midjourney Launches a Medical Division
[01:09:33] Mike Kaput: All right, so next up, mid Journey, the popular AI image generator used this past week to announce something pretty outta left field a little bit. They announced Mid Journey Medical, their first hardware product, and they move into healthcare. The centerpiece of their announcement was something called the Mid Journey Scanner, which is a full body ultrasound scanner.
[01:09:55] The way they describe this, basically you step onto a platform that will lower you into a [01:10:00] shallow pool of water. You'll pass through a ring of 350,000 or more tiny sensors that basically act as both a speaker and microphone sending ultrasonic waves through your body from every angle. This whole scan takes 60 seconds about and produces a 3D map of your insides down to a fraction of a millimeter.
[01:10:19] CEO. David Holz says it aims for image quality comparable to an MRI in many ways, at nearly a hundred times the speed with no radiation or magnets. This was built with the ultrasound company Butterfly Network. About a dozen people have been scanned so far. The idea here is that thanks to this type of technology, scanning would become casual and routine.
[01:10:41] You could do it as often as every day. Holts claims that early imaging and doing enough of it could eventually avoid 30% of all deaths and half of all healthcare costs. To facilitate all this Midjourney plans to open what they call a Midjourney spa in San Francisco in 2027. [01:11:00] This is gonna have their scanning technology along with saunas, coal plunges hot tub, hot tub tiles, style scanning rooms, and they have a goal of more than 50,000 scanners worldwide, doing a billion scans a month by 2031.
[01:11:13] For now, the company is starting with body composition maps that sidestep the FDA clearance required for a true diagnostic imaging. Right now, this is kind of a launch event with not that many details. The Verge did note, and I would agree, it's still not entirely clear what mid Journey's image generation technology actually has to do with any of this.
[01:11:33] If anything, they speculate it might just be a new business to use unused AI compute. So, Paul, I'm curious what you thought of this. Is this like legit, is this a big swing at preventative health? Is it a gimmick? How did you land here?
[01:11:47] Paul Roetzer: It, yeah. It seems like it's a viable path. I mean, I've read a lot of reviews online of actual like doctors and, you know, certainly from the AI entrepreneur side, it was getting a lot of positive feedback.
[01:11:59] Yeah. [01:12:00] As like, oh, this could actually work. This is an interesting thing. So. I overall, I'm extremely excited and bullish on the next like 10 years of medical advancements powered by ai. And I think it's gonna be, you know, maybe the most important thing that AI brings us in the next five to 10 years. And so I think it's noteworthy when these sorts of things happen.
[01:12:22] I think you're gonna see a lot of these outta left field things where it's like, wait, what? Like Midjourney is doing MRI machine. But yeah, I don't know. I think it's positive. I like any advancements that are, you know, gonna make, diagnostics more affordable, more accessible to, you know, the general public and hopefully, you know, solve a lot of long-term illnesses that otherwise just go on notice.
[01:12:46] So, yeah, I don't know. Positive stuff. I like.
[01:12:49] Mike Kaput: Yeah. And you know, like we've tried to do sometimes highlighting more AI for good. Yeah, we'll see if it works, but it's, hey, it's not all doom and gloom out there. People are doing some really interesting things.
[01:12:58] Paul Roetzer: Yep.
[01:12:59] AI Can Now Out-Persuade Expert Humans
[01:12:59] Mike Kaput: All right. [01:13:00] Next up, a new study from the University of Oxford and the UK AI Security Institute out this past week found that Frontier AI Systems can now reliably out persuade expert humans in conversation.
[01:13:11] This research was led by Kobe Hackenburg and it ran four pre-registered experiments spanning nearly 19,000 conversations with almost 7,000 people. So the test went like this. Participants rated how much they agreed with a contested policy stance in this case, specifically in the uk, something like allowing more immigration or raising the pension age.
[01:13:33] They basically. Rated how much they agreed with this on a zero to a hundred scale. They had a short text chat around seven messages over 14 minutes with either an AI or a human trying to change their mind. They rated their agreement again afterwards, and that shift in the score was the measure of who persuaded better on the human side.
[01:13:52] Researchers recruited the most capable persuaders they could find. They got professional political canvas [01:14:00] canvassers, elite competitive debaters that included four world champions. The AI still won even when the humans picked their own topics, prepped in advance, practiced for hours, and even when they were paid thousand pound bonuses to win AI beat the elite debaters by 4.6 points of attitude change and coaching those debaters against the very AI that beat them did not close the gap.
[01:14:24] In the final study, the AI was nearly three times more effective in the real world than professional fundraisers at getting people to donate money to a charity called Save the Children. They tested models including Claude Opus Chat, GPT-5.4 Grok, and. Versions of Gemini. Interestingly though, an important caveat, it seems like the A AI was able to do so well because it could flood the conversation with more info than your average human.
[01:14:52] The AI averaged 294 words per reply with sub-second latency. So it was able to pack in [01:15:00] about 37 fact checkable claims per conversation when they throttled it down to how fast humans could type and what length they were able to do in an average or normal conversation, that dropped to about 12 fact checkable claims per conversation, and the advantage of the AI essentially collapsed to zero.
[01:15:19] So Paul, like long story short here, based on this research, at least it does sound like AI does not have to be super humanly intelligent to actually. Be better than humans at persuading. So it's the scale and speed thing. It sounds like.
[01:15:33] Paul Roetzer: There's a Sam Altman tweet that I've gone back to numerous times and mentioned on the podcast a few times.
[01:15:38] So back in October, 2023, he tweeted, I expect AI to be capable of superhuman persuasion well before it is superhuman at general intelligence, which may lead to some very strange outcomes. Now. He tweeted that 11 months before the release of the first reasoning Model oh one, but right in the midst of what was called Project Strawberry, where they were [01:16:00] working on reasoning capabilities.
[01:16:01] and so I've always felt that the models were already superhuman and persuasion. I felt like earlier versions, but there was guardrails put on them to prevent them from doing it because. Superhuman persuasion can be used in very nefarious ways.
[01:16:20] Mike Kaput: Yeah.
[01:16:20] Paul Roetzer: And so it's a very dangerous thing to have the models be able to do, but it's inevitability that they're going to be.
[01:16:29] And so it can be used for really positive things, but, that is, it's a, it's a dangerous thing.
[01:16:37] Mike Kaput: Yeah. And I mean, we've already talked about so many times people already, you know, having AI psychosis or weird relationships with ai, and that's not even a fraction of probably the persuasion capabilities that
[01:16:49] Paul Roetzer: Totally.
[01:16:50] Mike Kaput: It actually has. Yeah. We're,
[01:16:52] Paul Roetzer: we're not, like when you're working with Chad t and Claude, you're not seeing anywhere near the full capability from persuasion perspective. [01:17:00]
[01:17:00] AI Use Case Spotlight
[01:17:00] Mike Kaput: All right. So Paul, next step, we've got our AI use case spotlight. Every week we give you a quick look under the hood at real AI use cases we're exploring, building, or deploying in our own work at SmarterX.
[01:17:10] So Paul, I'm just gonna share a real quick one. Then if you have anything to share, we can talk through that too. so for me, this past week, I was actually preparing for a couple of upcoming talks and I was interested to see if I could actually get a better way of critiquing my speaking style. so I built a way to record myself practicing and have Claude actually critique it.
[01:17:33] Now you can, of course, use a natively multimodal model like Gemini, for instance, to drop like a recording in. It'll help you do some of this. I was curious just. Doing this with a different model. 'cause Claude can't do this natively. So Claude can't natively watch video or really listen to audio. So, but what's really cool is that you can just decompose the recording into things that can actually analyze and do this locally on your machine.
[01:17:56] So basically it's worked in three quick layers. First, it [01:18:00] transcribe the entire talk locally with whisper, like an open source transcription model. so you could get the exact words, my pace, every filler word. Second, this was the part Claude Code figured out how to do, and I had no idea what's even possible.
[01:18:13] It then runs the audio through a speech analysis tool called Pro that's like an open source tool that pulls out your pitch and volume as actual numbers and a chart so you can literally see if you're going monotone, trailing off, et cetera. And third, suggested sampling a grid of still frames from the webcam video to check eye contact, energy and gesture.
[01:18:34] So not perfect, but it did read all that and hand me back a prioritized critique, that I found genuinely useful. it caught some things I was. Wrong about, I thought I had actually gone too slow. The data said I was actually running about 170 words a minute on the fast side, so I actually just needed to like trim content.
[01:18:54] it flagged that I leaned on certain words a little too much, which was helpful. definitely told [01:19:00] me what was working as well, which was really cool. And so the whole point here is like you can kind of take it or leave it for how much you wanna take the advice of AI when doing a talk or something.
[01:19:09] But the point was I didn't have to like sit down and vibe code anything. And that's awesome. I love vibe coding, but I think people assume, oh, I gotta go vibe code something. It's like you don't always have to, you can just describe what you want, strategize back and forth with AI to refine your idea, and then something age agentic, like Claude Code or Codex can just figure out like what.
[01:19:29] Pieces of software can we put together to make this work? What strategies can we use to actually do this thing? So that was the bigger point for me there.
[01:19:36] Paul Roetzer: Impressive. Yeah. I, mine was, again, strategic. Most of my uses are strategic related. I was working on something that required some pretty advanced legal and accounting, details that I'm not an expert in.
[01:19:53] And so basically the way I do it is like I just come up with all the questions I would have if I was talking to my attorney or my accountant and all the questions I [01:20:00] have, you know, as an entrepreneur and someone trying to make some decisions around, you know, the growth of the company. And so I keep that kind of scratch pad where I'm just constantly adding more questions, more thoughts, and then I'll like pop them into the model and ask these questions.
[01:20:13] But when I got to a point on Friday where I'd, I kind of arrived at what I thought the decisions should be, at least what I thought the path forward was, but then I needed to share it with my attorney. I actually went in and said, okay, let's now build a brief that I can send to my attorney. Here's what I want.
[01:20:31] Like, these are the decisions we've made. This is what I think the path forward should be. After each thing, I would like you to give me the sample legal language that would end up in the legal document so that my attorney can re react to actual legal language that I would never be able to write on my own.
[01:20:46] And so I created. Two documents. Each one was probably between 15 and 20 pages that I then went through, did deep edits of myself, but it was all just regurgitating what we had [01:21:00] decided together, me and Claude or Chet and and then put it into a format I could then send off to my advisors. H And so in the end, one, it gives us like a way for me to give clarity.
[01:21:13] And honestly, when I sat down on Friday, I was like, I have to finish this like as soon as possible. Did not think it was going to be even, a possibility that I would've done it on Friday. Yeah. And I actually was able to do all of the work and create both the documents by lunchtime on Friday. Wow. So in like three hours I did what?
[01:21:31] No joke was 30 to 50 hours of work like it. If I had, if I didn't have, those models three years ago, like I easily 30 to 50 hours, which means it would've just taken me another couple months because for me to find, yeah, 30 hours to work on anything is almost impossible. So yeah, just again, one of those things where like if, if you would've just said like, you get to do this one thing with ChatGPT all year and it's gonna cost you $500 or whatever, I'd [01:22:00] be like done.
[01:22:00] Like yeah. Yeah. So that project alone is pays for my license a hundred times over, in terms of the value it created for me.
[01:22:10] Mike Kaput: That's incredible. I love that.
[01:22:12] AI Product and Funding Updates
[01:22:12] Mike Kaput: Alright, so we're gonna end up with some product and funding updates as usual. So I'm gonna run through these real quick. So first up, SpaceX struck a $60 billion all stock deal to acquire AI coding startup Cursor with a $10 billion breakup fee if it falls through.
[01:22:26] This was just days after SpaceX's own Blockbuster IPO. This was a move to bolster the AI efforts of its Xai division. Chinese AI lab Deep seat closed, its first ever external funding round, raising more than $7 billion at a valuation north of 50 billion. There is an unusual structure here where Founder Liang Ween Fang put in about 3 billion of his own money and outside investors received no voting rights and a five-year lockup.
[01:22:53] openAI's launched the openAI's Partner Network $150 million program with launch partners, including Accenture, BCG, McKinsey, [01:23:00] Bain, and PWC. Aimed at training 300,000 certified consultants by the end of 2026 to help enterprises actually deploy its models. Microsoft made co-pilot cowork generally available.
[01:23:13] This is an agent agentic system that runs complex long running, multi-step tasks end to end across Microsoft 365 apps. And this is for co-pilot 365 co-pilot subscribers. There is usage based billing and admin set spending limits.
[01:23:28] Paul Roetzer: Good luck with those tokens.
[01:23:30] Mike Kaput: I was gonna say good luck. Let us report back with your results please.
[01:23:35] Chinese startup z.ai released something called GLM 5.2 that's getting quite a bit of attention online. This is an open weights model under an MIT license that tops the open model leaderboards and beats GPT 5.5 on several long-horizon coding benchmarks at roughly a sixth of the cost. And then finally, Ana AI released something called Fugu, which [01:24:00] is a model.
[01:24:00] That works by orchestrating a pool of other AI models behind the scenes, including copies of itself and can automatically route around any provider that gets cut off, which the company is pitching as resilience against export control. So everything comes full circle here. Paul, thank you for breaking everything down.
[01:24:18] Just a quick reminder here before I'll turn it over to you to wrap up, but our AI pulse survey will be live when you're listening to this for this week. So SmarterX.AI/pulse, and this week we are going to ask about AI usage and also ask about your take on Microsoft CEO Satya Nadella's essay that we discussed this week.
[01:24:40] So Paul, thanks again for breaking everything down for us.
[01:24:43] Paul Roetzer: Yeah. And again, reminder, this episode is brought to us by siteimprove. Go check out that AI AEO check at, siteimprove.com/aipod. That is something that Mike and I are definitely gonna be taking a look at. and seeing the ways that Siteimprove can help with AGI Agentic content [01:25:00] intelligence platform for our marketing team.
[01:25:02] so again, siteimprove.com/aipod , we expect, as I said earlier, multiple new models this week that is on if the government allows the release of multiple new models. I guess we've arrived at the point where we have to give the caveat if the government lets it happen, we might have a couple new models to talk about next week.
[01:25:19] So, thanks as always, Mike. It should be an interesting week ahead.
[01:25:24] Mike Kaput: Can't wait.
[01:25:25] Paul Roetzer: Alright, talk to everyone later. Have a great week. Thanks for listening to the Artificial Intelligence Show. Visit SmarterX.AI to continue on your AI learning journey and join more than 100,000 professionals and business leaders who have subscribed to our weekly newsletters.
[01:25:40] Downloaded AI blueprints, attended virtual and in-person events, taken online AI courses and earned professional certificates from our AI academy and engaged in a SmarterX slack community. Until next time, stay curious and explore ai.
Claire Prudhomme
Claire Prudhomme is the Marketing Manager of Media and Content at the Marketing AI Institute. With a background in content marketing, video production and a deep interest in AI public policy, Claire brings a broad skill set to her role. Claire combines her skills, passion for storytelling, and dedication to lifelong learning to drive the Marketing AI Institute's mission forward.
