Anthropic filed confidentially for an IPO this week, the first formal step toward going public on the heels of a $65 billion raise. Paul and Mike unpack what we know about Anthropic, how the race with OpenAI and SpaceX is shaping up, and why the same filing came alongside a warning that AI may soon build itself.
Plus: Trump's AI executive order, government stakes in AI labs, Uber capping Claude Code spend, Apple's AI reset at WWDC, and the week's model and product news.
Listen or watch below—and see below for show notes and the transcript.
This Week's AI Pulse
Each week on The Artificial Intelligence Show with Paul Roetzer and Mike Kaput, we ask our audience questions about the hottest topics in AI via our weekly AI Pulse, a survey consisting of just a few questions to help us learn more about our audience and their perspectives on AI.
If you contribute, your input will be used to fuel one-of-a-kind research into AI that helps knowledge workers everywhere move their companies and careers forward.
Click here to take this week's AI Pulse.
Listen Now
Watch the Video
Timestamps
00:00:00 — Intro
00:05:53 — Anthropic IPO & Talks Recursive Self-Improvement
00:25:52 — Trump's AI Executive Order & Government Stakes in AI Labs
- Promoting Advanced Artificial Intelligence Innovation and Security - The White House
- X Post from David Sacks
- X Post from White House OSTP
- Fact Sheet: President Donald J. Trump Signs Historic Directive on AI in the National Security Enterprise - The White House
- National Security Presidential Memorandum/NSPM-11 - The White House
- Trump to meet with artificial intelligence companies on government profit share plan as soon as next week - Politico
- Trump Signals Interest in US Owning Stakes in Top AI Labs - Bloomberg
- Senior U.S. Officials Eye Government Shares in AI Giants - Notus
- What's inside the House draft bill to regulate AI - Axios
- Bernie Sanders: A.I. Is a Public Resource. You Should Own Half of It. - The New York Times
00:37:34 — The Soaring Cost of Intelligence, Part 2
- Uber Caps Usage of AI Tools Like Claude Code to Manage Costs - Simon Willison's Weblog
- X Post from Factory
- X Post from Patrick O'Shaughnessy
- Microsoft Says Anthropic Models Are Too Expensive - Bloomberg
- Outcomemaxxing - Sierra
- X Post from The Transcript
- Charging for Intelligence: How to Price AI Software - Emcap
00:57:34 — Apple WWDC
- Inside Apple's Secret Meeting That Led It to Finally Take AI Seriously - Bloomberg
- What to Expect From Apple's AI, Siri and iOS 27 Launch at WWDC - Bloomberg
01:01:36 — OpenAI Is Merging Codex and ChatGPT
- Codex for every role, tool, and workflow - OpenAI
- X Post from OpenAI
- Inside OpenAI's Decision to Combine Codex and ChatGPT - The Information
01:06:19 — OpenAI Distances Itself from Brockman's Super PAC
- Our Views on AI Policy and Political Advocacy - OpenAI
- OpenAI's Greg Brockman and the Leading the Future Super PAC - Business Insider
- X Post from Sam Altman
01:08:55 — AI Roll-Up Targets the Accounting Industry
- Thrive Holdings to Bet $1 Billion on AI-Powered Accounting Roll-Up - Forbes
- Mercor CEO on Why Application Layer Companies Have No Defensibility, Interview with Brendan Foody- 20VC
- X Post from Harry Stebbings
01:12:23 — AI in Higher Education
- AI on Campus - The New York Times
- AI Tools at UChicago - University of Chicago
- Stanford Study Finds AI Beats Law Professors 75% of the Time - Forbes
01:16:29 — AI Use Case Spotlight
- Outcome + Context + Interview
01:20:29 — AI Product and Funding Updates
- Gemini Spark
- Gemini Spark is the most impressive and terrifying AI experience I've had yet - Apple News
- I put Google's 24/7 AI assistant Gemini Spark to work, and it's actually pretty useful - TechCrunch
- Gemini Spark – Your 24/7 personal AI agent for productivity - Gemini
- The Gemini app becomes more agentic, delivering proactive, 24/7 help - Google Blog
- SpaceX IPO
- SpaceX Targets $135 IPO Price at Valuation of $1.77 Trillion - CNBC
- SpaceX (SPCX) IPO: Live Updates - CNBC
- Musk's xAI, SpaceX Combo Is the Biggest Merger of All Time, Valued at $1.25 Trillion - CNBC
- OpenAI and Anthropic New Models
- Anthropic leapfrogs OpenAI with a record $965 billion valuation, says Mythos is coming in weeks - Fortune
- Anthropic Expands Mythos to 150 Additional Organizations in More Than 15 Countries - CNBC
- Meta AI Instagram Hack
- Anthropic Expands Project Glasswing
- OpenAI's Frontier Safety Blueprint
- Google Releases Gemma 4 12B
- Meta Business Agent
- Microsoft's MAI Models
- Building a hill-climbing machine: Launching seven new MAI models - Microsoft AI
- X Post from Elie Bakouch
- Microsoft Is Building AI Agents to Compete with OpenAI - The Verge
- Mayo Clinic and Microsoft's Healthcare AI Model
This episode is brought to you by AI Academy by SmarterX.
AI Academy is your gateway to personalized AI learning for professionals and teams. Discover our new on-demand courses, live classes, certifications, and a smarter way to master AI. Learn more here.
This episode is also brought to you by the AI for B2B Marketers Summit. Join us on Thursday, June 25th at 12 PM ET, and learn real-world strategies on how to use AI to grow better, create smarter content, build stronger customer relationships, and much more.
Thanks to our sponsors, there’s even a free ticket option. See the full lineup and register now at www.b2bsummit.ai.
Read the Transcription
Disclaimer: This transcription was written by AI, thanks to Descript, and has not been edited for content.
[00:00:00] Paul Roetzer: The canary in the coal mine, like AI research is the first discipline that AI will be applied to at this super human level where it completely redefines the role of an AI researcher, and it's gonna happen in the next 18 months. Whatever happens here, extract that out to whatever your profession is in the same premise will then follow as soon as more resources, meaning money gets thrown at your industry to do the same thing.
[00:00:29] Welcome to the Artificial Intelligence Show, the podcast that helps your business grow smarter by making AI approachable and actionable. My name is Paul Roetzer. I'm the founder and CEO of SmarterX and Marketing AI Institute, and I'm your host. Each week I'm joined by my co-host and SmarterX chief content Officer, Mike Kaput.
[00:00:49] As we break down all the AI news that matters and give you insights and perspectives that you can use to advance your company and your career, join us as we accelerate [00:01:00] AI literacy for all.
[00:01:04] Welcome to episode 218 of the Artificial Intelligence Show. I'm your host, Paul Roetzer, along with my co-host Mike Kaput. We are recording Monday, June 8th, right around 9:00 AM Eastern Time. Important timestamp this week 'cause we expect a very busy week. We're, we'll talk about some of the stuff that's coming, but we're definitely gonna get insights into what the new series is going to look like from Apple.
[00:01:28] We are expecting multiple model releases, potentially from openAI's and Anthropic. So it is gonna be a very busy week, Mike. for sure. Or you have a work, work cut out for us on episode 219 or 220. I don't know which one it is gonna be 'cause I think we may have a, another what a state of the industry when we're gonna do, I think.
[00:01:47] Yeah, somewhere in here. So
[00:01:49] Mike Kaput: that'll be 220. So the next weekly will be 219. All right, so
[00:01:53] Paul Roetzer: episode 219, buckle up. There's gonna be a lot going on, so, all right, we'll get through what we can this week. There was, [00:02:00] it was a very busy week last week. lots of big things related to IPO plans, executive orders from the government, more about the cost of intelligence and tokens.
[00:02:10] Just a ton going on, it seems. I thought we were gonna get a little bit of a summer lull and that seems to not be happening, Mike. Right. All right. So this episode is brought to us by AI Academy by SmarterX, which helps individuals and businesses accelerate their AI literacy and transformation through personalized learning journeys and an AI powered learning platform, in addition to a community that includes thousands of AI forward professionals and leaders.
[00:02:37] New educational content is added weekly, so you always stay up to date with the latest AI trends and technologies. Our very popular AI for departments collection features seven core series and certificates. Designed to jumpstart AI and understanding and adoption across organizations. There are certificates, available right now for marketing, sales, customer success, [00:03:00] hr, finance, operations, and legal.
[00:03:02] So literally your entire company can get upskilled in AI in about a month. cramming into a weekend if you want. But, those are all ready to go on demand. they've all been recorded in the last, you know, six to eight months. We lea a new one each month. So, it's a lot of fresh content. And then those are gonna be updated as we go along.
[00:03:21] And then they're enhanced with our Gen AI app series and our academy lives. So there's obviously this always continuous learning journey that people can do once they get through these on-demand course series. So the series are an ideal launchpad for organizations that wanna level up their teams and accelerate responsible AI adoption and impact across the company.
[00:03:40] Individual and business account plans are available. You can also buy single courses and series for one-time fees. Go to academy, do SmarterX.ai to learn more. And you can use, code pod 100, that's POD100 for $100 off any individual annual plan. [00:04:00] So again, academy.SmarterX .ai. Alright. And then every week if you're new to the podcast, we, and we have new listeners every week, so I'll give a quick recap.
[00:04:08] We do three main topics, where Mike and I dive a little bit deeper into those topics. And then we do, you know, usually about seven to 10 rapid fire items ending with AI product and funding news, which we cover like a dozen things rapid fire within those. So that's the format. But we always start off with our AI pulse survey.
[00:04:26] So this is an informal poll that we run each week of our podcast listeners. You can go to SmarterX ai slash pulse to participate in these polls each week. so we just asked two questions based on the topics that we covered in that week's, podcast. And so this week, the responses we have from last week's is your organization spending on AI starting to outpace the value you're getting from it.
[00:04:51] 43% said, we're not spending enough yet to know. Okay. 36% said too early to tell. We're watching it. So, okay. That's the [00:05:00] vast majority. Have no idea, basically. Yeah. And then 21% no, the value clearly justifies it. So, yeah. So not too many people saying yes, costs are getting hard to justify. Actually, nobody said that.
[00:05:12] Nobody, I don't think. All right. That's pretty interesting information. and then the second one is, over the past year, has your personal view of AI become more positive or more negative? Okay, that's 50% more positive. 31% about the same, and 19% more negative. That's good. That's encouraging. I, I like the more positive, despite all the heavy stuff we talk about on the podcast each week and the challenges we're facing from a political perspective, societal perspective, economical perspective.
[00:05:41] I like to see that people are still staying positive and trying to find the, the good paths forward with ai. for sure. That's encouraging. All right, Mike. Take us into our main topics for the week.
[00:05:53] Anthropic IPO & Talks Recursive Self-Improvement
[00:05:53] Mike Kaput: All right, Paul, so first up, this past week, Anthropic confidentially filed a draft registration statement, a [00:06:00] Form S one, it's called with the SEC.
[00:06:02] This is their first formal step towards an IPO towards going public. Because the draft is confidential, the company is not disclosed share counts, pricing or detailed financials. Yet, though we do know a little bit because the filing comes on the right on the heels of a massive raise. So on May 28th, actually Anthropic closed a $65 billion series H round that at that time valued the company at roughly $965 billion.
[00:06:29] The business behind the filing has scaled quite fast. Anthropic said its run rate revenue has crossed $47 billion, and reporting indicates it is approaching its first profitable quarter. Now, interestingly, alongside this IPO News, which is very practical, there's also some very important, heavy, high level stuff Anthropics been talking about because they published a piece concurrently from its in-house research arm, the Anthropic Institute, and it is titled.[00:07:00]
[00:07:00] When AI builds itself, it is co-authored by Anthropic co-founder and head of Policy, Jack Clark. And the piece argues that AI is now doing more and more of the work of building ai. It's moving the industry towards what is called recursive self-improvement, which we have talked about in the past on the podcast to support this argument, Anthropic says more than 80% of the code that it now merges into its own code base is written by Claude, and that its engineers shipped roughly eight times more code per day in the second quarter of 2026 than they did in all of 2024.
[00:07:39] The piece also notes that the length of tasks that AI can complete on its own has been doubling roughly every four months. Now in this essay, Clark and his co-author also warned that as models start building and training their successors, alignment failures could compound growing more frequent, but less [00:08:00] understood until we lose control of them.
[00:08:03] And it argues for verifiable systems that would let labs credibly pause frontier development if they ever needed to. So, Paul, two kind of important, but very differing perspectives. Here we have the practicality of the IPO, and then we have this talk of fears of recursive self-improvement. which ones stood out to you, or kind of what's the bigger story here with these?
[00:08:27] Paul Roetzer: Well, the IPO story, I mean, basically comes down to SpaceX, and we'll touch on this a little later too, but they're gonna go public this week, June 12th. SpaceX, LIPO. we talked earlier this year, they X ai, which was Elon Musk's AI Lab, has been rolled into SpaceX. So as part of the IPO, it's really our first AI lab going public, I guess you could say.
[00:08:48] I mean, obviously Meta and Google are already, publicly traded companies. Anthropic, openAI's and Xai were the ones we were watching. So Xai is gonna, in essence, be folded in and, and go public through this. [00:09:00] So we'll learn a little bit there. and then it becomes a race between Anthropic and openAI's as who's gonna get to, you know, market first and then which company's going to have the higher valuation.
[00:09:10] So that's what we'll continue to watch because of the confidential filing, we don't have a ton of insight yet into, you know, anthropics business, but we've been learning a ton that they're growing fast, they had profitable months. Um. It's gonna be really interesting and I, I think I posted, on LinkedIn a month or so ago.
[00:09:26] It's like, okay, you got a hundred thousand dollars to invest. You can only put it into one of these companies. Like, which one are you betting on? So yeah, it'll be fascinating to see how the market responds to both of those companies. Then just overall, the trend line of AI labs overall and the impact they're gonna have on the economy.
[00:09:42] The article, about when AI builds itself, I found infinitely fascinating. I actually. Wrote about that in my exec AI newsletter, editorial on Sunday. because I think it's an extremely important concept that a lot of people probably haven't really stopped [00:10:00] and thought deeply about. We've talked about it, numerous times on the podcast, especially over the last six months, that recursive self-improvement was likely one of the Keyon locks to achieving AGI or artificial general intelligence, along with world models and a couple others.
[00:10:17] But in essence, the labs are all sort of betting on, there's probably needs to be like one to three. That's, and I'm just categorizing based on like how Des would explain it de like there's probably one or two or three breakthroughs out there. Some people like Demis and I think openAI's and Anthropic think that it's basically taking large language models, scaling them up, and then layering over a couple of breakthroughs.
[00:10:40] Then you have people like Jan Koon who think language models are useless and that is not the path forward. But I would say the. Dominant, consensus is that language models are certainly, essential to the path. And if we can layer on things like recursive self-improvement, that's probably gonna be enough to get us to [00:11:00] AGI and beyond.
[00:11:01] So I'll, what I'm gonna do is I'll break down some of the key elements of that article, and I'm gonna recommend people definitely go read it. You're probably gonna have to read it a couple times. I think there's just a lot in there, and especially if it's a newer topic to you. So, um. It said the, the subhead is our progress toward recursive self-improvement and its implications.
[00:11:21] So then it says, for most of AI's history, humans drove every step in its development cycle. But at Anthropic, we are delegating a growing share of AI development to AI systems themselves, which is speeding up our work taken far enough and given enough compute that trend points to an AI system capable of fully autonomously designing and developing its own successor.
[00:11:45] This is called recursive self-improvement. We are not there yet, and recursive self-improvement is not inevitable. But it could come sooner than most in institutions are prepared for. So just to stop and unpack that real quick there, the, the [00:12:00] models are being used to improve and create future models of themselves, like they're absolutely, all the labs are using the mo, like the current version of the best model to help develop the next model.
[00:12:11] But it's heavy involvement of AI researchers, like they're very, very in the loop. What they're saying with recursive self-improvement is, in essence, the humans largely come out of the loop and the model is in essence. Continually improving and rewriting its own next version. That's the basic premise. So what they then highlight, and you kinda alluded to this, Mike, is how fast these things are improving at this capability.
[00:12:36] Mm-hmm. So the models are improving way faster than they were six months ago, in part because they've gotten so good at coding and this is directly impacting productivity. So Anthropic uses itself as sort of, patient zero in a way. Like they're saying, Hey, this is what we're seeing. So I'll call out a few elements of the article to highlight what they're saying about what's going on [00:13:00] within Anthropic.
[00:13:00] And this is like how you and I might try and do this with SmarterX. We're trying to say like, listen. You can look at all the studies and all the research in the world, and all the thought leaders and all the experts who are claiming different things about jobs and the economy and all these things. What we always try to do is say, but we're living this every day.
[00:13:17] And like, what are we seeing? Mm-hmm. And so I, I always like when companies take this like very inward looking and saying, listen, the data may say something else, but like, here's what we're seeing. So they said the rate at which AI models improve is accelerating. The length of tasks that they can reliably complete on their own has been doubling roughly every four months up from an earlier trend of doubling every seven months.
[00:13:37] Now, that seven months is a key stat that Mike and I have talked about many times over the last 18 months. That comes from Meter, which is an organization that actually studies the reliability of, AI agents to do tasks that would take humans X amount of hours. So they look at a human expert and say, okay, it would take a human expert four hours to do this thing.[00:14:00]
[00:14:00] The AI can complete that task way faster at a 50% reliability or an 80% reliability. So they look at how long a human would take. And then they say, can the AI do it? And how reliable is it for the AI to do it? So what they're saying is their internal data has gone from seven months, which is what meter showed to four months, and it's dropping fast.
[00:14:21] they said in March, 2024, Claude Opus three could complete software tasks that take humans about four minutes to complete a year later, Claude Sonnets 3.7 managed tasks that took about an hour and a half a year after that. Claude Opus 4.6 managed 12 hour tasks. If this tread holds tasks that take a skilled person.
[00:14:48] Days could come into the range this year in 2027. AI systems could be capable of tasks that take a person weeks. Mm. Now keep in mind, and Mike, you and I talk [00:15:00] about this all the time, they're focused on AI research and development of like coding, programming stuff. But what we're always saying is like this translates out to all knowledge work at some point.
[00:15:11] Like they're focused on AI research 'cause that's what all the labs are trying to build first. But whatever happens there, you as a listener, like you need to extract that into your own industry. And it may be a year or two or three behind what they achieve with AI research, but the same premise holds true.
[00:15:28] No matter if you're a lawyer or an HR professional or a CEO or like whatever it is, something that takes you weeks, you are gonna start to be able to do in minutes through simple prompts. So then they say large performance gaps persist when it comes to clawed exercising judgment in choosing goals in both engineering and research.
[00:15:48] So they talk about building AI models takes two major disciplines, engineering and research. And what they're saying right now is the gap is the model doesn't have great judgment and taste yet. That's where humans still [00:16:00] excel. But they don't see that as a scientific limitation, meaning like they think they can get there, it's just gonna take a little bit longer.
[00:16:08] they said in April, 2026, so just two months ago, Claude shipped over 800 fixes. That reduced a class of API errors by a factor of 1000. The engineer overseeing Claude estimated that a human would've taken four years to complete this work. Now again, it's incredible. This is computer stuff, but like. Take that and extract it to other professions on the most open-ended tasks.
[00:16:38] Claude's success rate reached 76% in May, up 50 percentage points in six months. Wow. So again, all of this is just like context that they are seeing rapid acceleration to the point where they have to write this article and say, we don't even have good intuition as to what this means in two years, but this is the data we're seeing.[00:17:00]
[00:17:00] They then said in May, 2025, Claude Opus four averaged a three X speed up over the starting code by April, 2026. Claude Mythos preview, which is the one we've talked a lot about over the last two months that was not released yet, may be released this week, but only like the government and some other select people have access to this.
[00:17:19] Claude Mythos preview was achieving 52 x for calibration. A skilled human researcher would need four to eight hours to reach four x. Hmm. In this part of research workflow where they're optimizing steps within a clearly defined experiment. Claude has gone from super helpful to super human in under a year.
[00:17:41] then it says, an area of human comparative advantage for now is research, taste, and judgment, which I alluded to earlier, including choosing which problems matter, which results to trust, and when an approach is a dead end. So again, think of all of this as I, as I said in the, the, the newsletter a can, the, the canary in the coal [00:18:00] mine, like AI research is the first discipline that.
[00:18:04] AI will be applied to at this superhuman level where it completely redefines the role of an AI researcher, and it's gonna happen in the next like 18 months. Whatever happens here, extract that out to whatever your profession is, and the same premise will then follow as soon as more resources, meaning money gets thrown at your industry to do the same thing.
[00:18:26] We'll talk about this happening in the accounting industry in a couple minutes. So then the article presents three possible futures. They said one, the trend stalls, but today's AI capabilities are widely diffused. The article features many exponential trajectories, but these trajectories may actually turn out to be S-curves where it kind of goes up and then it flattens out.
[00:18:44] We may be approaching the bend in the curve where the returns diminish, and the line straightens then flattens. Even if model capabilities were frozen at today's level, we would expect major changes to occur in the world. We are still early in diffusion of [00:19:00] today's models into the wider economy. And this is real important where a hundred person company can increasingly do the work of a thousand person company because each employee will sit atop a pyramid of agents.
[00:19:12] They include this scenario, they say for completeness, but do not believe it's likely. So this idea that we're just, today's it, like we're just gonna basically freeze the development and this is gonna be the smartest models we ever have. Highly unlikely. But what they are saying is today you could in theory, build a thou a hundred person company that could do the work of a thousand person company.
[00:19:32] And I a hundred percent agree with that. Like I, that's probably actually not that hard in most industries if you know what you're doing. Second, the AI labs continue to see compounding efficiency gains. In this scenario, AI development becomes substantially automated, but humans continue to set research direction and judge results.
[00:19:50] So in this case, organizations that use AI systems will become much more efficient as time goes. And we would expect to see significant productivity multipliers on each person. [00:20:00] 100 person companies could do the work of 10,000 or a hundred thousand person organizations. This would revolutionize knowledge work in government services.
[00:20:08] The evidence they say that they've laid out suggests that this is the most likely scenario that you can build a hundred person companies doing the work of 10,000 thousand. When I think about SmarterX and what we're building, I think we can be a hundred million dollar a year company with 100 or fewer people.
[00:20:24] Hmm. So, what that would equate to is a million in revenue per employee. I actually think two to three is very likely. So you could build a hundred million dollars company that traditionally might have taken four or five, 600 employees, a thousand employees, with dramatically fewer people because those people are gonna be empowered by agents and they're gonna orchestrate these agents.
[00:20:44] So I think that is very realistic, and especially if you're an AI native company that builds from the ground up. It's gonna be a lot harder to compress down from a big company like a 10,000 person company down, but it's gonna be pretty. Easy. Rel is, is kind [00:21:00] of not the right term, but like it's gonna be a lot.
[00:21:02] Um. Simpler to do it from the ground up. And then their third scenario is AI systems themselves become case capable of recursive self-improvement and begin building their successors. If technology trends and advancing capabilities continue and AI systems are able to develop the capabilities inherent, transformative human ingenuity, then it is plausible that AI systems could design and refine themselves in this world.
[00:21:27] The pace of progress in AI development becomes determined entirely by the availability of compute for AI systems. Humans play a substantially diminished role in their development, likely moving most of our efforts toward oversight, validation, and verification of an expanding virtual lab run by AI systems.
[00:21:46] And then they say, we do not have good intuitions for what this world would look like because our economy is currently driven by humans and human built tools. Hmm. Now all that said, Mike, the thing that everyone latched onto, at least in the media, was this [00:22:00] headline you saw everywhere of like Anthropic calls for slowdown in.
[00:22:03] AI development. They're pulling that from the very end of the article. They said if it were possible to effectively slow the development of this technology to give ourselves more time to deal with this immense implications, we think that would likely be a good thing. We believe it would be good for the world to have the option to slow or temporarily pause Frontier AI development to enable societal structures and alignment research to keep up with the advance of the technology.
[00:22:32] Demi Sabas would 100% agree with them on this. He has said that publicly, as has other AI leaders. it's not happening like that. I, I, we can, you can have as many articles about this as you want. You can have Congress get involved, which they're going to, you can have executive orders on this. That is not going to happen because China won't agree to it.
[00:22:53] And unless China agrees to it, and even then, you need the frameworks to make sure that they're [00:23:00] adhering to it. Right. So it's this whole thing, like you would first have to have all the infrastructure in place to monitor. It's like atomic weapons. You can monitor the enrichment of uranium. Like you can monitor if someone is adhering to a policy where we're going to slow down that stuff, AI wouldn't work that way.
[00:23:17] Mm-hmm. And so they presented some ideas and said like, Hey, we're gonna research and try and come up with ways to do this, but a slowdown just isn't. Gonna happen. And even if it did, if, if publicly they said it was happening, no way. Like it, they, they would be doing it in using dark money and like hidden lab, like the, the US government is not going to slow down no matter what they say publicly they're doing.
[00:23:43] And so people have to understand like that. Scenario three, there's. I, I, I mean, I'm even running outta like energy and chip availability are, are literally like the only two things I could think, right? Where the government doesn't like pour trillions into it. Like they will do [00:24:00] whatever it takes to be the ones to build that thing, no matter how dangerous it is.
[00:24:05] So, super important article. Like I, I mean, I know we're spending a lot of time upfront on this one, but I, I really think people need to comprehend the idea of recursive self-improvement and the fact that every major lab is pushing forward with it, even though they know how dangerous it may be.
[00:24:23] Mike Kaput: And just to reiterate a really important point you made, even if you're listening to this as a business leader and saying, look, this is sci-fi, I've got enough on my plate, like.
[00:24:32] Read it and imagine it's 5% true. Yeah. And you have some thinking to do. I would argue so.
[00:24:39] Paul Roetzer: Well, yeah. And even Mike, just like go to the bottom, go to the end of it and just read the three scenarios. Yep. The first scenario, like think about just that one, like a hundred person company doing the work of a thousand person company, and then think if you're a legacy company, like an existing EN enterprise.
[00:24:57] Someone is building the a hundred person version of your a [00:25:00] thousand person company, and how are you gonna compete with them? So like you should be thinking about the other things, but you need to at least be solving for the reality of anyone can come up and build a competing version, an AI native version of your legacy company.
[00:25:16] Now you may have product and distribution and things that are gonna be really hard for them to replicate. And so not every industry is gonna be just disrupted over the next three to five years, per se, but in a lot of industries, there's not very many obstacles to someone just building a smarter, more efficient, more profitable version of your company.
[00:25:33] And that's gonna put leaders in a really tough spot where they're gonna have to make some very unpopular decisions around staffing structures and cost structures, because that, that's coming like that is, I don't even see that as debatable. It's inevitable across every industry.
[00:25:52] Trump's AI Executive Order & Government Stakes in AI Labs
[00:25:52] Mike Kaput: All right, so our next big topic this week somewhat related to just how serious this is all getting is.
[00:25:58] This past week, president [00:26:00] Trump signed an executive order titled, promoting Advanced Artificial Intelligence, innovation and Security. And this executive order says that Frontier AI Labs can now opt into a federal process to have their most powerful models designated as quote, covered frontier models. If a model qualifies, the lab can give the government 30 days of early access before public release so that government agencies can evaluate its cybersecurity capabilities.
[00:26:31] Now, the White House Office of Science and Technology policy made a point to say that it is not conducting oversight of all new models. And the White House has framed this process as applying only to models that represent a meaningful step change in cyber capabilities, not routine version updates. The order itself states that nothing in it creates a mandatory licensing, pre-clearance or permitting requirement for new AI models.
[00:26:59] So that's [00:27:00] one piece of this. However, some of these assurances are getting a little overshadowed by some headlines this week that suggested the government may be open to taking financial stakes in the AI labs themselves. So according to Politico quote, president Donald Trump said he will likely meet with AI companies at the White House next week.
[00:27:19] This was last week to discuss what he called a federal government quote, partnership that would allow the American people to profit in their success. Also, according to Politico quote, the policy concept first reported by an outlet called notice, which is actually interestingly, a media firm founded and headed by Politico's former owner, quote, has been floated by openAI's, which issued a policy paper backing a public wealth fund.
[00:27:45] And broadly discussed by Anthropic and billionaire, Elon Musk, who runs XAI, Senator Bernie Sanders, recently proposed a bill that would create a 50% government ownership in AI companies. Additional sources. Did stress these talks a really early, [00:28:00] the legal mechanism for the government to even do this is unclear.
[00:28:04] It may not happen at all, but Paul, lots of chatter around this. Like how should we be thinking about both the executive order and are you taking seriously this idea that the government might take a financial stake in AI labs?
[00:28:17] Paul Roetzer: I mean, they've already done it with Intel, like the precedent's sort of already set.
[00:28:20] yeah. Yeah, so this is what we've been waiting for is AI taking over Washington like it is. Both sides are constantly posting on X about this. There's bills being proposed, like it's now front and center within politics. It's gonna become more so in the next five months as we, you know, race into midterms here.
[00:28:41] So it is the talk of the town in Washington. The executive order was a little bit of a surprise because it's almost. The exact executive order that was planned like a month ago or whatever it was. Mm-hmm. Yeah. And Trump had called like 24 hours notice all the big AI leaders minus Dario Amide to come to the White [00:29:00] House if, for the signing of the executive order, then date of his Sachs shows up at the last hour and convinces him not to do it.
[00:29:06] the only thing that apparently changed was they went from a 90 day window to a 30 day window. so there's a fact sheet, like if you want to just read the fact the, the order itself isn't very long. I mean, it's probably 1500 words or something. It's nothing crazy. But there's a fact sheet we'll put a link to.
[00:29:23] You can read it. I, I'll just highlight a couple of the pieces of that. So it said, the order directs federal government to establish a voluntary framework and collaboration with AI developers regarding covered frontier models, which would provide the federal government with secure early access. For trusted partners to strengthen cybersecurity, to promote, secure innovation.
[00:29:39] Voluntary is, I just, I would put personally put that in quotations. The way the government does voluntary is like, yeah, you can choose not to participate and then we can choose not to give you government contracts. So it's not, it's like wink, wink, voluntary. they go out of their way in the fact sheet.
[00:29:58] There's a whole [00:30:00] section dedicated to it. To talk about how the administration is the most innovative administration in history and they're, you know, we're not trying to stifle anybody and this is not, you know, true regulation. We are all in on this and we're gonna do everything we can to drive it because there's a lot of people who don't want the federal government involved in regulating this, including people like David Sachs.
[00:30:22] Hmm. So, they can't come across as stifling innovation because they have to be, they have to continue to show strength in the race for AI supremacy against China. And so this is this fine line. They're walking, between it. Now, interestingly, two days later, a bipartisan group of house lawmakers on Thursday unveiled a proposal that would regulate AI that could override some state laws.
[00:30:46] This one is a 269 page framework. So there's a link to Axios, we'll, we'll put in that had a nice little breakdown on this. it would preempt state laws on the development of AI models for three years. Formally established the Center for AI Standards and [00:31:00] Innovation, tasked with making voluntary standards and guidelines and app, appropriate a hundred million per year, from 27 to 29.
[00:31:07] This would require large frontier developers to write and implement plans addressing risks prior to releasing new models, as well as to report critical safety incidents. And the draft legislation would also protect AI whistleblowers, which is gonna be very important. Mm-hmm. Increase fines for AI enabled fraud and try to boost funding for AI literacy education and research.
[00:31:28] It would create a mechanism for the government to study AI's impact on the workforce and add some workforce protections around ai. So again, just bills are gonna start popping up everywhere. Interesting. Side note, what's going on, and especially related to the cybersecurity is Anthropics. Claude Mythos model is apparently being used by the NSA, the National Security Agency, and the four deployed engineers that we've been talking a lot about, they have them embedded with the NSA.
[00:31:54] So while Anthropic is being blacklisted and talking about, you know, basically being the antichrist by the [00:32:00] government, they're embedded with the NSA and they're apparently using it to be. proactive, cyber attacks like preparing to do these things. So who knows what is actually going on with the government and Anthropic, but it's this continuous dance.
[00:32:13] Like, we're gonna be really angry with them over here, and then we're gonna like use them over here to do this other stuff. Now the Bernie Sanders thing I think is interesting. Like, I don't even know where the origin of some of these ideas are coming from. Like, Trump does an interview on Thursday or Friday where he is like, oh yeah, we're gonna like look at taking 50% stakes.
[00:32:28] I'm gonna meet with Sam Altman next week. But just like three days prior, Sanders had an op-ed in the New York Times where he presented this idea. So I don't know if like Trump's latching onto a Bernie Sanders idea or if this came, I'm not, I dunno. But Sanders op-ed, that again, we'll link to, he said. AI will almost certainly be the most transformable technology in the history of the world.
[00:32:51] It will profoundly affect the life of every man, woman, and child in our country. It will bring and is already bringing unimaginable changes to our economy, our democracy, our emotional [00:33:00] wellbeing, our environment, and how we educate and raise our children Further, there is a very real fear that ai, as AI becomes smarter than humans, it could eventually function independently with potentially catastrophic consequences.
[00:33:12] The question then is not whether AI will change the world. It will. The question is, who will own and control that future? Who will benefit from it and who will be hurt? Will AI be used to make life better for working families? Will it enrich our quality of life? Will it help us eliminate poverty, extend life expectations, and solve the climate crisis?
[00:33:30] Or will the future of humanity be determined by a handful of billionaires who have promoted and developed AI with virtually no democratic input, who stand to become even richer and more powerful than they are today? Hmm. That is why I Bernie Sanders, soon to be introducing the American AI Sovereign Wealth Fund Act.
[00:33:48] This legislation would give the public a direct ownership stake in the largest AI companies in our country, how it would create a sovereign wealth fund through a one-time 50% tax, not on the profits of [00:34:00] openAI's, Anthropic, Xa, and other companies, but paid with something far more valuable the stock. So.
[00:34:06] Again, I, I don't, I don't know if that the administration liked that idea and then they said, okay, yeah, let's go do that. I, I don't know. But this is, both sides are continuing to try and figure out their positions here. Are we for against acceleration, are we for, against government ownership? it, yeah, it's gonna be really messy, but again, this is like a, it's this bipartisan weird thing where people just aren't.
[00:34:33] Really sure. 'cause they don't really fully understand the implications, but obviously they're starting to do a lot more research and paying a lot more attention to it.
[00:34:41] Mike Kaput: Yeah. putting my tinfoil hat on, it feels like perhaps both parties got some polling numbers back. Yes. I totally say this is why we're starting to hear this more.
[00:34:52] Yep. Definitely. Interesting, man. Very interesting times to have, quickly gotten to being, [00:35:00] talking so much about soft nationalization so quickly as swallow. I mean,
[00:35:03] Paul Roetzer: I feel like, yeah, it was like six weeks ago or something. We're like, Hey, it's gonna start happening and this would've been inconceivable
[00:35:09] Mike Kaput: six months ago, right?
[00:35:11] Fast, right. It becomes normal and like two
[00:35:12] Paul Roetzer: weeks later it's like, oh yeah, the government's taking 50% stake and openAI's is totally normal. It's like, wait, what? When did that become normal? And I, I don't even, like, I don't even know my, I mean, I haven't really formed my own. Like strong beliefs of what I think should happen.
[00:35:27] Yeah. I'm more of an observer at this point and just listening to all these different perspectives, but there's definitely some things that come, it's like, well, that's a terrible idea. Like that, that would not go well. Republicans might love that idea today, but if the Democrats come back in power now, they're gonna hate the fact that the Democrats have a 50%.
[00:35:45] Like it's, I don't know, like it always swings. Like it might be a great idea when your political party is in power, but then you're gonna hate that idea when the other party's in power. And I think there's just a lot of that going on right now where it's just. [00:36:00] Good ideas today might not be good ideas in 2028 for some party.
[00:36:04] Mike Kaput: Yeah, I've, I've never prayed more for sound policy making because regardless of party, I just don't see a future where a technology, this consequential doesn't get more, rather than less state involvement. So yeah, formulating sound policy would be really important here.
[00:36:22] Paul Roetzer: Yeah. And I, I, I was trying to figure out a way to like weave in the Hunter Biden thing, but I'll just like touch.
[00:36:27] Yeah. So if you were on X at all in the last like. 72 hours, hunter Biden has like taken over X and both political parties just seem to all of a sudden love this guy. It is like literally the greatest communications thing I've, I've ever seen. But any, like, the reason I'll bring it up. one, if you want to be entertained, like go read his tweets, they're amazing.
[00:36:45] But he actually addressed AI and I thought it was great. Like, I thought it was like this really, really don't think so in this long thing about like things most Americans agree on. And he's basically saying like, Hey, we all could just like love each other and be kind to each other and listen to each other and like learn from each other.
[00:36:58] It was the whole premise. [00:37:00] But he had this excerpt where it said AI is like my new best friend that also might be trying to take my job, my ability to think for myself and my humanity in the process. Yo, like, I love you but W2 F, but I still love you. And I was like, that's perfect. Like that, that is basically like AI right now.
[00:37:15] Like we're all afraid of it, but it's pretty awesome. But like chill out a little bit. But yeah, it's, it's great though. Let's go like. You know, prompt some stuff.
[00:37:22] Mike Kaput: Yeah. It'd be wise for AI Labs to take note that that's probably far closer to a reflection of your average person's feelings. Yes. Than the narratives being told one way or another.
[00:37:33] Agreed.
[00:37:34] The Soaring Cost of Intelligence, Part 2
[00:37:34] Mike Kaput: All right, so our third big topic this week is last week we had dug into a bit the soaring cost of AI and all these problems around usage. So we're gonna talk about this in kind of a part two because this past week we had a few more stories come out building on this, that show just how much companies are struggling to figure out a, let's call it AI budgeting.
[00:37:54] So for one, Uber has now started capping how much its employees can spend on AI [00:38:00] coding tools like Cloud Code and Cursor. Right now there's a limit internally of $1,500 per month per tool. And again, keep in mind people are u often using this through the API, so they're not paying like 1500 a month for licenses per se necessarily.
[00:38:14] This is obviously directly related to what we talked about where Uber said it burned through its entire 2026 AI budget in about four months due to heavy usage. Uber's, CEO has now come out and said, you know, the company is being forced to adjust. Uber is now using, he says, the most expensive AI models for early experimentation.
[00:38:34] But plans to shift to cheaper or open source options as those use cases scale up. On a recent earnings call, they also said about 10% of Uber's code is now being written by ai. we had some commentary from Simon Willison, who is a AI expert. We've cited quite a bit on the pod. he did a little math here that just might be interesting to people.
[00:38:56] He estimates that if an engineer, let's say at Uber is actively [00:39:00] using two of these tools, the cap works out to about $36,000 per year, which is roughly 11% of the $330,000 a typical Uber software engineer makes in total pay. He actually called this cap a rational response to overspending and noted that it actually shows how much value these tools deliver since people are clearly willing to run up bills this large.
[00:39:24] At the same time, we've got some reporting that showed the same issue is becoming a problem at Microsoft this past week. Must, Mustafa Soleman, Microsoft's AI chief told Bloomberg that quote, Anthropic is extremely expensive. I think people, many people are urgently looking for alternatives. He said Microsoft.
[00:39:40] Off pay philanthropic a lot of money. And that the company's goal is to reduce and ultimately eliminate that cost. To that end, Microsoft announced seven of its own in-house AI models, including one that it claims matches Andros, Claude Opus 4.6 on a popular coding test at a lower price. We're also hearing about companies trying to [00:40:00] build workarounds to solve this.
[00:40:01] For instance, the AI coding company Factory announced a new feature it calls Factory Router, which automatically picks the cheapest capable model for each task they claim. It keeps top tier performance while cutting costs by 25%. So Paul, we talked about this at length. It sounds like people are trying to explore different options here.
[00:40:21] What did you make of all the developments on the AI usage and spending front?
[00:40:25] Paul Roetzer: Yeah, just continue to be a trending topic. Again, like a lot of where I get my information, a lot of where I'm trying to kind of like analyze trends and figure out the things that are gonna be really relevant for our audiences and the stuff to talk about.
[00:40:36] The show comes from Max. So, I've, I've shared this before, but I have. highly curated lists of AI tech companies and ai. Researchers, entrepreneurs, journalists, things like that. And then I get alerts to probably, it's, it's probably north of 400 accounts now, and that's largely the way I monitor what's going on.
[00:40:56] And then I track those and grab links and put 'em into a sandbox for Mike and I [00:41:00] each week. And so this is one that's just like all over my feed right now of the accounts that I get alerts for. and I get that that's usually months ahead of like what the mainstream media might pick up on his stories.
[00:41:12] But I think x is actually a really good signal for what's gonna be really important. So Business Insider had an article where they talked about Sam Altman saying that the top lead token leader at openAI's is about a hundred billion tokens a month. but in some other examples, openAI's employees seem to spend even more than the a hundred billion.
[00:41:31] Peter Steinberger screenshot showed 603 billion tokens spent in 30 days. And the New York Times reported that one openAI's employees spent 210 billion tokens in a week. So now, like if you're not familiar with tokens, in essence, they're just like running agents on loops, like all day long. They just have agents going and it's like infinite use of the API, or you know, in this case, they're sitting within openAI's.
[00:41:57] They have this infinite access to burn as many tokens as they [00:42:00] want, and they're just like running all of these things simultaneously. The only way you could possibly do that, so. But then Sam, I thought was a very interesting and very vulnerable thing to admit. Hmm. He said the cost question came up quite suddenly at the beginning of 2026.
[00:42:17] So just six months ago. The issue never came up. There's a quote from him. The issue never came up. Then he said, quote, people were totally happy with the amount they were spending. Now ar costs are, quote, a huge issue. So out of nowhere, companies are now talking to openAI's all the time about the spend of these tokens.
[00:42:39] Sierra, which we've talked about numerous times. Brett Taylor, who is the chairman of Opening Eyes Board runs Sierra. they had an article come out called Outcome Maxing. So they've been. Very outspoken about the need for outcome-based pricing rather than token based pricing or metered pricing. And so they said that traditional enterprise [00:43:00] software is about productivity tools for teams.
[00:43:02] Given AI can do the work, customers should pay for outcomes, not usage. They said the SAS apocalypse that we touched on beginning of the year as it has been dubbed is the market internalizing that the future is not productivity tools for teams, but AI agents that deliver outcomes. Outcome-based pricing is more complex than seat based or consumption Pricing.
[00:43:24] Operationally, contractually, accounting wise, people telling you it's simple are selling something. Hmm. If only works where it only works, where the software is highly autonomous and highly attributable. If the work isn't autonomous, you're really just selling a tool and the tools priced by usage or by seat.
[00:43:42] If the outcome can't be cleanly attributed to the software, you'll get into an endless debate. So then they linked to an article I had not read yet that I thought was really good, so I'm not gonna spend a lot of time on this, but if you're interested in this, be based on your company's trying to figure out how to pay for AI technology, or you're a [00:44:00] software company that's trying to figure out how to charge for AI stuff, or you're a service company that's sort of in the middle.
[00:44:05] There's this really good article from earlier here. It's called Charging for Intelligence, how to Price AI Software. MCAP was the, is the URL, but again, we'll put the link in. So they talked about founders that are specifically writing for, like software companies that are trying to charge for this.
[00:44:20] Our wrestling with the fundamental question about value capture in the AI era, including should I use seat usage or outcome-based pricing, or a hybrid? How do I price when my product replaces human labor? How should falling large language model costs factor into my pricing strategy? So today's model, let's say 4.8 or whatever, Claude, 4.8 or 4.7, whatever number we're on this week, I think we're about to get 4.8 or five.
[00:44:44] Um. That it'll cost 10 x less to deliver in six to 12 months. So like how do we factor that in that the models just keep getting cheaper, but then the frontier model is gonna keep getting more expensive, like the best model is gonna get more expensive. So they said, to help answer these questions, Jake, Jake Saer, [00:45:00] general partner at Emergence Capital, which is probably mcap, hosted a lively discussion at the emergence offices with two of the best minds in pricing.
[00:45:07] Moh Rama Ramadan and Josh Bloom, both general partners at 49. Elms Ventures and former managing partners at Simon Kutcher, were they advised 500 plus companies on pricing. MOH also wrote Monetizing Innovation, the most widely read book on pricing strategy. And I thought it was really interesting.
[00:45:27] They broke pricing into two categories, autonomy. So how independently does the AI operate? Does it augment human or replace them entirely? And then attribution. How clearly can the AI's actions be linked to measurable outcomes? So it's not as simple as like, Hey, let's just do outcome pricing. You know, where we're monitoring outcomes or usage-based pricing, or seat pricing or human replacement pricing.
[00:45:52] You actually have to look at what is the AI enabling? Is it when it does the thing, we can directly attribute it to a [00:46:00] value? It's an A equals B thing. Or is it like not so clear exactly what it does? And based on the attribution level, it actually starts to dictate whether a seat based pricing, a uses brace, pricing or outcome is preferable.
[00:46:13] But obviously. The human labor replacement cost is really easy to understand as is outcome if you can do it. So then that leads to like, okay, what do we do about this? You can find cheaper models like you, you can, like as a team, like say, Hey, let's just use the cheapest model. You can do model routing, which is the thing you're gonna hear a ton about.
[00:46:30] That's what you were referring to Mike earlier. Mm-hmm. Where it's just like the AI picks the best model for the prompt or the task that you're doing. The, the labs themselves can just make the algorithms more efficient. there can be more transparency from the labs as to how many tokens are being used, which right now we don't know.
[00:46:49] you can teach your people to be better at prompting and better at model selection. So that's what you as a leader can do. And then the thing I was thinking about as you were talking, Micah, doing this intro is like, [00:47:00] you almost need to do this token analysis by roles and workflows. Where Yep. When I'm gonna say, okay, we're gonna give the sales team access to Claude.
[00:47:07] Here's what the standard sales person would do in a day. Here's the 25 tasks they're gonna use Claude for these 20 Don't need opus. Like they just need sonnet 4.6 and like, right, right. That's all they use for it. And so just that. Would actually limit the number of tokens that you're using as a company if you did a token audit upfront and like guided your team what to use or even better restricted their use to only those things.
[00:47:36] So I just think there's a lot to go on. And then the last thing I'll say, Mike, is, Brian Armstrong, who's the, co-founder and CO of Coinbase, had a tweet that was getting a lot of attention and he said that, you know, where this kind of all goes at Coinbase, they're working hard on routing prompts to cheaper models where appropriate, and in some cases have been able to keep costs roughly flat while token usage continues to grow [00:48:00] exponentially.
[00:48:00] So he had some other thoughts in that tweet, but the basic premise he said is like, we're just trying to route it to like cheaper models when they don't need the most advanced, model to be used. So. It's early, but a lot of companies are gonna be racing to try and solve for this, and there's gonna be a lot of pressure on these labs to figure it out.
[00:48:17] And it's gonna be one of the big questions when they, IPO is like, Hey, aren't people just gonna start using open source models to do 80% of the work because your models are really expensive.
[00:48:27] Mike Kaput: Mm-hmm. You know, Paul, we're actually working pretty closely trying to solve some of these questions for ourselves at SmarterX in the next couple weeks.
[00:48:36] I'm curious, like, do you, I, I have no idea what the way forward is, but I did think it was interesting with Simon Willison saying like, Hey, this is roughly like 10% of the engineer's salary, it's being capped at. Do you, is there any merit to like. A percentage of like token spend per employee based on like salary?
[00:48:56] Or does that even Yeah. On a long enough timeline, it might not matter as [00:49:00] things get cheaper. How do you think about that?
[00:49:01] Paul Roetzer: So, I haven't sat down and done this analysis yet, but I'm gonna kind of think, I'm gonna do now what I would do if I was sitting by myself, the way I would probably start to think about this, Mike, is I would look at, say, the next 12 months of the growth of SmarterX.
[00:49:17] Mm-hmm. And I would say, okay, in a, in a traditional environment. Based on where we're going and the growth I would need to hire for these five roles. So I would hire a event marketing manager. I would hire this person. I would hire this person. I would then do an analysis and I would probably pull you in, pull a couple other people in, and I would say, let's sit down and really think about these roles.
[00:49:36] Do we need this role as a full-time employee? Like what that this person would do? Mm-hmm. Is the human still gonna do? And so I would look at it and say, okay, what was gonna be, let's just say arbitrarily a hundred thousand per person. So I have, I have a half a million budgeted for five hires. If I look at it and say, I think we only need two, like we're only gonna hire two out of that five, that leaves us 300,000 [00:50:00] that we would've spent on salaries.
[00:50:02] Mm-hmm. That we can now say, let's take 20% of that and let's put that into token budgets for the marketing team to go and do the work of what would've been five people. And we're gonna basically do it with two plus. AI agents that are gonna take up 200,000. Now I'm just making up numbers here, making up a scenario.
[00:50:19] Right? But for me, that's how I would think about it. It's like, what would we be spending and is it still gonna put us on that path to get to like a million in revenue per employee? And so I would look more at revenue per employee. I would look more at margins we're trying to achieve. I would look at what the cost of payroll would've been, and then I would back into what is my, I don't think just arbitrarily saying, oh, we're paying someone $150,000.
[00:50:44] Their token budget should be 10%. What does that mean? Like they may burn. 'cause if they're not educated how to use that token budget, they're gonna burn through it and it's gonna be useless. Versus someone who knows how to properly prompt or knows how to pick the right model, they're gonna get way more out of that 10% token budget.
[00:50:59] So [00:51:00] the percentage of budget means nothing if the person doesn't know how to use the tokens wisely. So I think it's a combination of looking at. What would we have spent on payroll? And then how do we provide the I literacy necessary for the person to efficiently use the tokens they're gonna be provided.
[00:51:15] And then you need governance structures to actually ensure that they're using tokens in the right way, which might be new third party software, right? It might be internal systems, like, I don't know. But that's how I would start to think about it. And I would say, you know, if I had like a week to just work on this and like tap into some of my friends in the industry, like smarter than me, when it comes to a lot of this stuff, I, I think you could probably build like a beta structure of how to start thinking about this pretty quickly.
[00:51:41] Mike Kaput: Yeah, yeah, yeah. I think 'cause yeah, to your point, if you can categorize certain tasks, like we all, we can at least triangulate on at least a handful of things we do all the time that we know don't need the level of tokens that they need. Right. So you can at least crack part of the problem, I
[00:51:57] Paul Roetzer: think. Yeah.
[00:51:57] And I actually think like I'll, again, I'm just gonna. [00:52:00] Give something away that I probably shouldn't say out. Wow. But what, you know, who cares? I'm not gonna have time to do this myself. Yeah. So Jobs GPT that we built, Mike, as you were doing the intro here, I was like, oh shit, I could just totally go into jobs GPT and I could train it to do token prediction.
[00:52:15] Mm-hmm. So like Jobs, GPT takes any job. So let's say Mike's job is a chief content officer, and I give it some context, here's what Mike does. Jobs, GPT will analyze how AI could be infused into Mike's jobs. We could probably go into the custom GPT and train it to then also actually predict what model would be best to use for the different tasks.
[00:52:36] Hmm. And then what the token budget should be related to those based on how often Mike does the task. So in my mind, I think there's actually probably a way to build an app or AGI PT in probably an afternoon, right? That could at least get you in the direction of starting to think, okay, first we analyze what tasks and workflows AI can help with.
[00:52:55] Now we need to go to the next level and say, but what model is best to use? And what is the [00:53:00] estimated token cost of that based on how often Mike does the thing? And I think there's probably something there. And honestly, if I was running a consulting firm, I would be. Probably racing to like build some services around this because the demand for this is gonna be through the roof.
[00:53:14] Mike Kaput: Yeah, for sure. And just one final note, your comment about AI literacy was also something I was thinking about because I was looking at this and as we're trying to solve for these things, it's like not only just token efficiency and AI literacy around that, but also like give two people the same amount of tokens.
[00:53:31] They're gonna use them very differently. So like how do you even figure that out? Because I could. Very much see the argument that there are some people that don't even know which problems they should be working on using the token. Totally. So it's like this whole mess to figure your out. Well,
[00:53:47] Paul Roetzer: yeah. And again, like the AI literacy is so fundamental because like I, I just pulled up Claude right now.
[00:53:51] Yeah. My options are Opus 4.8, high sonnet 4.6, or Haiku 4.5. And then [00:54:00] I also have effort, low, medi high, extra max. I can toggle on thinking, can think for more complex tasks. And then I have a dropdown for more models, which is Opus 4.7, 4.6 in Opus three. Mm. The average knowledge worker has zero idea what the difference is between those things.
[00:54:20] Maybe they understand that thinking means it's gonna like work harder. Right? But if they don't know what a reasoning model is, they don't know what chain of thought is relation to models. They have no idea how to use claw effectively. So if 4.8 is the one that's picked. Like outta the box and high, they're gonna burn tokens to write an email that Haiku 4.5 could have done just as well.
[00:54:45] And now all of a sudden. And so that's, that is just the most very basic thing. And so just simple AI literacy. Yeah. Would would like dramatically change the token burn within companies?
[00:54:56] Mike Kaput: Not to mention then we'll move on, but this [00:55:00] becomes several orders of magnitude more complicated when you factor in something AGI agentic.
[00:55:05] Like a Claude code. Like for instance, cowork. If I ask coworker Claude code to go, let's say I want to do a research project. That's four big things I wanna research. It. I don't have control out of the gate unless I specify how it does that. It will say, Hey, you know, I'm gonna fire up four agents in parallel.
[00:55:24] Well, guess what? That's like having four windows open on the same account that all use a hundred thousand tokens each. So it's like, I don't, I have a better intuitive sense of that. Now, having used the tool, I don't know how to predict that. Mm-hmm. I, you would have to say, don't ever spin up agents or something.
[00:55:40] I perhaps I, it's right. I have no idea. The agent thing is outta control. The usage is so vast. I don't know how you're supposed to regulate, even if you nail down the model piece of it.
[00:55:51] Paul Roetzer: And I don't even think Mike like that. We as administrators in that account can control that for our. Users. Like I can't go in and say, okay, Mike isn't allowed to use [00:56:00] these models.
[00:56:00] Or like, right, right.
[00:56:02] Mike Kaput: I don't think you can. Yeah. Or like specify for it to work in like low. And I'm sure they try to optimize a bit on the back end of like the token efficiency. But again, it's like you can't stop it unless you explicitly say so from like, don't spin up an agent's warmth. Don't go do this the way you think it should be done is basically what you're saying.
[00:56:21] Right. Yeah. So. Interesting stuff. I can sit around. I'm sure we're gonna have sit Yeah. All day. Yeah. We'll have more to share at some point as we try to figure this out. Yep. Alright, before we dive into Rapid fire, one more announcement this week. This week's episode is brought to you by our upcoming AI for B2B Marketers Summit.
[00:56:38] This is a virtual event happening June 25th. It is presented by our friends at Intercept. If you're a B2B marketer trying to actually put AI to work, this is built for you. It is a half day of practical, real world content. You can apply right away. Obviously, AI is quickly becoming standard in B2B marketing, so this event is designed to help you stay ahead of [00:57:00] it and.
[00:57:01] During the summit, you'll learn things like how to create better customer experiences with ai, accelerate your content with ai, align sales and marketing with ai, build the right team, structures, roles and workflows for AI adoption, and navigate some of the harder parts of AI like ethics, transparency, and data security.
[00:57:18] So this event actually, thanks to our sponsors, has a free registration option if you're interested. So head on over to B2B summit.ai, B2B summit.ai to register and save your spot for June 25th.
[00:57:34] Apple WWDC
[00:57:34] Mike Kaput: Alright, so diving into Rapid fire this week, Paul. So first up, this past week we are getting kind of some news front running Apple's Worldwide Developers Conference.
[00:57:44] WW DC Bloomberg published some new reporting about how Apple kind of behind the scenes finally got serious about AI and kind of what to expect once the event hits, when they're expected to unveil kind of their big AI [00:58:00] reset. So Bloomberg is just some interesting context we've learned. They reported that back in early 2025, Apple's top executives with the notable exception of CEO Tim Cook held a secret meeting to confront how far the company had fallen when it comes to ai.
[00:58:15] So at the time, as we've talked about, apple Intelligence has kind of, been a bit lukewarm. The promised overhaul of Siri was being delayed again. At the time, several products, including a tabletop robot and AI glasses were stuck because the underlying AI was not ready. Now according to Bloomberg, that meeting led to a push for fresh AI leadership Cook himself became far more directly involved in the company's AI roadmap.
[00:58:41] And so at ww DC, this is where Apple is hoping for a turnaround. So at the event Apple is making its AI turnaround a centerpiece, the showcase will be on a completely rebuilt Siri that is at least hopefully fully AI forward. this includes something we have previously reported on, which is Apple is actually set to [00:59:00] pay Google around a billion dollars a year for a custom version of its Gemini model to Power series cloud features, along with Apple's own on device models.
[00:59:09] iOS 27 is also expected to let people pick third party models like Gemini Cloud and chat GPT through a new system-wide search or ask in interface. So Paul, after gears, at this point of Apple kind of falling behind on ai, like what are you hoping to see at WD?
[00:59:28] Paul Roetzer: Just something more functional than we have.
[00:59:30] Like I, I, it's not hard to improve on Siri. It's a relatively useless technology for the most part. yeah. So I, I, I don't know. I've, this reporting was somewhat shocking to me. Like, I mean, we've been talking about how far Apple was behind since, you know, 2023. Mm-hmm. Like when chat really started exploding and it's like, what, where's Apple in all this?
[00:59:53] And the fact that the, like the early 2025 is when this like secret meeting happened. Like what, [01:00:00] where were they for the two years? Right. so that was wild to me to read that. And so obviously someone in the room told the, the writer everything. Like, it was, it was pretty crazy. so I think that we're going to get a much better version of Siri.
[01:00:17] I think it's going to be good. I, I don't think it's gonna be like life changing, replacing your use of chat GPT or Claude or anything like that. But the idea of an always on assistant that's connected to the tools and apps on your phone and able to. Do things and hopefully go through a chain of thought and not just, you know, give you the weather or the wrong song you asked for when you, you know, you're driving and you ask Apple Music for something.
[01:00:39] if it's, if they just solve voice, actually working, if they solve, um. Me like talking what the text message I wanna do and actually get the words correct. Like I, it would not take much for me to like really appreciate Siri more than I currently do, right? So I don't expect it to [01:01:00] be transformational yet, but I expect it to be much better and give an Apple's distribution, like that's gonna be enough for now.
[01:01:09] And then I think they can kind of grow over time and improve on it. But it does seem like they finally snapped out of it and realized how bad Siri really was and are doing something about it. So I'm very hopeful. Now, you will have likely heard what it is by the time you listen to this 'cause on June 8th Monday is when they're supposed to like announce a lot of this.
[01:01:29] So we'll definitely cover more, on the next episode and talk more in detail about exactly what they announced.
[01:01:36] OpenAI Is Merging Codex and ChatGPT
[01:01:36] Mike Kaput: Alright, next up, OpenAI held a livestream event called Intelligence at Work. This past week they talked about how they're kind of combining their Codex coding tool with chat GPT. So Codex has started out as a tool that helps software developers write code.
[01:01:50] OpenAI now says it will put Codex inside the chat GPT app everywhere on desktop, mobile and the browser. And that's happening over the next few weeks. [01:02:00] codex is evolving pretty quick instead of being just a coding assistant. openAI's is kind of shopping it as now and turning it into a general work agent, meaning software that can carry out multi-step tasks on its own for any kind of job.
[01:02:13] The company said Codex can now handle work like data analysis, sales prep, marketing assets, product design, not just programming. So. To make all this possible, openAI's launched actually six role specific plugins, which are add-ons that connect codex to the tools. Different teams already use these cover data analytics, creative production sales, product design, public equity investing, and investment banking.
[01:02:36] Together, those plugins bundle dozens of popular business apps and connectors and including to Snowflake, Figma, and Salesforce. They have over a hundred built-in skills. At the same time, OpenAI also announced a feature called Sites. So with sites you can describe what you want in plain language, and Codex will build an interactive internal tool like a dashboard, project tracker, or a workflow app, and [01:03:00] then host it at a shareable link with Workplace Sign In.
[01:03:03] Built in, this is rolling out first to business and enterprise customers. So Paul, I'm curious, what do you make of openAI's is basically sounds like turning Codex into, some type of version of like, Claude Cowork sites is really interesting. This, this could get very compelling very quickly.
[01:03:20] Paul Roetzer: Yeah. And they, they were basically saying like they don't see the chat interface as the future and they're just trying to build this super app.
[01:03:25] But I, I mean, you know, you and I, and I'm sure a lot of our listeners have experienced the power of Claude Code and Codex and what it like being able to build interactive tools, even if you're just doing HTML versions of things or working with a tool like Lovable or Rept. Yeah. When you give non coders the ability to just spin up apps and websites with words, I, I, it's so transformational to the way work is done.
[01:03:53] Um. The implications are just so vast. Like it's hard to really wrap your head around what this looks like. [01:04:00] And unless you've built in it, unless you've done it yourself where you're like, oh, I got this idea for an app or a dashboard, and you talk to Claude and, and like five minutes later you have a functioning HTML file or an actual working app that does the thing that six months ago you had no ability to build yourself 'cause you're not a coder, it, you just look at everything differently.
[01:04:20] And so I think that over the next, you know, few weeks, we're gonna really start to see a lot of this. I would imagine the new model will get from openAI's probably this week. Like it's just gonna start happening so fast. And going back to our first conversation, like most businesses, most leaders just are not prepared for the accelerated rate of how quick the model is gonna improve and how quick the capabilities are gonna evolve.
[01:04:44] That changes the way you think about the structure of your team and how you hire and train people. It's just, I dunno, it's just coming so quick.
[01:04:53] Mike Kaput: You know, one question that came up for me as I was reading through this, I have no idea what the answer is, is basically [01:05:00] just if we assume that agents will be baked into chat, GPT, which is what this sounds like.
[01:05:05] Yeah. Like what happens to how we think about AI policies? Because we've kind of been thinking of this almost separately. Like, Hey, you're probably using chat. You might not have agents enabled or turned on, but like what happens when chat JPD is just. Agent gets it,
[01:05:21] Paul Roetzer: or like it starts to see your prompt and say, Hey, you know, this is a great prompt.
[01:05:24] This actually would make a great agent. I can spin up an agent for you to do this, or I can create a skill for this, or I can build you an app that does this and mm-hmm. So the prompt, you gave it the thing you thought you needed, it actually thinks in first principles and says, no, no, no. You don't need what you think you need.
[01:05:38] Let me actually build you something that does that thing. Right? Yeah. Like how do you manage that? How do you manage people being empowered to do that on the fly and maybe somebody inadvertently obsolete their coworker. It's like, Hey. Right, right. I just prompt them with Claude and it sort of recommended I build an agent that does the thing you do.
[01:05:56] Sorry. Like I know, like that's real, like [01:06:00] Yeah. It's having a proactive agent builder that just analyzes your prompt. Like, that's totally doable today. Yeah. I don't know, man. It's
[01:06:07] Mike Kaput: interesting times.
[01:06:08] Paul Roetzer: Every time we talk, every time we do this episode, every week, I'm just like, as a leader of a company or more and more like, oh man, I'm not even prepared for the future.
[01:06:17] Like, this is crazy. Right, right.
[01:06:19] OpenAI Distances Itself from Brockman's Super PAC
[01:06:19] Mike Kaput: Alright, so some more openAI's News. They did something a little unusual this past week, so they put out a public statement to distance themselves from a super PAC backed by one of Open AI's own co-founders. So, as a reminder, a Super PAC is a political fundraising group that can raise and spend unlimited money to influence elections and policy in the US as long as it doesn't coordinate directly with a candidate.
[01:06:42] So the PAC we're talking about is called Leading the Future. It's a pro AI super PAC that we've talked about in the past. It pushes for policies friendly to the AI industry. It opposes certain state level AI regulations. It is raised well over a hundred million dollars from major tech players, [01:07:00] including notably openAI's president and co-founder Greg Brockman and his wife.
[01:07:04] They're some of the biggest backers of this group. And in this statement, OpenAI said basically there have been questions around leading the future, which has re received support from Brockman and his wife. They added that any engagement with the organization has been in a personal capacity, not on behalf of the company.
[01:07:20] They said they do not direct the activities of the group or have visibility into their operations and said that no outside political group speaks for openAI's. So, Paul, why do you think they felt the need to put this statement out? Now, this is a group that's been around for a little bit, at least
[01:07:37] Paul Roetzer: I, I mean, I assume they're just getting blow back from investors and preparation for the IPO, you know, it's gonna be more scrutiny on this sort of thing.
[01:07:45] Um. That's a tough one. I You can say whatever the hell you want. The guy's, the president of the company or whatever his title is, like he's top three, top five in the company. Yeah. It's kind of hard to separate his personal right. [01:08:00] Beliefs and investments from the company because he's making decisions that influence the direction of the company and like his personal money's different, I guess, but like.
[01:08:12] You don't like, forget that you're also the biggest backer of that pack. Right. And then like, go about your job in a unbiased way. So it's a, it's a really tricky one for any company. You know, this is just opening eyes specifically, but, you know, other leaders of other companies give money to political parties and super pacs and it's hard to distance it.
[01:08:32] When you get that involved in politics, you're, it's who you are. Like it's what you believe and you're making it very public. And so it's hard to just say, yeah, but I'm unbiased when I do my job every day. It's like, no, you're not. It's human nature. You're not right. Like nobody is. So, I don't know. Yeah, it's, it's a tough PR.
[01:08:49] Campaign to win.
[01:08:50] Mike Kaput: Yeah. And they have plenty of PR problems as it is, I suppose. Yeah. All right.
[01:08:55] AI Roll-Up Targets the Accounting Industry
[01:08:55] Mike Kaput: So next step. This past week, Forbes reported on a very interesting example of investors using AI to roll up a single traditional industry, in this case, accounting. So a roll up is when a company buys up a bunch of small, independent businesses in the same field and combines them into a larger operation.
[01:09:14] So the company behind all this example that we're about to discuss is Thrive Holdings. This is an AI focused holding company spun out of a famous venture firm called Thrive Capital Run by Joshua Kushner. Thrive says it plans to commit $1 billion to acquire local brick and mortar accounting practices across.
[01:09:34] The US and they're basically going to apply AI to these businesses all as one business and make it much more productive and performance driven using that technology. They've actually had about two years of acquisitions through their accounting platform that they have, which is now called current. the strategies basically buy these practices and equip them with ai.
[01:09:54] current, that AI platform or accounting platform says it's openAI's power [01:10:00] tax AI tool process, 7,000 tax returns this past season and cut tax prep time by about a third with up to 98% accuracy. So Paul, I'm curious, this is not only interesting for accounting, but seems like an obvious playbook for certain industries, perhaps most industries.
[01:10:20] Paul Roetzer: Yeah. This is the thing we've been, you know, talking a lot about for years, is if you're private equity, this is the play. You just pick an industry and you go in and you roll up firms and you unlock margins and. Just apply AI like this. This is. Again, it's like an inevitable thing. This is what's gonna happen across industries.
[01:10:38] Now the account industry is ripe for it because they actually have a talent shortage. So the articles as, as, as a number of CPA candidates declined 30% between 2016 and 2021. the need for AI became greater. Like it's, it's a demanding industry. They work long hours, especially during tax season.
[01:10:57] There's a lot of, [01:11:00] monotonous work, like data-driven work where you're just doing these repetitive processes. And so if you can infuse AI that can all of a sudden start taking some of that burden off, one, it, it helps with this gap in talent. But two, it actually accelerates the automation of the industry completely, or to a large degree.
[01:11:18] So I think it's a good article for people to read. If you want to, again, you may not be in the accounting industry, but look at this as a example of what's gonna happen because the tax AI that they're talking about was just specifically trained, you know, for the tax returns. And I think it was like these, what.
[01:11:34] I forget the exact thing that they did it on, but it was a very like niche though. The 10, 40 and 10 41 tax returns. Yeah. So like a very tedious area, but they proved it could work there. And now it's like, okay, now we're gonna expand it out and we're gonna go to like, all, functions in accounting. So this is what'll happen, like you're just gonna have PE firms or venture capital firms that just, or, or, I don't know, venture studios who just like, they just build AI native [01:12:00] versions of something or they just go roll up existing ones and then they infuse AI into 'em and they just make the industry more efficient, more productive, and hopefully more innovative.
[01:12:09] But, um. Yeah, it's kind of like the AI researcher thing. Yeah. Like you gotta watch for these early instances of it and then try and extrapolate, okay, what's that gonna mean when they take this out further and start going into these other areas?
[01:12:21] Mike Kaput: Mm-hmm.
[01:12:23] AI in Higher Education
[01:12:23] Mike Kaput: All right. Next up, some, AI and higher education updates.
[01:12:26] So, a few different stories we're kind of tracking here. So there's a piece in New York Times magazine that took a look at how AI is reshaping American universities. So they actually did a deep dive into California's, California State University because, this is a university system which we had reported on a while ago that spent nearly $17 million on a deal with openAI's to provide half a million licenses of chat GPT to students, faculty, and administrators at the time.
[01:12:52] This was described as the largest single institution deployment of chat GPT in the world. But this article now reports that [01:13:00] chaos and conflict have followed. So the faculty union organized against the deal. Professors delivered a petition urging CSU to cancel the contract. They argue that chat GPT was not designed, trained, or optimized for ED education.
[01:13:13] they a lot of the frustrations around the education systems priorities, the system leaned into AI while facing a serious budget crisis and cuts elsewhere. A survey of more than 94,000 students and employees found that 52% of faculty said AI was having a negative effect on their teaching. Even so, this system recently renewed its openAI's contract.
[01:13:34] Meanwhile, the University of Chicago President Paul Vetos, told students and staff the university is partnering with Anthropic to provide Claude access. Faculty and staff get access starting in July, and all students will have access before the fall term. Finally, some new research, from Stanford Law School had 16 law professors evaluate nearly 3000 anonymized head-to-head comparisons of answers to contract law [01:14:00] questions.
[01:14:00] Some were written by ai, some were written by other professors. In that blind evaluation, the professors preferred the AI answers 75% of the time. They also flagged AI responses as what they would call pedagogically harmful only 3.5% of the time compared to 12% for the answers written by fellow professors.
[01:14:21] So Paul, a couple of, not exactly related stories, but kinda showing that higher ed is facing some challenges and opportunities, just like anyone else trying to integrate ai, it sounds like
[01:14:31] Paul Roetzer: Yeah. That, that, the Stanford one's fascinating. I I, yeah. When I first saw those data, I was like, geez, oh man. Like, that's right,
[01:14:37] Mike Kaput: right.
[01:14:37] That's hard
[01:14:38] Paul Roetzer: to figure out the implications of that, but I think more and more that's gonna be the case. We're just like. AI answers if they're, if you can't directly tell it's ai. 'cause I think there's a bias by humans to say, oh, I don't like the AI version of this. It's like, but if you don't know, you truly don't like, I think more and more there's AI answers are gonna be preferred and a lot of disciplines, not just law.
[01:14:57] Mike Kaput: Yeah.
[01:14:57] Paul Roetzer: yeah. I would imagine the universities are [01:15:00] struggling with the adoption, just like any enterprise would be. If you just go buy the tools and infuse 'em in, like that's not gonna solve everything. You gotta think about things in cohorts. There's gonna be people who hate ai. There's gonna be people who, you know, are early adopters and love it and can't live without it.
[01:15:14] And there's gonna be people in the middle who just haven't had time to learn how to do it. You're gonna get pushback from the crowd that doesn't like it. Or if you don't deploy it with a change management strategy, then yep, it's just gonna fail. So, you know, it's just, education's a microcosm of what's going on in enterprises too.
[01:15:30] It's not that different. And I think we're just starting to finally get these case studies in a way where we're now one two years into universities that have actually gone all in and tried to do this. And you're gonna start getting these hard lessons learned by the people who are the early adopters, just like we are with enterprises.
[01:15:47] Mike Kaput: Yeah. I imagine there's also just a, a magnified version of the sentiment and PR problem within universities too, is it's kind of ground zero for all the disruption happening.
[01:15:57] Paul Roetzer: Well, yeah, and you know, it's a hotbed for obviously, [01:16:00] activism. Like, you know, you're gonna have students who are gonna get involved and, and have feelings about the environmental impact, economic impact, like it, they're, they're at the center of this.
[01:16:09] And so you have to deal with also with that within an environment. yeah, it's really complex. Like to do this within a university is very hard. and I think we're just, like I said, we're just learning the lessons now and it's just gonna take a while to figure out how to do this the right way and to address the, the real concerns people have about the technology.
[01:16:29] AI Use Case Spotlight
[01:16:29] Mike Kaput: Yeah. Next up we have our AI use case spotlight. So every week we give you a quick look under the hood at the real AI use cases we are exploring in our own work at SmarterX. I'm gonna share a quick one, Paul, and if you've got anything to share, let me know. for mine this week, I want to kind of similar to last week also point to like a way more of a way I'm starting to work with AI rather than one specific use case.
[01:16:52] And this kind of was inspired by a post from Allie k Miller, who you should definitely follow. She's really sharp, and really good [01:17:00] with, actually putting AI to work. So we'll include a link to the post I'm about to talk about. But basically she posted something last week that I have started to experiment, building my own work around, and she basically said, look.
[01:17:13] The way most of us are taught to find AI opportunities in your work is already behind. the old way is you look at your tasks and you speed up the slow ones using ai, which I don't disagree with. I think it's a huge part of what we teach in a necessary first step. But she also argued that you can aim way higher, way faster.
[01:17:31] because speeding up your work assumes the work itself should stay the same and just be faster, but in many cases it shouldn't. So it, she says, instead of starting just at looking to do things faster, start with three things, your goals, context, and an interview. So basically first, tell the AI what you're actually trying to achieve, not what you produce or how you do it.
[01:17:52] Then give it context. We know this is super important from a prompting perspective. You connect it to whatever context you are allowed to give it [01:18:00] your docs, email, data, et cetera. And then finally just have it interview you. One question at a time about the goal of the work itself, like who hands what to who, what you are doing on a repeating basis, who receives it, what's supposed to happen to it, and basically she just advocates working backward.
[01:18:18] From first principles, which is for everything you're doing, does this thing even need to exist? What outcome was it meant to create? And she actually provides a prompt in the post itself. I've used a great effect to kind of almost re-architect some of the things I'm doing. It'll basically challenge you, like, does this have to exist in its current format?
[01:18:35] Here's a smarter, better way to do it. So I found this really helpful, Paul. Just challenging myself, not just to get more productive, but to innovate more in what I'm trying to do still very early. I'm not saying this is the only approach people should take. All the other, you know, standard advice and workflow mapping and things that we teach, I think is deeply important and a really good way to get started.
[01:18:56] But this was just really good food for thought and I'd highly recommend people take a look. [01:19:00]
[01:19:00] Paul Roetzer: Yeah, I, I, so one of the kind of tricks I've used for a while on strategic work, which is a lot of what I use AI for, it's like high level strategy or building like MVPs of things just to like get apps outta my head and you know, build interactive charts, things like that is I will just.
[01:19:16] To tell it, Hey, here's what I wanna do. Let's do this one step or one question at a time. Like, what do you need to know from me? But like, let's go through one at a time. And it is, it's very therapeutic. It's very helpful. As like, you know, someone who's trying to think about something to just have it do, because AI wants to just solve stuff for you, wants to just like give you the plan or give you, here's the 20 questions.
[01:19:40] Like that doesn't do anything. Like, let's just go through this a step at a time and then you may deviate and say, Hey listen, actually let's go in this other direction now, based on what we just talked about. So for me that is like the vast majority of the way I use AI is I just have these like conversations about it and I try and like keep it focused on going through these steps with [01:20:00] me.
[01:20:00] So yeah, I love the approach she's talking about there. I find it to be extremely helpful.
[01:20:05] Mike Kaput: Yeah.
[01:20:05] Paul Roetzer: And then the context has historically was whatever you could put into the brief, right? It's like, here's what you need to know. Here's a bunch of bullet points. Now the context's increasingly becoming what are you connect it to?
[01:20:15] Here's my CRM database, here's my Google Drive, here's my inbox. and that becomes the ongoing context that lets you really work with these models and agents in new ways.
[01:20:25] Mike Kaput: Yeah. Yeah, it's really, really valuable approach. I would recommend everyone go take a look at that.
[01:20:29] AI Product and Funding Updates
[01:20:29] Mike Kaput: alright, Paul, as we wrap up here, I'm gonna run through very quickly some AI product and funding updates, and as we close out the episode.
[01:20:38] So first up, SpaceX, as we've talked about, is expected to go public around June, June 12th. Under the ticker SPCX. This is going to be perhaps the largest IPO in history. There's underwriters targeting a valuation of roughly $1.75 trillion. There may be a raise of up to $80 billion. Next up. As we've also uh [01:21:00] mentioned, Anthropic and openAI's are expected to drop new models very, very soon.
[01:21:04] Anthropic is reportedly set to widen access to its mythos class model. openAI's is rumored to ready be readying. Its next GPT update, though neither has officially confirmed timing. openAI's is also published a policy blueprint for governing frontier AI in the us, proposing a, what they call reverse federalism approach, where states pioneer AI safety rules and the federal government absorbs the consensus.
[01:21:29] Anthropic has expanded Project Glass Wing, its program that gives organizations access to its Claude Mythos model to find security flaws in critical software. They've expanded to roughly 150 new organizations across more than 15 countries. Google has rolled out Gemini Spark in beta, which we mentioned on a previous episode.
[01:21:49] It was announced. It's a 24 7 personal AI agent that runs in the cloud even while your devices are off and can carry out multi-step tasks across Gmail, calendar, drive, docs, and [01:22:00] sheets. Google also released Gemma four 12 B. This is a free, open weight multimodal model. that processes text, images and audio without separate ENC coders, and it can run locally on a laptop with just 16 gigabytes of memory.
[01:22:15] Super important to start tracking that kind of thing with all the usage and cost questions we've been talking about. We also of course, talked about how Microsoft launched its seven new in-house AI models. Just a little more on this, this includes one called MAI thinking one, which is a reasoning model that the company says scores 97% on the AI ME 2025 math benchmark.
[01:22:39] They say it is preferred over Claude Sonnet 4.6 in blind human evaluations, Microsoft and Mayo Clinic also. Together announced a collaboration to build a frontier AI model for healthcare that will be owned actually by Mayo Clinic and will be trained on its de-identified clinical data to support earlier diagnoses and treatment [01:23:00] decisions and later distributed worldwide through Microsoft's Azure Foundry APIs.
[01:23:06] A couple other updates here. Hackers actually gained access to some high profile Instagram accounts, including the Barack Obama White House account and the brand Sephora. They did it by simply asking Meta's AI support chat bot to change the email address link to those accounts, which it did without proper verification.
[01:23:25] And finally, meta introduced the Meta business agent, which is an AI tool that is free to get started. There's paid tiers coming. It basically lets businesses automate customer conversations such as answering questions, recommending products, and booking appointments across WhatsApp Messenger. And now Instagram.
[01:23:44] Paul Roetzer: I wanna keep an eye on their mike. the Microsoft one, that's their effort to become one of the major Frontier Labs. Yes. So that's, we'll, we'll talk more about Microsoft and their efforts, you know, in the, in the coming weeks. But, they came out and basically said they want to be one of like the four or five [01:24:00] major labs, and this is their effort to break free of.
[01:24:02] Reliance on open AI's models in essence. Gotcha. Yep.
[01:24:07] Mike Kaput: One final announcement here. We talked about our AI pulse survey at the top of the episode, SmarterX AI slash pulse. This week's survey we will have live, when you listen to this. we're gonna ask a little bit about how much of your day-to-day work AI is already doing and get your take on the US government's plans to possibly take stakes in AI labs.
[01:24:27] So Paul, thanks for breaking everything down. Feel like we're a bit in the calm before the storm here, before all these model drops, but still plenty going on this week.
[01:24:34] Paul Roetzer: Yeah, it should be a busy week. Thanks everyone for joining us. As always. Thanks Mike for curating everything, and we'll talk with you again next week.
[01:24:43] Thanks for listening to the Artificial Intelligence show. Visit SmarterX.AI to continue on your AI learning journey and join more than 100,000 professionals and business leaders who have subscribed to our weekly newsletters, downloaded AI blueprints, attended virtual and in-person events. [01:25:00] Take online AI courses and earn professional certificates from our AI Academy and engaged in the SmarterX Slack community.
[01:25:07] Until next time, stay curious and explore ai.
Claire Prudhomme
Claire Prudhomme is the Marketing Manager of Media and Content at the Marketing AI Institute. With a background in content marketing, video production and a deep interest in AI public policy, Claire brings a broad skill set to her role. Claire combines her skills, passion for storytelling, and dedication to lifelong learning to drive the Marketing AI Institute's mission forward.
