Josh Woodward: Google Labs is Rapidly Building AI Products from 0-to-1
As VP of Google Labs, Josh Woodward leads teams exploring the frontiers of AI applications. He shares insights on their rapid development process, why today’s written prompts will become outdated and how AI is transforming everything from video generation to computer control. He reveals that 25% of Google’s code is now written by AI and explains why coding could see major leaps forward this year. He emphasizes the importance of taste, design and human values in building AI tools that will shape how future generations work and create. Mentioned in this episode: Notebook LM : Personal research product based on Gemini 2 (previously discussed on Training Data .) Veo 2 : Google DeepMind’s new video generation model. Paul Graham on X replying to Aaron Levie’s post that “One approach to take in building in AI is to do something that's too expensive to be reasonably practical right now, and just bet that the costs will drop by 10X or 100X over time. The cost curve is on your side.” Where Good Ideas Come From : Book on the history of innovation by Steven Johnson. Project Mariner : Google DeepMind’s research prototype exploring human-agent interaction starting with browser use. Replit Agent : Josh’s favorite new AI app The Lego Story : Book on the history of Lego. Hosted by: Ravi Gupta and Sonya Huang, Sequoia Capital
- Published
- Published Mar 18, 2025
- Uploaded
- Uploaded Jun 11, 2026
- File type
- POD
- Queried
- 00
Full transcript
Showing the full transcript for this episode.
AI-generated transcript with timestamped sections.
[00:00] What I found too, building products over the years is it's very common. Everyone talks about product market fit. You'll know it when you see it and all that, which is true. But at least for me, I've always felt in the first part of building products, you iterate a lot on the product and sometimes you forget to iterate on the market. [00:15] And finding the right market side is also just as important as the right product. And you have to connect those two. [00:21] And so I think that in these early stage things with Mariner, that's where we are. Is it possible for an AI model to drive your computer? Yes. That's a huge new capability. Is it accurate? Sometimes. Is it fast? Not at all yet. That's kind of where we are in terms of the actual kind of use case or the capabilities. And then it's about finding the right market. [00:51] Bye. [01:02] Today we're excited to welcome Josh Woodward from Google Labs, the team behind exciting Google AI launches like Notebook LM and the computer use agent Mariner. [01:11] Google Labs is Google's experimental arm that's in charge of pioneering what's next in how we interact with technology. [01:17] by thinking about how the world might look like decades from now. [01:21] Josh is helping to reimagine human-AI interaction. [01:24] From the provocative claim that writing prompts is already becoming archaic, [01:27] to the emergence of multimodal AI as a default user experience.
[01:32] He shares insights on the rapid innovation culture in Google Labs, [01:36] offers a glimpse of what's next in generative video, and much more. [01:41] Josh, thank you so much for joining me and Ravi today. We are excited to hear everything that you're doing over at Google Labs. Maybe first to start, you mentioned a provocative topic to me on your way in here. Writing prompts is old-fashioned. What do you mean by that? Okay, so thanks for having me. I do think it's old-fashioned. We'll look back at this time [02:06] paragraph level prompts into these little boxes. So I kind of see it splitting a little bit right now. On the one hand, as a developer, an AI engineer, I'm going to say, well, I'm [02:15] You should see some of the prompts that we're writing in labs right now are these beautiful, like multi-page prompts. But I think for end users, they don't have time for that. And you have to be almost like some sort of whisperer to be able to unlock the model's ability. So we're seeing way more pull and traction. I kind of seeing this in other products in the industry, too, right now. How can you bring your own assets maybe as a prompt? [02:45] giant paragraph writing. So I think it's going to kind of divide. I think as engineers, AI engineers, you'll keep writing long stuff. [02:53] But I think most people in the world, we're probably in a phase that'll sort of fade out here pretty soon. [02:59] So the form of the context will change, right? That's right. So you still have to give the model something, right? But it might be that you can communicate it via picture or communicate it via like, just look at this set of documents. Yeah. Your voice, a video, any of that. These models love context.
[03:15] So the context is not going to go away, but we're making a lot of bets right now that the type of context and the way you deliver the context, that's changing really fast right now. [03:26] I love it. Okay. We're going to go deeper into the future of prompts and multimodal models in this episode. Maybe before we do all that, say a word on what is Google Labs? You know, what's the mission? And tell us a little bit more about how you sit, where you sit with inside Google. Yeah. So Google Labs, if anyone's heard about it, and we had one a long time ago that went dormant for a while. And this is kind of back about three years ago, it got started. It's really a collection of builders. [03:53] We're trying to build new AI products that people love. So they can be consumer products, B2B products, developer products. It's all zero to one. It tends to attract an interesting mix of people, maybe people who have been at Google a while, but also a bunch of startup founders and ex-founders. And so we kind of mix these people together. And we basically say, what's the future of a certain area going to look like? So the future of creativity or software development or entertainment. [04:23] building and shipping. And so that's how it operates. And it sort of sits outside the big traditional Google product areas, but we work a lot together. But there's kind of an interesting interplay there. And I think that's been part of what's been fun about it is you can kind of dip in and maybe work with search or Chrome or other parts of Google, but you also kind of have the space to explore and experiment and try to disrupt too. And that's kind of what we're up to.
[04:48] How do you create the culture inside a lab that you want? Right. If you think about there's got to be a lot more failure, presumably, than there are in other parts. There's got to be a different metric for success than there is at just the sheer scale of Google. [05:00] So what is the culture you're trying to create and how do you create it? [05:03] So we really pride ourselves in trying to be really fast moving as a culture. So we'll go from an idea to end users hands 50 to 100 days. [05:12] And that's something that we do all kinds of things to try to make that happen. So speed matters a lot, especially in kind of an AI platform shift moment. The other thing is we think a lot about sort of big things start small. And one of the things, if you're in a place like Google, you're surrounded by some products that have billions of people using them. And people forget that all these things started with solving usually for one user. [05:35] in one pain point. And so for us, we get really excited if we get like 10,000 weekly active users. It's like, you know, we'll celebrate that. That's a big moment when we're starting a new project. And for a lot of our other kind of groups inside Google, their dashboards don't count that low. [05:52] So there's kind of this moment where, you know, the size of what we're trying to do is very small, [05:57] It probably looks a lot like companies you all work with, honestly, from that perspective. And I think the other thing we're trying to do is because we sit outside the big groups at Google, we kind of have one foot in the outside world. We do a lot of building and kind of co-creating with startups and others, but also one foot inside Google DeepMind. And so we've got kind of a view of where the research frontier is and more importantly, where it's going. And so we're often trying to take some of those capabilities in.
[06:27] lot of pride in sort of finding people who are very creative, people who are almost like see themselves as underdogs. [06:36] um they have kind of a hustle to them and so we have this whole doc called labs in a nutshell and my favorite section in the doc is called who thrives in labs and there's like 16 or 17 bullets that just list them out um and that's kind of how we try to build the culture but you do have to normalize things like failure you have to think about things differently around promotion compensation all these things that you kind of would do in a company too [07:00] You mentioned the deep mind links. I think that is super cool. What have you found is the kind of ideal kind of product builder persona inside labs? Is it somebody with a research background? Is it somebody who comes from a successful consumer product background? Is there the magical unicorn that's great at both research and products? Yeah, we take as many unicorns as we can find. And we actually have found some, which is great. [07:30] like a consumer sensibility in terms of people exist. They exist. They're great too. Um, if you can find them, uh, and we also kind of have found ways to kind of train or develop people. So that's another thing we think a lot about is like, how do you bring in often people that might not be the normal talent that you look for? So like, we're always in the interesting kind of zone of like who's undervalued, who's kind of like really interesting, but maybe not on paper.
[07:58] But when you interact with them, you look at their GitHub history. I mean, there's all these different signals you can look at. But, yeah, that's kind of how we would think about it. [08:05] Really cool. How do you decide what projects to take on next? Is it bottom up, top down? How does that work? Yeah, great question. We kind of do a little bit of a blend, actually. So at the top down side, we're looking at what are the areas that are kind of on mission for Google that are strategic to Google because we sit inside it. So we're thinking about ourselves in that broader context. So that may be, for example, like. [08:27] what would the future of software development look like? There's tens of thousands of software developers at Google, and obviously this is an area that AI is clearly going to make a big change in. So we'll be thinking about, could we build things for other Googlers, but also externally, how do we build things like that? So we take that kind of top-down view. Think of it as almost, I'm from Oklahoma, we like to fish a lot in the summer, but you're trying to figure out what's the right pond to fish in. So we put a lot of thought into those ponds to fish in, [08:54] But then we let a lot of these teams, often they're four or five person teams, come up with the right user problems to go try to solve. [09:02] And that's where we kind of meet in the middle. And I think for a lot of other teams, they might look at what we do as a little chaotic. You know, we don't have like multi-quarter roadmaps. Like we're trying to survive to the next whatever 10,000 user milestone and then try to grow it. But I would say it's kind of that sort of blend. [09:19] What's one of the products that you guys have built that you're excited about now? [09:23] Oh, yeah. So I guess if you've ever used the Gemini API or AI Studio or Notebook LM or any of VO, any of these things, these are products that we've kind of worked on from labs. I mean, maybe I'll talk about one that's maybe well better known and one that's coming up. So the very excited about where Notebook LM is going. I think we've hit on something where you can bring your own sources together.
[09:46] into it. And really, AI sort of like grips into that stuff. And then you're able to kind of create things. So a lot of people maybe have heard the podcast that came out last year. There's so much coming that follows this pattern. [09:59] um to watch this space uh they um there's just a lot you can do with that pattern and i think what's really interesting is it gives people a lot of control they feel like they're steering the ai we have this term on the team and actually one of the marketing people came out was like an ai joystick oh yeah kind of controlling it so that's interesting um i would say there's a lot of stuff coming right now we're very excited about vo um google's imagery model and sort of video model [10:29] really interesting products coming along in this space. I think maybe we can talk about that some at some point, but I think generative video is kind of moved from this moment of almost possible to possible. And I think this year, yeah, yeah. Well, I think it's, it's interesting. These models are still huge to run like VO2 takes hundreds of computers, right? So the cost is very high, but just like we've seen with the text-based models like Gemini and even the ones [10:59] times in the last year. So if you kind of assume cost curves like that, what you're going to see with these VO models, what's kind of brand new, say with VO2, is it's really cracked, really high quality in physics. So the motion, the scenes, if you talk to a lot of these AI filmmakers, they talk about what's your cherry pick rate, which is a term for like, how many times do you have to run it to pick out the things that's really good? And what we're seeing with something
[11:29] One time got what I want. [11:31] And so the instruction following, the ability for the model to kind of adhere to what you want is really cool. So I think when you put that in tools, you're now able to convey ideas in a whole different way. [11:43] What do you think are the solved problems and the unsolved problems in AI video generation? Because I remember, you know, last year it was like, you know, even last year there were all these, you know, there was so much talk about, you know, generative video is, you know, a physics simulator, for example. Right, right. It can kind of emulate physics. And it's like, that's amazing. Is the physics stuff solved? Do you think? Yeah. Like what else is, you know, what's done and then what's to be solved? Yeah. I would say physics is a hard thing to solve forever. It's close. [12:13] enough yeah but a year six months ago a year ago a few years ago you had will smith eating you know pasta it was a disaster and then even last year you had kind of these videos of like knives cutting off fingers and there were six fingers you know it was like that's where we were um so i think physics tons of progress the ability to do photorealistic quality uh very huge progress the ability to kind of do jump scenes and jump cuts and different sort of camera controls that's [12:43] There's paths to solve all this stuff. Still going to solve the efficiency and serving costs, I would say, and probably still have to figure out a little bit more of like the application layer of this, because I think this is another big opportunity is we've seen like a lot of other modalities with AI, you get kind of the model layer.
[13:01] You get kind of the tool layer and then the real value we think is in this application layer. And so I think that's really interesting to rethink workflows around video. And I think that's pretty wide open right now. [13:13] Do you think that models are capable of even having video that is malleable at the application layer? So for example, if I want to have character consistency between scenes, are the models even capable of that? Or I imagine you want model steerability in order to be able to kind of work with it at the application level. Like what is model readiness and what's required in order to be able to do magic at the application? Yeah. So I was talking to a couple of AI filmmakers this week, and what they're really interested in is exactly what you're saying. [13:40] character consistency, scene consistency, camera control. It's almost like we need to build an AI camera. [13:49] You think of some of the cameras that are kind of filming us right now. This is sort of like decades of technology that's kind of been perfected for a certain sort of input output. And I think we're on the verge of kind of needing to create a new AI camera. [14:02] And when you do that, you can generate infinite number of scenes. You can generate like, oh, you're wearing a red sweater. Now make it blue. And not just in that scene, but in like a whole two hour film. [14:13] So there's all kinds of ways that we're starting to see these prototypes that we're working on too internally, where this is here, like it's coming. We're kind of entire, I think, things that used to either be too expensive or too timely or required a certain skill level. We kind of talk internally on the team about how do you kind of lower the bar and raise the ceiling? And what we think about that when we're building products is how do you make something more accessible?
[14:40] Or how do you make like the pros take it and just blow, you know, the quality out of the water and make it incredible stuff? So that's what we're seeing with video. It's kind of right at that point where both are happening. There was an interesting tweet from or post from Paul Graham recently on this idea, I think, of. [14:56] based on this pace of progress, he's like, you sort of want to be building things that kind of don't quite work. Yes. And are way too expensive. Yes. Right. Yes. Because they're going to work. Yeah. And their cost is going to come way down. Yep. Right. And so I would imagine that has applicability for you guys to particularly in video. That's exactly how we do it. Yeah. I mean, right now, I don't know off the top of my head, but each video eight second clip generated is obscenely expensive. But we're basically building for a world where this is going to be like, [15:26] All right. [15:26] One of the actual principles I've kind of learned just over the last few years working on this AI stuff is make sure your product is aligned to the models getting smarter, cheaper, faster. And if your core product value prop can benefit from those tailwinds, you're in a good spot. [15:43] If any of those are not right, question your existence. That would be my summary takeaway on that. How far do you think we are from having economics of video generation that are right side up? Where it costs less to generate the thing than the economic value of generating it? Oh, wow. This is tough. This is a prediction you're never really sure about. I don't know. But I would say one thing we're seeing just as we're modeling out a lot of costs,
[16:13] Yeah. [16:15] we're probably going to need innovation on the business model side, in addition to just the product and the application layer. And what I mean by that is you could, our first thought was, oh, let's just make a subscription and then just charge per usage on top. [16:28] That might be a way to do it. Another way to do it is when you talk to some of these creatives, whether they're in Hollywood or even these AI filmmakers that are popping up, they're kind of like – [16:38] Okay, I want this output, and I'm willing to pay this much. And it's kind of a pay per output, kind of, which you've seen in other cases, AI companies are starting to do some of this too. But for sort of film and video, that's, that's a little bit how you'd think of doing a project if you're a producer. [16:56] But now you're kind of imagining it at the individual creative level, which is kind of interesting. So that's more like almost like an auction type model, potentially. So I think there's a lot to explore. I think we're probably, though... [17:07] The pace things are moving, it's on the scale of quarters, I think, where it starts to get interesting, as opposed to many, many years. So that's, yeah, I think there's a path. You talked about the pace of progress a couple of times. Yeah. Do you think it's accelerating? You have a unique view in the deep mind, and let's use that as a... [17:26] I don't know, Harbinger for some of the others. Yeah, yeah, as a proxy. Yeah, yeah. What, what, where are we at? Are we accelerating? Are we, you know, on a crazy trajectory and maintaining the same one? Like, yeah, I'm interested.
[17:56] author on the team we actually hired his name is steven johnson he co-founded notebook lm well when we first brought him on and he talks about this notion of like there's adjacent possibles [18:05] He has this really interesting book on the history of innovation. And I feel like right now it's like you walk into this room and there's all these doors that are opening up into these adjacent possibles. And there's not just like one room and one door. It's like one room. Like it feels like 30 doors that you can go explore. So I think that's what it feels like on the inside. I love that visual of the rooms and then the adjacent possibles. [18:30] I'm going to steal that and maybe take it and call it my own. Classic VC over here. [18:38] What do you think the future of video consumption looks like for us as consumers? Am I still looking at Hollywood-style feature films that are created by Hollywood studios, just done a lot more cost efficiently? Am I looking at a piece of content that's dynamically generated to what you know about me and it's only for me to watch? What do you think the future of consumption is as a consumer? [19:00] This is one of those that could go in spider in many different ways, I would say. I'd say some of the things we're excited about and what we see. So I think the future of entertainment is way more steerable. [19:10] So right now you think about you sit on your couch like this and you maybe scroll through something or whatever you cast it on, bring it up on the TV. So it's going to be way more steerable where you can kind of interject if you want and maybe take it certain ways. We think that's one area. We think another is personalization, like you said. If you think today about YouTube, TikTok, any of these algorithms that can kind of figure out this is what you're interested in. Imagine that I think way more extreme that could be kind of fine tuned to sort of what you want to share with the model.
[19:40] bit is a lot of this i think is going to be generated on the fly [19:42] So another theory we have is that just like there was a rise of kind of a creator class couple, whatever, 10, 15 years ago that powered YouTube and the rest. [19:52] There's going to be a shift or maybe it's a different set of people that we think of as like curators where you curate stuff. [19:59] And you work with the model to maybe create things. And I think another loop in that is how you can remix all this. And so that's another big part of what we see in the future of entertainment is that there will be like, oh, I kind of like that, but then make it more like this. And if you think, you know, at some level, the cost, the time, the skills required of this is literally maybe just like tapping a button or just describing it. And you get kind of different versions. That's kind of where we see some of this going. [20:29] percentages hold. Like we know today that a lot of times certain percentage, like 90, 95 percent, just consume from platforms and you're a very small creator class. So like, will that balance change? But I see a totally different ways you could think about content platforms that have some of these native controls. Like, for example, will we expect UIs that have a join button [20:53] where, you know, today are you guys maybe have a play, pause, whatever, save, bookmark something, star, heart it. Like, will there be like new things where you join and they're like, oh, hey, Sonia, Ravi, what do you want to talk about? You know what I mean? And I think like, that's totally possible. We're building that in the notebook LM today. So that you can imagine, play it forward. You've got avatars or human-like characters or not with lip reanimation,
[21:23] Thank you. [21:24] Do you think movies and games start to blur? [21:27] Yeah, I think that's a real possibility. Yeah. There's a whole interesting intersection that's happening right now between... [21:34] movies or video content, games and sort of world building and 3D. [21:39] And it's really unclear to us right now where that's going to go. But there's so many areas right now where we're seeing learnings from each and even down to some of the training techniques. We're finding things like that. [21:51] So actually, that was going to be one of my questions. Like if you look at all the companies building generative video models right now, some people are kind of going straight from the, you know, the pixel stream, so to speak. And some people are going from the 3D angle with the idea that, you know, to really do video right, you need to get 3D. Do you have an opinion on that? Yeah. We actually got bets on both sides right now. I don't know. I don't know. You can't lose. Yeah, we're hedged on this one. So on the 3D side, we have this project we got started where we basically said like, [22:21] take six pictures of a sneaker and create a 3D spin of it. And we put that on search. It's been really great. And it's amazing how it fills in the details. But I think what's interesting is we've been going down that path. Something like VO2 shows up. Now you don't need six photos anymore. You need like two or three. [22:38] And you can basically do like an entire product catalog, like every product that's ever been indexed at Google just overnight. [22:44] sort of can create it. So now you've got a 3D object basically of any object, bookshelf, chair, whatever, from any angle that you can pan, tilt, zoom, relight. And now that's like an object.
[22:56] that you can drop in anywhere. So that's kind of the 3D angle. [23:00] From the video angle, it's interesting, kind of the world building. We had this little prototype we built. We were like, wouldn't it be cool if you could recreate landing on the moon for every classroom and give teachers a tool where they could put the kids in the lunar module as it's coming down? So we built this thing. It's kind of terrifying, actually, because we also built a little side panel where you can inject problems where it's like, oh, no, something's on fire in the back. They simulate things. We had a little fun with it. [23:30] You could say like, look right. [23:32] And it would actually fill in the details of, [23:35] And so you start to get this, that's where it feels like it's kind of blurring. And I guess why we're hedging on both sides right now. Yeah, we're not sure. 2025, everyone's talking about agents. Yes. Yeah. Computer agents. Yeah. [23:46] You just said it three times. [23:49] Exactly. [23:51] I've been called a VC twice today. This is a very big insult. Can you talk to us about Google Mariner? Yeah. Yeah. So Mariner is one we put out in December last year. This is a fun one, actually, because we started seeing this capability developing in the model. We're trying to understand if you could let these models control your computer or your browser. [24:10] What would happen? Good and bad. And so that was a good example of a project where we went from, hey, this capability is kind of showing up. Let's put it into right now to Chrome extension, just because it was quick to build idea in people's hands, 84 days. [24:26] So very fast, very fun. A lot of memories made on that. But I think what's interesting is you're seeing both across Anthropic, OpenAI, obviously Google and a bunch of other startups in the space.
[24:38] are all hitting on kind of the same idea that models are not just about maybe knowledge and information and synthesis and writing. They can do things. [24:47] And they can scroll, they can type, they can click, they can not only do this in one browser in one session, but like an infinite number in the background. So I think with Mariner, what we're really trying to pursue is like, of course, there's the near term thing of like, can it complete tasks in your browser? But the bigger thing is, what's the future of human computer interaction look like when you have something like this kind of not just one of these things, but basically like an infinite number? [25:15] kind of at your disposal. And so that's what we're chasing with that project. [25:19] What do you think the ideal use cases are, maybe even in the near term for Mariner? Because I think all the demo videos I see, not necessarily from Mariner specifically, but with computer use more broadly, are, you know, here, have this agent go book a flight for me or go order a pizza on DoorDash for me. Right. Like, that's nice, but I like doing those things. Yeah, yeah, yeah. You're pretty good on those on your phone. Booking a flight is one of my delights in life. And so what do you think are the killer kind of consumer use cases? [25:49] It may not be consumer. [25:51] Maybe enterprise. And one of the things we're seeing when we do all the user research right now on Mariner, because we have it in Trusted Tester and people are playing with it and giving a lot of feedback, is it's really these high toil activities. Toil is kind of an old fashioned word that doesn't get used a lot. But this is when people talk about it. It's like, this is what makes me grumpy. And this thing is helping me solve it. But what's interesting is a lot more of those are showing up on the enterprise side.
[26:21] and they're basically, they have this co-browser use case. So imagine you're in like, [26:26] call center somewhere, some customer calls in. They right now have this very complicated way the agent in the call center can like remotely take over your machine that's not working, browse through things and do something for you. They were like, we would love to have Mariner do this. [26:42] um and that's like a way another one we heard which was kind of interesting was people they're like part of a sales team or something they have a [26:50] take a customer call, then they've got all these next steps they need to do. And they just want to fan that out. And it's often updating different systems that are all probably I don't know, some SAS subscriptions, they're paying everywhere. And they're just like the UI is clunky, it takes a long time. I just want to say Mariner do all this. [27:08] So these are the kinds of things that are kind of interesting that are just naturally coming up. On the consumer side, I don't know. Have you found one yet in your mind that you like? Because we've got a few, but it's – [27:18] I'm curious. I'm trying to think what the toil I have in my everyday life. Yeah. Talking to Ravi. I'm kidding. I'm kidding. Talking to Ravi is the best part of my day. I appreciate that. I like the framework, even if we don't have the exact use, the framework of what are the things that are the... [27:39] heavy lifting that you don't enjoy throughout the day that take up time away. And I do think that that was actually the same logic that [27:47] yielded things like door dash or instacart right right um
[27:51] You see how I had to get Instacart in there? I did. I'm just making sure that that was there. Um, [27:56] On the enterprise side, when you think about it. Yeah. Um, [28:00] How are you testing that? Are you testing that with existing, you know, customers? Are you testing that with Google Cloud customers? Like, who are the enterprises that you guys will actually like test things with? Yeah. So in that case, we kind of go across big and small. So there will be some cloud customers. We have a lot of cloud customers who always want the latest and greatest. Give us that. They have like labs equivalents inside their companies. Right. So those are awesome test beds. We also work with a lot of startups. [28:30] we're always trying to learn kind of from different sides of the market. What I found to building products over the years is it's very common. Everyone talks about product market fit. You'll know it when you see it and all that, which is true. But at least for me, I've always felt in the first part of building products, you iterate a lot on the product and sometimes you forget to iterate on the market. [28:49] And finding the right market side is also just as important as the right product. And you have to connect those two. [28:55] And so I think that in these early stage things with Mariner, that's where we are. It's like, is it possible for a computer to, like an AI model to drive your computer? Yes. That's a huge new capability. Is it accurate? Yeah. [29:07] Sometimes. [29:08] Is it fast? Not at all yet. Like that's kind of where we are in terms of the actual kind of use case or the capabilities. And then it's about finding the right market. But yeah, to answer short, it's kind of in these early days, we do lots of stuff really quickly. And what I kind of coach our product managers on and other people on the team, because we have engineers and UXers, they all go to these sessions is like, don't look at the dashboards.
[29:33] It's too small numbers right now. [29:34] look at their eyes. Look at the customer's eye. And when you show them stuff, do they light up or not? You know what I mean? And that's kind of the signal you're following. It's way more art than science. [29:45] at this stage. Can we go back for a second just to the context point? Because I was thinking about this vis-a-vis like you working at Google, right? And you talked about bringing your own, you know? [29:56] Is there a world where someone can just opt in of like, Google knows a lot about me, right? Already, you know, my searches, my Gmail, my calendar. Is there a world where you can just sort of opt in and be like, I don't want to bring it all now. I just kind of want you to use what you got and [30:15] be able to do something like that, probably more so than anybody. [30:19] Is that something that you guys can play with in labs or have a possibility for? Or is that not possible? We do some more kind of internally with some of our own like data on the team. [30:30] Right. Where I like I've opted into a lot of things, which is like, take it all. [30:33] Like, let's make good stuff. But I think you'll see some of that come through in the Gemini app, too, where you can link different things. But I think it's actually an area that's like actively kind of being explored, too. [30:45] of like what types of data [30:47] is like the most interesting and the most useful. [30:50] And of course, also the right controls where people feel like, okay, I'm not just giving it away. [30:54] Yeah, so I think that is an area, though, that we do experiment on some, but I'd say right now a lot of the experiments are more on our own stuff as we're trying to figure out. You're going to have to tell us separately some of the things that you could have done now, now that they know everything about you. You know, like, what is the magic that can be created for you? Yeah, I think certain things immediately come to mind that are pretty powerful is you can...
[31:15] You can see things like in my own data. I feel like I have a second brain. [31:20] Is it true? There's always been this vision of a second brain and tools for thought and all this stuff. [31:24] And I feel like you can get pretty close to that. [31:28] And I think the Gemini model specifically is really good at long context, the ability to have this like impressive short-term memory. [31:35] And so Gemini 2, that's an area we're really trying to exploit right now, like how to use that. [31:40] On Mariner. Yeah. Similar question to what I asked on on VO. Yeah. When do you think we'll have computer use that is accurate enough and is fast enough to do some of these use cases you talked about? Yeah, that's another one. It's kind of hard to tell at the pace, though, right now. I mean, not just inside Google, but what you're seeing from some of the other labs, too. They're on like about an every month or two rev. So you can imagine just this year, we're going to see four or five, six revs of each of these things. Right. [32:10] again, that's just what we know is happening. I think the areas that are a little bit trickier, harder right now is how the computer like finally or precisely navigates like the XY coordinates. Almost, you almost want like a lat long of your screen. And that's still kind of really interesting, jagged edges on that, I would say. The other big area I would say is like this. It's more of a
[32:40] not and kind of creating the right construct almost is like hey i'm about to buy something oh no i want to know about that or i'm okay for five dollars but nothing more than that do you know what i mean and so there's a whole bunch of almost like hardcore like hci like research and like really going deep on the empathy of like how you set those controls [33:01] that I don't think any of them, including the Google Mariner one right now, we don't have, I mean, we do certain very blunt things like don't buy anything. Don't consent to any toss. [33:12] There's some like crude things right now that you can do, but I think people are going to want a more fine-grained way. So these are some of the things that are, I consider more unsolved. Again, that principle, just banking on the model is going to get [33:24] smarter faster cheaper um and you're gonna get like four or five six or seven revs this year um yeah [33:31] Okay, I have a meta question. Yeah. How come all of the research labs converged on computer use at, like, as far as I can tell, the same exact point in time? Was that an accident? Was that just all the technology happened to converge at the same time? Like, what happened there? It's a good question. I mean, this is... I don't know the specifics there of each of the other labs, but I would say... [33:50] You know, when you read about the history of innovation and there's like all kinds of things on this there, it's not uncommon that discoveries kind of happen around the same time. And I think there's kind of a new paradigm now with these models. And I think lots of people are seeing the potential in certain ways. And I'm sure there's also, I don't know, people changing labs and other things that are cross pollinating all these ideas, too. But it does feel like it's one of those is kind of how I'm interpreting it is like things similar with coding. Right.
[34:20] you saw there's already even the agent stuff right now there's lots of this stuff kind of bubbling um which makes it really fun but also keeps you on your toes right because this is kind of the underdog mindset here are you going to hire any other authors the reason i ask is i was thinking about i think matt ridley is the one who's written about some of these things about like adjacent innovations and you know you have stephen johnson maybe why did you hire stephen johnson yeah that happened yeah and [34:45] Are you going to think about other people that don't have obvious backgrounds that you would bring into labs? Yeah, yeah. So the quick story on Steven was the guy who kind of restarted Google Labs was a guy named Clay Beauvoir. A mutual friend. Mutual friend, exactly. And he and I are big fans. We've basically read everything Steven had written. And Steven was a very interesting guy because for like decades, he's been in search of the perfect tool for thought. [35:10] And so Clay cold emailed him. We were both subscribers to his sub stack. We kind of messaged him and we're like, we love you. Will you come work with us? We can build the tool you've been wanting to build. That's where it started, actually. And this was like, I mean, it was like summer 2022. So like before any of the, you know, chat with GBT moment or anything. And Stephen picked up the phone. He was like, yeah, let's do it. So he came in. He was a visiting scholar.
[35:40] I had to go figure out with our HR person how to create a role that he could take on. This is very kind of unconventional in that way. And then the rest is kind of history, obviously. I've read a bunch of Matt's books. I don't know Matt. He'd be awesome. So if he's listening, please. Yeah, exactly. He's listening. Come talk to both of us. That's right. I would say we've done this quite a bit. So we've actually brought in musicians. [36:03] I'm actually really, we're trying to figure out right now, like a, like a visiting filmmaker. That's cool. So it's kind of a model. Steven kind of pioneered it. He was the first one that it's like how to bring in, it's a big value in labs and how do we co-create? [36:17] We don't want to just make stuff and throw it out there. We actually want to co-create it with the people that are in the industry. And what we find when we do that is you actually get way beyond the like, oh, that's cool toy AI feature. You get into the workflow. [36:31] And if you're working with someone like Steven Johnson, who's written, you know, a dozen plus books, there's a certain way he thinks about and almost like a respect for like the sources and the citations. All that stuff comes through in Notebook LMP. [36:43] And we're doing similar stuff with music and video and other stuff. [36:47] Is the goal to create new products that you can take from one to 100 to a billion standalone? Or is the goal to find product market fit with things like Notebook LM and then really fold them into the Google mothership, so to speak? Yeah, it's interesting. So when we first started, I would say it was all about... [37:05] Build something, graduate it. [37:06] So kind of a traditional incubator sort of model. It's been interesting as it's gone along. We've done that some cases like AI Studio and the Gemini API. We graduated and it's now in DeepMind and they're kind of running with it. Something like Notebook LM, we're just going to keep in labs right now for the foreseeable future because it's kind of a different creature. Like it's only possible with AI.
[37:28] And a lot of the stuff we're working on now, I mean, we'll have to see how many of these we can put together that actually can kind of get escape velocity. But we're really interested in turning them into businesses and making them sustainable and kind of, you know, that's been a lot of the focus actually is like take big swings. And that gets back to your point. A lot of these won't work. [37:46] Um, cause if you're just, if they're all working, you're not swinging big enough. So it's like trying to find that balance, but that's definitely, we start with kind of, could we make this a business work backwards from that? And if we end up graduating it, that's still a good, good outcome for us. Another good outcome is we stop it. [38:03] And it was like, cut the losses. We did our 100-day sprint or whatever. Move on to the next thing. Yeah. [38:08] You mentioned at the top of the episode that you try to do some top-down thinking of, you know, what are the most interesting pools for us to be building in? Yeah, yeah. What are your predictions on the most interesting pools to be building in for 2025? Like, where are you hiring talent? Like, where are you sniffing around? Where are you co-creating with the DeepMind folks? Yeah, yeah. There's a lot happening with agents. There's a lot happening with video, some of the things we've talked about with computer use. [38:33] But I think about those ponds a little bit different. I think about them. We have this doc called Labs is a Collection of Futures. [38:40] And it's 82 predictions about the future, which is always dangerous to make one prediction about the future, let alone 82. But the thought experiment on the team where we got to this was imagine you're in a room like this. [38:51] The ceiling just opens up and this little capsule comes down. We all jump in it and it slings us into the future.
[38:58] It's 2028. You can get out, you get five minutes, look around, write down everything, and you're brought back to the present. [39:04] And then write what you saw. And that's what this doc is. So what's the future of knowledge look like? What's the future? Even though prompts are old fashioned, that's a pretty good prompt that you gave to the team. Maybe, maybe. I was just telling you right now. Yeah, yeah. So that's, you know, we think about it at that level, at kind of a high level. So say something like, what's the future of knowledge going to look like? We think it's going to be one piece of that prediction, one of the 82, is that it's infinitely remixable. [39:29] And anything that comes in can be transformed and become anything on the way out. If you believe that, then you take certain bets and you build products kind of with that future in mind. So that might be one of them. But I think like going back to maybe some of the ones that a lot of people might be listening or building, I do think we're kind of at the moment for video. We're at the moment for very interesting agent stuff with the thinking and reasoning models. And I think there's also maybe something kind of under the radar right now a little bit. [39:59] we're going to see this year. [40:01] um and so those would be some of the ones that are top of mind for are you guys doing work on coding out of labs too yeah we are we are so right now at google 25 of all the codes written by ai yeah i saw that jeff team said that yeah that's right that's right and that's up a lot in the sense of just how fast the progress is um this is an area that i think there's kind of two approaches you could think about like how again think of lower the bar raise the ceiling right
[40:31] Correct. [40:31] Massive opportunity. Like Sonya. You know, like I've been coding my whole life. I mean, Sonya is some of that. Well, it's kind of interesting because some of the most interesting stuff happening here, I don't know if any of you have played with like Replit's agent stuff. [40:44] Really interesting, right? [40:46] A couple of weekends ago, I'm with my fourth grade son. We are struggling right now in our household to implement chores. We created a chore tracking app, 28 minutes, 45 cents. [40:58] Done. We're daily active users. And so it's a way to kind of get into software and a world of kind of software abundance. That's really interesting. So we've got some stuff in that area. We're also interested in how do you take a professional trained SWE programmer and make them like 10x to 100x. And there's kind of, I think, interesting bets on both sides of that. [41:21] Yeah. [41:22] What do you think is overhyped in AI right now? Oh, that's an interesting question. I wish we'd move beyond the chatbot interface a bit. That's one area that feels like we're kind of reusing that in a lot of places, Google included. Yeah. [41:37] I'm also not sure... [41:39] There's still a lot, I think, of people jamming AI into stuff. AI itself is a bit overhyped. I wish we were a little more precise about how disruptive or where to apply it. [41:51] And so I think, again, we're trying to think a lot about like workflows, not just take an existing product and bolt on AI. So I think that's maybe a little there's a race like you're seeing the first generation of AI put it in. And it reminds me a lot. Actually, when I first started at Google, it was like right as the iPhone moment was kind of just happening and taking taking hold. You know, when Steve walked on stage in 2007, said this is the iPhone. If you look at the app store.
[42:18] three years later which is roughly where we are in this ai revolution the app store in 2009 ish i went back and checked [42:25] websites that have been shrunken down to fit on your phone. [42:29] flashlight apps and fart apps. These were like the highest top downloaded things that were happening. So I think we're kind of in this stage where the real stuff is going to start to come out kind of this year, next year, the next year. That's when you start to see the Ubers, the Airbnbs, the Instacart, the things that really change kind of how you do stuff. And so that's kind of my thought on it. All right. Then Sonia asked you the overhype question. I'll ask you the under [42:59] within... [43:00] AI. [43:01] We talked about coding a little bit. Maybe just one other thought on that is I think – [43:06] If you can get code models that can kind of write code and self-correct and self-heal and migrate and do all this stuff, you think the pace is fast now. That totally changes the curve. [43:17] So I think that's a huge I still think it's underhyped. It's hyped a lot, by the way. But I think as hyped as it is, it could be hyped more. That's one. [43:27] I don't think we fully internalize the notion of like what does long context or like infinite context mean? [43:35] It gets to some of your personalization questions potentially, but it also gets at some of the stuff we were talking about around how can you make things like a mariner literally just keep going. [43:47] And so that whole notion of long context, I mean, you see a lot from Google, but we're investing a lot in that because we think that's a strategic lever that's important, especially as you get more agentic, chained together kind of workflows.
[44:05] There's... [44:06] There's not enough talk about taste. [44:08] And like, I think if you believe the value is going to be in the application layer, if you believe there's going to be some percentage of AI slop, you can just see a few of these trends. And I think there's going to be a value in good taste and good design. [44:22] And it doesn't mean it has to be human created necessarily, although I think there's going to be a high value on that, too, as like human crafted content becomes more artisan. But I think that's another one, I would say. I think maybe related to that, it's like veracity and truth and sort of what is real. Like these are things that I think are going to become way more important than they already are today. [44:43] I think the context point within there, I really firmly agree with on what... [44:49] can happen with you um your infinite context point because if you think about the relationship in your life where you have like the most context shared context is probably with your spouse yeah right and if you think about that what ends up happening is you can communicate with your spouse literally with just like like the flick of an eye yeah and all of a sudden they know exactly what you mean they know it's time to leave the party whatever it might be that's right right and you think about that's the aspiration for what can happen with infinite shared context we know that's the ceiling exactly right and so you think about you're like [45:18] think of how far away that is from now where you're like typing things in about what it is. And your point of like, well, hold on. There's all these different ways you can communicate it and they can get to know you better if it has memory. And so I think there's so much gold in there of it just being able to keep going. Yeah. Right. Yeah. Giving it the right context and whenever it needs. We think of any company that you all back or even Google, like what's one of the most painful things is when a long term employee leaves because all that context
[45:48] work relationship. Yeah. [45:50] Okay, we're going to wrap with a rapid fire round. All right, yeah, sounds good. Okay, favorite new AI app? Oh, I mentioned it earlier. I'm having a lot of fun with Replit. [45:59] Love it. The new agent thing and on the phone. I think they're doing some really interesting stuff there. You know, one of our partners, Andrew Reed, is known for slinging, like creating these amazing memes and sending around. It's now so easy to create an app. He just creates these all the time and sends them to me. They're really good. Yeah, we have this concept of like disposable software. You use it once and you kind of throw it out after you're done with it. So, yeah. [46:24] Okay, what application or application category do you think is going to really break out this year? Video. [46:30] Okay. Recommended piece of content or reading for AI people. Ooh, that's an interesting one. [46:37] You know, this one's not a traditional AI pick. I think probably a lot of the listeners here, I was going to say, over the break, I read a lot. And one of the books I picked up was actually is the Lego story. [46:51] And it's the history of Lego. That's awesome. And it's on its third generation of family ownership. [46:56] Um, I'd recommend that one. It's a really interesting, uh, [47:00] Yeah. Here's why though. There's a pivotal moment in the company's history where they had 260 products. And maybe for a lot of founders that are listening, you can imagine your company could go in like all these different ways. You're trying to figure it out. And [47:14] The grandfather, the CEO at the time, basically identified like the little building blocks. This is it. And he bet the company on it. And he bought these incredibly expensive machines. And so I think it's like an incredible – I like to read biographies a lot. And this is one that really stood out.
[47:29] Josh has an incredible taste in books, and he has this wonderful reading list that he's been kind enough to share with me. Oh, no way. That's really wonderfully curated. It has this very good formatting as to when it's something you really got to read versus not. And so you should, to all the listeners, you should take Josh's suggestions seriously. I actually really want a great AI reading app. That's like my wish list app. What would it do for you? In part because I have terrible memory. [47:59] on the planet. Like there's all these things that are kind of on the tip of my tongue and ideas that connect, but you know, they're all kind of in an abyss and they're all pretty inaccessible to me. And, and so something that surfaces some of those thoughts and ideas that I've had, things that I've read, you know, that next layer of thought I have from reflecting on two different things that I've read and the connections probably across them. It's a good idea. [48:23] I think even within that, like just the hard copy version, the Kindle version and the audiobook version being like, you know, seamlessly intertwined, you know, like even just at the most basic level, you know, so that you can continuously pay attention to something that you like. And then we can get to the version that you said. [48:42] Request for startup. Okay. Pre-training hitting a wall. Agree or disagree? [48:47] Sigh. [48:48] Maybe lean agree. I think there's still stuff to squeeze out there, but I think a lot of the focus has shifted. [48:56] Yeah. [48:57] NVIDIA, long or short?
[48:59] I don't give stock advice. Index fund. Do you ever sit with Demis and be like, look, as someone between us, we won a Nobel Prize. Do you ever start with that? Because, you know, that feels like something that's true. Between the two of you, there's one Nobel Prize. It's all one directional. It's Demis and John Jumper. Those are the people that won the Nobel Prize, not Josh Woodward. [49:28] Any other contrarian takes now? Any other contrarian takes? I, I guess maybe I'll leave it with this. I think we are kind of one thing is like what a time to be alive and building because I feel like there's this window where there's like so many adjacent possibles opening up. I think the second would just be like I'd encourage people listening to like really think about. [49:49] Of course, there's the models and who's winning and the back and forth. But like, what are the values you're building into your company? Because I think this is one of those moments where there's going to be like tools created that shape like follow on generations. I think it's really important people think about that. And like, are you trying to... [50:06] Replace and eliminate people. [50:08] Or are you trying to amplify human creativity? I mean, there's like one that's like, you know, it immediately comes to mind when I'm thinking of video, for example. I'm on the side of wanting to amplify human creativity. But I think there's like, there are these moments that happen in our valley here where like things change and they change often for generations and they can change for good or bad. And so I would just encourage people that are in spots where you're building and you have this incredible technology that's only getting smarter and faster and cheaper to
[50:38] think about the consequences downstream. [50:40] Thank you so much, Josh, for joining us today. We love this conversation. Yeah, thanks again. [50:45] Music.
Want to learn more?
Ask about this episode