Training Data · 2025-07-11

DeepMind's Pushmeet Kohli on AI's Scientific Revolution

Hosts: Unknown

Guests: Pushmeet Kohli

Alpha EvolveAlphaFoldCo-scientistFunSearchAlphaTensorGeminiAI for sciencealgorithmic discoverymulti-agent systemschip designprotein structure predictionscientific methodinference-time compute

Read summary Jump to transcript Original podcast

Podcast feed URL

Open feed

Why it matters

Alpha Evolve couples Gemini language models with programmatic evaluators in an evolutionary search loop, removing the template requirement that limited FunSearch and dramatically reducing the number of function evaluations needed.

Key claims

Alpha Evolve couples Gemini language models with programmatic evaluators in an evolutionary search loop, removing the template requirement that limited FunSearch and dramatically reducing the number of function evaluations needed.
The system generalizes across domains and languages (Python, C++, Verilog) and has produced state-of-the-art results in areas like data center scheduling and chip design, contingent only on having a reliable function evaluator.
Kohli frames the generator-verifier architecture (as in FunSearch, AlphaTensor, Alpha Evolve, and Co-scientist) as a consensus paradigm for AI-driven science that closely mirrors the scientific method.
Co-scientist, released earlier this year, uses multiple Gemini agents playing roles such as hypothesis generator, critic, and reviewer in a shared memory, with quality improving over extended compute rather than appearing immediately.

Episode summary

Summary

Pushmeet Kohli of Google DeepMind joins the Training Data podcast to discuss Alpha Evolve, a new evolutionary AI system that pairs large language models (Gemini Flash and Pro) with automated evaluators to discover entirely new algorithms and tackle long-standing mathematical problems. He frames Alpha Evolve as an evolution of earlier work like FunSearch and AlphaTensor, noting that it removes the template restriction of FunSearch, requires far fewer function evaluations, and can search across large algorithmic spaces in multiple languages including C++, Python, and Verilog for chip design.

Kohli situates Alpha Evolve within a broader generator-verifier paradigm that DeepMind has been applying across projects, including the Co-scientist multi-agent system where Gemini plays the roles of hypothesis generator, critic, and reviewer. He argues this architecture mirrors the scientific method itself and that the next bottlenecks are validation (bridging digital outputs to real-world experiments) and accessibility of the technology to working scientists. He uses AlphaFold 2 as a case study, recounting a biologist who solved a decade-old protein structure problem instantly, to illustrate how AI advances, accelerates, and democratizes science while also raising the bar for what problems scientists choose to tackle next.

Looking ahead, Kohli says we are in the middle of an AI-accelerated scientific discovery era and predicts human-AI teams will increasingly dominate Nobel-level work, with domain impact spanning energy, materials, healthcare, and chip design. He is bullish on inference-time compute (somewhat), robotics, and humanoids in the medium-to-long term, and flags calibrated uncertainty as a missing capability in current models.

Alpha Evolve couples Gemini language models with programmatic evaluators in an evolutionary search loop, removing the template requirement that limited FunSearch and dramatically reducing the number of function evaluations needed.
The system generalizes across domains and languages (Python, C++, Verilog) and has produced state-of-the-art results in areas like data center scheduling and chip design, contingent only on having a reliable function evaluator.
Kohli frames the generator-verifier architecture (as in FunSearch, AlphaTensor, Alpha Evolve, and Co-scientist) as a consensus paradigm for AI-driven science that closely mirrors the scientific method.
Co-scientist, released earlier this year, uses multiple Gemini agents playing roles such as hypothesis generator, critic, and reviewer in a shared memory, with quality improving over extended compute rather than appearing immediately.
Math is presented as the gold-standard benchmark for testing novel scientific discovery because results are precise and objective, citing AlphaTensor improving on Strassen's 50-year-old 4x4 matrix multiplication result (49 to 48 multiplications).
AlphaFold 2 is cited as a precedent: it advanced, accelerated, and democratized structural biology, expanding what researchers can study rather than replacing them, a pattern Kohli expects to repeat with Alpha Evolve.
Identified next bottlenecks are real-world validation of computational predictions and making the technology accessible and usable by working domain experts.
Rapid-fire: recommends Alpha Evolve and Co-scientist papers, names the wake-sleep algorithm as underrated, is somewhat bullish on inference-time compute, and bullish on robotics and humanoids over the medium-to-long term.

Source material

Transcript

So I went to a biology conference and after I gave my talk, a biologist approached me and he said, "Pushmit, I have been working on this protein over the last 10 years."

And I had collected so much lab data to characterize this protein, to figure out its structure, but somehow this has awaited all kind of investigation and we still didn't know the structure.

But we had all this data.

If we knew the structure, we could sort of validate it very quickly.

I ran AlphaFold 2.

It gave me the structure.

It perfectly fit the answer.

I've been working on this for 10 years.

Wow.

What do I do next?

What happens when AI stops just answering questions and starts asking them?

In this episode, Pushmit Khali discusses DeepMind's Alpha Evolve, a breakthrough evolutionary AI system that discovers entirely new algorithms.

Pushmit reveals how coupling language models with evaluators creates something unprecedented, AI that can tackle decades-old math problems and generate human interpretable code that outperforms expert design solutions.

Pushmit shares stunning examples of AI uncovering hidden mathematical truths and explains why we're witnessing the emergence of a new scientific method.

One where AI doesn't just accelerate discovery, but transforms which problems we can even attempt to solve.

Enjoy the show.

Pushmit, thank you so much for joining us today.

We've all been eagerly waiting for the moment that AI is capable of making novel scientific discoveries.

Do you think that Alpha Evolve is that watershed moment?

Yeah, so it's certainly a key sort of milestone.

What we have shown is that you have an AI sort of model, a large language model, when coupled with a harness, it's able to discover new algorithms.

And not only that, it's basically able to view new mathematical results, which have been studied for many, many years.

You use the words when coupled with a harness.

Can you tell us more about that harness?

Yeah, so if you now go back to AI models, like the history of AI for science is very long.

We have a number of different models that have tried to do scientific discovery.

Like one of the key models in this category is Alpha Fold, right?

Which is the prototypical example of what can be achieved by AI in science.

We released Alpha Fold 2 at the end of 2021 and it won the Nobel Prize last year.

So the impact of AI in science is very well understood.

Now the question is whether LLMs and foundational models, how can they impact science?

Around two years back, we had an agent called FunSearch in which we took an LLM and we coupled it with an evaluator.

And the evaluator allowed the LLM to figure out when it was making new conjectures or coming up with new ideas to solve problems, whether they were hallucinations or whether they were brilliant insights.

So essentially in this particular case, hallucinations were great because some of those hallucinations were in fact brilliant new insights that nobody had thought about.

So this is where the harness comes in, that you have this evaluation function and essentially a search protocol associated with the LLM that together is able to come up with completely new discoveries that are really impactful.

You mentioned FunSearch.

Could you say a word on the difference between the results you all accomplished with FunSearch versus Alpha Evolve?

Yeah, so FunSearch was our first instantiation of taking a large argument model and trying to see if it can discover new algorithms.

The models at that time were weaker, right?

And the type of search that we were trying to do, we had not sort of explored things much further.

So what we asked the LLM to do was essentially try to complete a small function and see if it can do that much better.

And surprisingly, it was able to discover completely new algorithms that mathematicians had been trying to study for a long time.

But the limitation was that the mathematician or the researcher had to give a template in which the algorithm should be found.

With Alpha Evolve, we have removed that restriction.

Alpha Evolve is not just sort of searching for a few lines.

It's basically looking at whole algorithms themselves, very, very large pieces of code and optimizing them over a long period of time.

And secondly, FunSearch, our original model, used a lot of function evaluations to discover to make these new discoveries.

Alpha Evolve can work with many fewer function calls and it can basically, by looking at fewer proposals, it can discover new algorithms much more quickly.

Can you tell us about the role that the evolving Gemini models play in the capabilities of Alpha Evolve?

And I think I saw in your blog post, you have both Gemini Flash and Pro involved in the harness.

What is each responsible for?

Yeah, so I think, see, we have been evaluating as Gemini gets Gemini improves with various sort of generations.

It is becoming much, much better at its understanding of code.

Now, if you have a proposal generator which can understand code much more effectively than it gives, it generates proposals which are not only syntactically correct.

They are also semantically trying to solve the task.

And then you are sampling what are the different ways in which the task can be solved.

So as the baseline model, Gemini's abilities to perform coding improve our sample effectiveness in searching for the right solution on these very hard maths and computational problem becomes much better.

So if you want to search in a large space, there are two sort of elements.

One is the speed of how you can generate these proposals and then the speed at which you can evaluate those proposals.

So first, how quickly can you say, can you give me a new sort of candidate algorithm?

And then secondly, how quickly can you evaluate whether the algorithm is any good or not?

And both things are really important.

And the fact that you have these variants of Gemini, like Gemini Flash, which can do that very efficiently and very quickly, this is really important.

I know Alpha Evolve is more of a broad domain model than some of its predecessors.

How broad is it?

What's in scope?

What's out of scope?

Yeah, so Alpha Evolve essentially allows you to search not only in terms of the size of what you can search over, right?

You can now discover whole new algorithms, but it also is extremely general in its ability of thinking about algorithms in various different languages.

So not only can it sort of search in C++, but it can also do it in Python.

It can also sort of do in Verilog, which is what the language is for describing chips, right, in chip design.

So the generality of Alpha Evolve is in its ability to search for these large algorithmic spaces, but also in different syntactic and semantic representations, right?

It's not restricted to a particular language like Python, but it can sort of do that search across many different types of languages and many different types of tasks.

The only expectation it has is that you have a function evaluator that you can quickly evaluate whatever proposal there is and say how good it is.

It seems like the rough cognitive architecture, so to speak, of generating a bunch of algorithm candidates, evaluating them, and then I guess evolutionarily deciding which ones to keep and then going forward from there.

And it seems like it roughly mirrors the scientific method.

Is that intentional?

Yeah, so I think there are also, like if you think about it, there is another sort of agent that we released earlier this year, which was called Co-scientist.

And in Co-scientist, essentially what you had was Gemini playing the role of the whole scientific academic process.

So Gemini playing the role of a hypothesis generator, Gemini playing the role of the critique, Gemini playing the role of sort of ranking, different sort of reviewing those ideas and ranking those ideas and then editing those ideas.

So it was Gemini playing all these roles in a multi-agent setup.

And these were all sort of Gemini models prompted differently to play different roles.

And very sort of interestingly, this combined multi-agent system came up with behavior that went much beyond a single Gemini sort of models answer.

So it was able to give much, much, much better proposals and new ideas compared to a single sort of model.

What's the intuition behind why that works?

Yeah, so I think it is something that is still being studied.

But it is a fascinating sort of thing.

The one thing that I actually sort of noticed is that, especially with regards to sort of Co-scientist, you would run sort of Co-scientist on a particular problem.

And the very first answer that you would you might get might not be very different from the baseline Gemini sort of model.

And but what happens is even as you sort of increase the amount of computation over sort of this is you're not talking about just a few minutes or a few hours, but even days.

As the whole multi-agent system sort of looks at the solutions and then refines them and sort of tries to sort of rank them, it just becomes much, much better.

So why that might be happening?

It might be that the proposal that there is deep insights or there is some sort of intuitions that are buried in the tail of the distribution.

And then somehow Gemini's ability to evaluate which sort of proposal, which idea is better, is much better than its capability to come up with sort of a new idea.

It's the same sort of thing in computer science.

Sometimes we are able to find if sometimes we know whether a particular solution is correct or not, but it's very difficult to come up with a solution.

Right.

So it's the same sort of thing appearing again in this multi-agent setup that somehow the agents working together are able to extract many more impactful results.

It seems like the architecture of kind of generators and verifiers.

It seems like that paradigm is being echoed across the broad space, whether it's very general models or very specific kind of like AI systems for very specific applications.

Is that fair that that's sort of the consensus architecture right now?

And do you think that'll be the thing that people continue to push and scale?

Yeah.

So I think there is going to be more work in agents.

Right.

What we are seeing is basically the very start of research on agents.

Whether in alpha evolved, you had a generator coupled with an evaluator.

That generator was a neural network, a foundational model, an LLM, and the evaluator was even hand-coded.

Right.

But together with an evolutionary sort of search scheme, you were able to sort of get these much more effective results.

In co-scientists, you didn't have just one agent.

You had multiple agents working in a shared memory.

Now, like what is the optimal agent configuration?

Like this is still an open research problem.

Super interesting.

Are the results that you're getting, are they different from the ways that humans would derive them?

I'm kind of thinking of the AlphaGo, Move37 stuff.

Are the methods different?

Are the results...

How do they compare to the ways humans would think about them?

So let's go back to the original sort of motivation for why we started even working on the first iteration of using LLMs for algorithmic discovery, which was fun search.

So a few years back, like as you know, DeepMind has done a lot of work in using AI systems for searching over large spaces.

We have done a lot of work in building agents which have been trained using reinforcement learning, which can deal with many complex challenges from the game of Go to playing StarCraft, which are quite complex challenges.

We set ourselves a challenge that can we take the same kinds of models, like the AlphaZero family of models, which were extensions of what we had done in Go and the development of AlphaGo.

Can we use the same types of models for discovering new algorithms?

And we came up with a new sort of agent called AlphaTensor, which was particularly focused on finding solutions for the matrix multiplication problem.

And we found that this agent was able to improve over the past known results which had stood for 50 years.

But the key question sort of remained, can you do something better?

And secondly, can you sort of come up with a solution that is more interpretable?

At the same time, like when we were looking at practical problems in Google, like how do you schedule jobs in a data center?

Now there has been a lot of work on coming up with new algorithms and these heuristics have been designed by some of the best researchers and engineers at Google.

And because they have a huge amount of impact in terms of computer utilization.

If you use a typical reinforcement learning agent on this kind of problem, you might get better results, but it might come at the cost of interpretability because now you have a neural network deciding which workloads go to which computers.

And if something breaks, then you don't know how do you debug this thing?

So what engineers would really prefer is instead of giving them a neural network, you gave them a piece of code that they can interpret and they can run.

And this was essentially the motivation.

Can we now use LLMs instead of searching in the space of specific algorithms like we had done, metric multiplication algorithms like we had done in an alpha tensor or coming up with a neural network policy to directly solve the problem.

Can we come up with an agent which can search in the space of programs and come up with a program that solves this hard problem?

And the benefit of course will be interpretability that you can see the sort of code, you can see what its properties are and so on.

And that's what happened.

We found sort of programs that not only were effective, but when the experts actually saw those programs, they could sort of recover insights.

So for instance, one of the math problems that we had looked at for fun search was called the Cap Set Problem.

This is a problem that Ted and Stau, one of the famous mathematicians, he is very interested in.

And we collaborated with this mathematician, Jordan Ellenberg at NYU.

And when we looked at the program that fun search had produced, he found that there were certain symmetries that were in the problem that had not been recognized before.

And somehow the program, like fun search, the agent had discovered those and was utilizing those to get a better solution.

Can you say what about, you mentioned working with Terrence Stau and other famous mathematicians, is math considered the gold standard for testing and benchmarking if these models are generating novel scientific results?

Yeah, so math certainly sort of has some properties which are very interesting, right?

The fact that it's very precise.

Like you can, you know, whether the property that you have looked looking for, whether you have found it or not, right?

You know, the matrix multiplication, for matrix multiplication, how many multiplications you require.

Like for a four by four matrix, what was known was that you can do it with 49 sort of multiplications.

That's by stressor.

And we showed that you can do it by 48.

So that's a very precise result, right?

There is no sort of arguing about that.

So it gives you a very crisp way of evaluating how well you are, how well you have done.

And there is no sort of like RLHF needed in terms of human feedback, whether this was a nice result or whether this was a nice output or not.

And you don't need to rely on sort of an LN Cisco.

You just know that you're better.

Yeah.

Okay.

So then when you go from the, you know, beautiful pristine environment that is math to the real world, you know, it seems like you all have found a lot of real world applications and data centers in the Verilog world.

Could you say a little bit about, you know, which applications you expect Alpha evolved to be most impactful for?

Yeah.

So wherever sort of you can find a good function evaluator, wherever you can find an evaluator where you can say, I really trust this evaluation scheme.

If you give me a program, I can tell you very concretely how good it is.

If your problem satisfies that setup, then you can use Alpha evolve.

Because Alpha, unlike a human sort of programmer who can try 10 things or 100 things or 1000 things, Alpha evolve does not, like it does not, it does not sort of, it can go on and on and on and on.

Right.

It can come up with very counterintuitive sort of strategies to find, to solve that problem.

Some things that you might not have ever imagined.

Can you have humans be the function evaluators or does that not work?

Humans can be the function evaluators.

It's a question of sort of scale, right?

Like how many can you sort of evaluate and whether you can evaluate the property of the program effectively.

So at scale and with the right level of accuracy.

How do you do that?

Do you build that into the application itself so that there's a human in the loop evaluating as it goes?

Do you do that offline separately before the application is produced?

I guess how do you do that or how do you imagine people doing that?

Yeah, so I mean, like, so we haven't used a human in the loop for Alpha evolve, right?

Most of our evaluators were programmatic evaluators, right?

But imagine a hypothetical scenario where Alpha evolve was told that you have to solve this math problem and come up with a new algorithm to solve this problem.

And suppose it came up with many different types of problems, many different kinds of solutions, which are all equivalent in performance.

Okay.

But then which one is the best?

It's the best is the one which is not only sort of very effective on the problem, but is the most elegant according to a mathematician or the most simple to understand.

Right.

And that's a very subjective human thing.

Like simplicity or interpretability, like we don't have a sort of definition of it.

It's it depends on it is grounded in the human observer.

At what point do you need to pair kind of what's happening in the digital world to any kind of physical world stuff?

I think in your blog post, you mentioned that you could see Alpha evolve being useful for, for example, for material science.

Do you need to be able to connect to a real world laboratory to kind of get any of that feedback or do you think all of this can kind of happen in the algorithmic domain?

Yeah, that's a very good question.

And I think this goes back to how much do you trust the evaluator?

If you if sort of if your evaluation was based on a computational sort of method and the computational method was perfect and you completely trusted it, then you don't have to.

Then you think, well, I believe the computational model, the computational model says that the solution that Alpha evolve came up with is satisfies these properties.

Job is done.

Right.

But if you don't believe that the computational model is the perfect characterization of reality, then you want to make sure that you sort of validate that result in the real world.

Right.

And you see whether that assessment of the evaluator was indeed correct.

As Alpha evolve becomes more and more successful, as Gemini becomes more and more powerful, what do you think happens to these domains and how will the human scientists and engineers working in them adapt?

So, for example, if you take chip design, as an example, you mentioned these models are getting very good at creating new generating Verilog, creating new chip designs.

Does that mean the role of a chip designer goes away?

Changes like how do you think that this changes the world?

Yeah.

So I think that that's that's again sort of a very interesting question.

I'll give you the example of what happened with Alpha Fold.

So we started working on this problem of protein structure prediction.

So for those of you who don't know, like proteins are the building blocks of life.

They are the Lego blocks of life.

And for many, many decades, scientists have been trying to figure out what is the shape of proteins, because if we understand the shape of proteins, we understand how they function and we can use that to sort of develop new drugs to treating sort of the most sort of challenging diseases on the planet.

We can better enzymes and so on.

Now, in 2021, as I sort of mentioned, we released Alpha Fold 2.

Before that, you used to take or even single protein, sometimes one to five years to find the structure of a single protein and it might take a million dollars.

And there were some proteins which are so notoriously hard that people had been trying to study them for almost one or two decades and had not found the solution.

And which is why only 37 percent, roughly 37 percent of the human proteins, their structure was known.

So after we sort of released Alpha Fold 2, I went to a biology conference.

And because Alpha Fold 2, with Alpha Fold 2, we could find the structure of all proteins, not just human proteins, all proteins on the planet.

And we made the structures available to everyone on the planet.

So I went to a biology conference and after I gave my talk, a biologist approached me and he said, "Pashmit, I have been working on this protein for the last 10 years and I had collected so much lab data to characterize this protein, to figure out its structure.

But somehow this has eluded, this has evaded all kind of investigation and we still didn't know the structure.

But we had all this data.

If we knew the structure, we could sort of validate it very quickly.

I ran Alpha Fold 2, it gave me the structure, it perfectly fit the answer.

I've been working on this for 10 years.

What do I do next?

So what has happened after Alpha Fold 2?

What happened is basically suddenly it did three things.

It first advanced structural biology.

What was not possible earlier, it would take a synchrotron and six months and a million dollars is now done in a second.

Right?

So it really advanced what was possible.

Secondly, it accelerated it.

And thirdly, it democratized it.

Like that particular scientist working in Latin America or South Asia or Africa on some neglected tropical disease had no chance to sort of figure out the structure of their protein.

They did not have the funds or have access to instruments that could find them the structure.

Now they have access to those things, to like any sort of parasite that they're working on.

So what do they do?

They are now working in this new model where structures of proteins are not hard to get, they are everywhere.

And so they are working on the next set of things like how do you now use that knowledge to treat diseases and design better drugs.

And I think the same thing will happen with Alpha Fold.

Once you have these agents which can go beyond human abilities in solving these problems, then the question becomes which problems to be solved.

What are the important characteristics of a chip that we need to improve on?

Right?

Like we want to make it much more efficient, much more sort of so that it requires less cooling.

It requires sort of less expensive construction mechanism.

It's more fault tolerant.

Many other things.

You can make the problem more and more sophisticated because now you have more sophisticated systems to sort of optimize them.

Well, I have you.

Something I've always wondered, the Alpha Fold results are phenomenal.

And the story you shared with us is really impactful.

Do you think that it's caused an inflection point in the kind of availability of new drugs?

Or have there been other bottle, are there other bottlenecks now that are just, you know, we're faster at one part, but unfortunately everything else is just hard.

So we're still slow overall.

No, it's so it has it has speeded things up.

But I think there's one has to understand that drug discovery is a long process.

Now, what are the what are the roadblocks for drug discovery?

First, you have to understand the target.

You have to understand the here's a protein in the body that I need to bind because this protein is somehow involved in the disease.

So if I can somehow bind something to this protein and change its function, it will have an effect that can sort of treat the disease.

Like first, you have to come up with that conjecture.

Then you have to say, OK, now I have a target protein.

How do I develop a drug?

How do I develop a small molecule or another protein then binds to it?

So for that, you needed to understand the structure of the protein, which are the proteins that it interacted with?

How did it interact with this molecule?

This would take a significant amount of time, sometimes two years.

Now that process is dramatically sort of accelerated.

Now you can do it in sort of a few weeks or a month or a few months that took you multiple years sometimes.

But that's not the end of the story.

After that, you need to now clinically validated.

So you have to go through phase one trials, phase two trials, phase three trials.

You have to think about toxicity, all these other sort of things.

So what alpha fold did was take one blocker away, made the overall timeline faster.

But there are other sort of blockers, which are new generation of AIF or biology models are hoping to accelerate and make much faster.

So we have taken a big step, but we need to take a few more big steps.

What domains do you think will be most lucrative for this family of models?

I think the question is what is the answer to your question is basically what domains do you think are important for society?

Because AI is going to accelerate everything.

It's going to accelerate healthcare.

It's going to accelerate sort of the ability for us to develop more smart systems from healthcare to sort of material science.

Like if you think about the history of our civilization, we even describe our civilization in the sense of there was first we were sort of cave dwellers and then we went into the stone age.

And then we sort of went to the iron age and then the bronze age.

And now depending on who you talk to, you're either in the silicon age or in the plastic age, whether you're on the stick or feeling a bit sort of sad.

But if you take a step back and you think about what has humanity achieved, what we have achieved compared to any other species is the ability to transform energy, to leverage energy.

We have been able to leverage energy and do big things with that power.

Now, if you can come up with, say, a new room temperature superconductor that completely transforms your ability to handle energy.

What changes will it bring about in society?

They're hard to predict if you can deal with energy in that way.

If we can unlock fusion.

And energy becomes so cheap.

If you think about geopolitics, if you think about economy, a lot of it is about energy.

And suddenly if energy sort of goes down to zero, what will be the impact on the economics of the whole thing?

Similarly, if you think about coding and if you have these agents which can code, what does that mean?

If everyone can sort of code, like intelligence sort of is completely ubiquitous.

Everyone has access to all these different things.

So there will be dramatic changes and everything will be impacted.

So from materials to energy, to sort of coding to healthcare.

Really cool.

Do you think we're going to have a fast takeoff moment for scientific discoveries?

Do you think we're at the ramp of one?

You think we're already there?

I think we are living through the middle of it.

Like when you're in the middle, you don't really see it.

But I think we are already in that era of AI accelerated scientific discovery.

What do you see as the biggest bubble next going forward?

I think sort of two elements.

One is validation, bridging the gap between the digital and the real world.

How do you validate some of that?

That is one sort of key idea.

And really sort of capturing what is important for the problem.

And the second is sort of the other bottleneck is how do you make this technology accessible?

You can build the most sophisticated technology if people don't know how to use it.

Then you will not have the impact that you want.

But you will have to be able to do that.

And then you will have to be able to do that.

But suppose even it was accurate at 99%, the one person who got unlucky with their prediction and then spent the next sort of one or two years chasing a wrong prediction would then sort of say that I should not use it.

I should not sort of use the predictions.

So why is everyone using alpha fold?

They are using alpha fold because not only is alpha fold good at making these predictions, which are accurate, but it's also very good in understanding the limits of its predictions.

When it makes mistakes, it basically holds its hand and says, I have made a mistake.

So now if it is making your prediction and saying I'm confident, like most of the time it's correct.

And that's great.

This is something that the elements of today don't have.

They don't have calibrated uncertainty.

Should we close out with some rapid fire questions?

Yeah, sure.

Must read paper of the year.

Oh, I would say alpha evolve or co-scientists.

I like the like, yeah.

Favorite algorithm nobody talks about.

Oh, the wake sleep algorithm.

And very few people know about it, but it's essentially the idea.

It's a paper from MIT from Kevin Ellis and Josh Tendon-Bohm, which sort of talks about it's a way of sort of doing training where you find some exploration and then you somehow build the gist of it.

Think about sort of library construction and then I'll use library construction.

You don't just want to write programs, but you want to also create the libraries that have common modules that will make all your future programs much easier to write.

Very cool.

Agree or disagree.

Inference time computes will be the next major lag of compute scaling.

Somewhat agree.

Okay, say more.

So I think inference time compute will be very, very important.

I think also test time sort of training time compute will be equally important.

Right.

We also like if you look at distillation, how powerful distillation sort of has been.

Right.

So if these models have a have an ability to sort of understand and conceptualize what these models are able to do and come up with better models.

And come up with better inherent representations.

Then they just become much more effective in making predictions.

Maybe they're sort of uncertainty improves and so on.

They become more efficient even.

Robotics bullish or bearish.

I'm bullish about everything.

So I have to say bullish.

I think everything will be sort of will have an impact.

The question is basically near term or longer term.

Right.

In the near term, it will take some sort of getting robotics to work is challenging.

But like in the medium to long term, I think I'm bullish.

Humanoid robots bullish or bearish.

We have constructed our world for humans.

Right.

We like the human form.

A lot of the the the non natural world around us is made for humans has been designed for humans like from architecture perspective.

Right.

Now humanoids have the same form.

As humans.

So they will fit in in all these different architectures that we have built.

And the human form is a normal thing that is not clear.

But they certainly sort of have an advantage that we designed everything for the human form.

And now human humanoids have the same form.

Future Nobel prizes in the sciences.

Will all of them be won by teams working with AI?

I think we would be getting there, but I think like humans are still winning Nobel prizes in the sciences.

So I think I think there will come a point where I will be indispensable.

So it will be sort of humans and teams working together to achieving these amazing breakthroughs.

Push me.

Thank you so much for joining us today.

These are really fundamental, really general results that you're pushing forward at DeepMind.

And we appreciate you joining us to share more about how you how you managed to do do all this so far and what's ahead.

Thank you.

[MUSIC PLAYING]