NVIDIA AI Podcast · 2026-03-12

Building AI Factories: Red Hat & NVIDIA on Enterprise AI Transformation

Hosts: Noah Cravitz

Guests: Chris Wright, Justin Boytano

AI factoriesenterprise AIagentic AIAI governancehybrid cloudAI infrastructureAI productionautonomous agents

Why it matters

AI factories consist of five layers: data center hardware, software orchestration, AI models, applications, and business outcomes.

Key claims

AI factories consist of five layers: data center hardware, software orchestration, AI models, applications, and business outcomes.
Enterprises must build AI factories with strong governance, security, and role-based access controls to ensure trust and compliance.
Agentic AI systems are rapidly maturing and expected to drive half of global AI investment by 2029.
Hybrid model architectures combining open and frontier models can reduce costs significantly while maintaining data privacy.

Episode summary

Summary

In this episode of the NVIDIA AI Podcast, Chris Wright (Red Hat CTO) and Justin Boytano (NVIDIA VP) discuss the concept of AI factories—integrated enterprise systems that transform data into actionable intelligence at scale. They describe AI factories as a five-layer technology stack ranging from hardware and data center infrastructure to software orchestration, AI models, and applications. The conversation emphasizes the importance of building AI factories that enterprises can trust, with strong governance, security, and operational best practices to move AI from experimentation to production.

The guests highlight the rapid evolution of agentic AI systems, which are expected to account for a significant portion of AI investment in the near future. They stress that AI factories enable enterprises to deploy autonomous agents responsibly, integrating them with existing business systems while maintaining data privacy and security. The discussion also covers practical advice for enterprises starting their AI factory journey, including infrastructure choices, software platforms like Red Hat OpenShift, and the importance of iterative development with measurable business outcomes.

Looking ahead, both guests foresee AI factories becoming central to enterprise operations within a few years, fundamentally changing workflows by embedding autonomous agents that perform complex, long-running tasks. This shift promises substantial productivity gains across software development and knowledge work, marking a new era of AI-native enterprises.

AI factories consist of five layers: data center hardware, software orchestration, AI models, applications, and business outcomes.
Enterprises must build AI factories with strong governance, security, and role-based access controls to ensure trust and compliance.
Agentic AI systems are rapidly maturing and expected to drive half of global AI investment by 2029.
Hybrid model architectures combining open and frontier models can reduce costs significantly while maintaining data privacy.
Starting small with validated blueprints and iterating based on real business outcomes is critical for successful AI factory adoption.
Separation of development and production environments is essential for safe and scalable AI deployment.
AI factories enable integration of AI capabilities into both new and existing enterprise applications, bridging traditional and AI-native workflows.
In the next 2-3 years, AI factories will embed autonomous agents performing complex tasks, transforming enterprise productivity and operations.

Source material

Transcript

Welcome to the NVIDIA AI Podcast.

I'm your host, Noah Cravitz.

My guest today are Red Hat's Chris Wright and NVIDIA's Justin Boytano, and we're talking AI factories.

Why should enterprises build AI factories?

And how can they do so with confidence in building AI factories that they can trust?

By a way of introductions, and I'll keep it brief because both of these guys will work speaks for itself really.

Chris Wright is Chief Technology Officer and Senior Vice President of Global Engineering at Red Hat, and Justin Boytano is Vice President and General Manager of Enterprise Computing at NVIDIA.

Gentlemen, welcome to the NVIDIA podcast.

Thank you so much for taking the time to join us.

Thanks for having us.

Thanks for having me, Noah.

Let's get right into it, and Justin, I'll all start with you, but always both of you guys feel free to jump in as the spirit moves you so to speak as we go.

But Justin, why don't we start with you?

Can you talk a little bit about, well maybe first give kind of a working definition of what we mean, what you mean when we talk about an AI factory, and then get into kind of at a high level, why wouldn't an enterprise be interested, why are enterprises building AI factories, and what are some of the tangible benefits that an enterprise can expect to see from an AI factory?

Sure, Noah.

Yeah, I think it's important to understand kind of the context of where we are as an industry, and building digital intelligence to power the productivity of organizations is going to be as critical in this decade as energy in running our companies.

This is the next industrial revolution, and companies are always asking us, how do we build these factories that basically take data in and then produce the intelligence that helps them run their businesses more efficiently?

As we talk about what is an AI factory, we think of them as really kind of five layers of technology that need to come together.

At the base layer, you know, you've got to make sure that you have got the data centers with power to bring into these factories, you've got to have chips.

Is it either way to talk about it, but we're at this point in building rack scale infrastructure that's six chips with the extreme code design to build the best token efficiency from the power available to you?

Next layer, you typically want to have the software infrastructure to orchestrate everything, and then you want to have models that run that intelligence, and then ultimately the apps and the agents on top.

And so what every business needs to do though is take this intelligence and build, you know, use case specific business outcomes that help them drive innovation, build products faster, and ultimately, you know, grow revenue, top line through deploying this intelligence at scale.

Right.

And so these five layers you're referring to, this is the cake, right?

The five layer cake.

That's right.

This is the five layer cake.

Excellent.

Chris, the world is, I feel like we can say this so often, but things are changing so quickly.

Right now, as we record this, there's a lot of talk about open claw and autonomous agents and kind of long running agents.

Can you speak kind of a little bit sort of to that and how in video and red hat are working together to help enterprise enterprise IT departments kind of step into this, this new world?

Yeah.

Actually, open claw is a great example because there's so much enthusiasm about, I guess, what's possible, what you could do.

It's captured the kind of the builder's imagination, but also built quite quickly, certainly leveraging AI to help produce code quickly, but not with the enterprise in mind.

So when we think about what Justin was describing that kind of data into a factory context that produces business value as an output, we're talking about enterprises, that's their data.

Those business outcomes are really either driving net new growth or focused on the productivity and efficiency.

All of that needs to be done responsibly, safely, respecting access controls, delivering audit trails, things that are maybe not as fun in the builder world, but fundamental to the enterprise world.

And so a lot of what we're doing is taking these these building blocks, the layers of that five layer cake and making them accessible to the enterprise together.

So obviously in videos, we're a class hardware, we're bringing a software layer that enables the higher levels of that cake and that we're building the right guardbrails and security considerations into this combined solution so that our customers can then feel confident about bringing this into their enterprises.

They're all trying to figure out how to do AI transformation, go from a traditional company to really an AI native company and in that context, not introduce undue risk or, you know, essentially undermine the the core of the business.

Right.

So there's research that shows that only one percent of organizations right now have reached the stage of an optimized AI fueled AI native as you're talking about Chris Enterprise, while over half of organizations still remain in the early stages of transformation.

But at the same time, projections have global AI investment exceeding a trillion dollars total by 2029 just a few years out.

And of that trillion dollars, these projections are saying agentic systems are going to account for roughly half of that spending.

That's a big shift from, you know, a year ago, two years ago, you guys know the time for better than I would, but, you know, when agents were kind of this, this buzzword that, you know, nobody necessarily knew there are all these different definitions, etc.

And now we're talking about all of this, you know, resource and spending going in specifically to agentic systems.

Chris, what can we glean from this?

And I know you spoke to it a little bit just now, but, you know, what are the kinds of things that the AI factory can do for an enterprise infrastructure wise, but confidence wise as you're talking about when it comes specifically to figuring out how to deploy an integrate these agentic systems?

Well, if you think about that notion of transforming the enterprise and leveraging internal data and focusing on your core business, but how do you improve it or grow it, there's a whole set of things that are underneath that.

Obviously, the data piece that we talked about, but also it is the existing tools that operate your business that are not going to just go away.

They're their fundamental, they're the baseline.

The business is usual components for the critical and fundamental.

So part of this is how do you carry that forward and really modernize your entire infrastructure to bring these two worlds together, this highly modern AI native world and the traditional set of applications that literally run the business, because you need to bring AI capabilities, not just in the net new, but also in the existing content that runs all the enterprise.

And to me, that's exactly what the AI factory does.

It helps bridge these two worlds.

I mean, the end we've got models, but we also have as just in the script beginning, agentic content or AI enabled applications and then also the traditional applications.

So bringing all of that together and then doing it at a context with consistency across the enterprise so that you're not asking every team to go figure out their own choose your own adventure path forward and that consistency you build best practices across your organization and then you're ultimately improving your chances for success and reducing the failure race.

There's so many studies that suggest a lot of AI projects can fail.

There's a number of reasons for that.

One of those is having the right tools and having the best practices and access to the data and combining essentially combining forces as a company to produce and output rather than devolving into the next generation of shadow IT and everybody building their own thing and creating this highly fragmented internal environment, which is then kind of difficult to get your arms around if not just producing a very little success.

Yep.

Justin, are you seeing some of those things?

Well, I got to say, it's interesting as in the last three months, it feels like the market is really started moving even faster.

I'll just say this.

You look at coding companies.

Everyone of you guess who comes on this podcast is that same thing, moving faster, moving faster.

Well, well, but you can actually really feel it now and I say that because like these, you know, the first area of agents like product market fit, you know, really was in software development and we see it really as a software company ourselves, you know, we we can feel these agents doing so much more work for our developers and running longer, more complex software tasks.

So you give them design goals and they can, you know, work towards those goals.

And at the same time, you know, like you said, this moment of clause came out and clause basically take it sort of to a new frontier of, you know, full autonomy.

And so, you know, we're getting to this point where, you know, agents are going to have a lot more agency within our enterprise.

A lot of those studies that you mentioned where people were having a hard time getting AI to work.

I think was at a previous era of the world where people were trying to do like chat bots, like just very basic chat bots.

And that was before reasoning and it was before, you know, this level of autonomy that I'm talking about.

And so I feel like a lot of what enterprises might have been experimenting with might be a couple generations behind where state of the artist right now.

And so as we deploy agents internally now, they can use, you know, a very, a good deep agent like reasoning framework, they can, you know, plan and reason and act across many different business systems to do a deep research is an example to understand kind of the intent of what a user might be asking and help them get to the information across the enterprise in a way that's faster and more efficient than ever, you know, previously thought imaginable.

And the nice thing about running this on a factory and AI factory within the context of an enterprise is this Chris mentioned that delivers data privacy and security by running that all across open models in this on-prem world.

And then you can do things where you, you still, you know, potentially use the frontier models, but you can use the frontier models in a way where you might only use it for the planning stage of the agent and all the search and summarization is using open models.

And so that drives a lot of cost efficiency.

In some of our newer blueprints, we see a 30X cost reduction by doing a hybrid model architecture across your, you know, private unstructured information.

And so that is a use case, the enterprise search, I think is, you know, very, a broadly generalized use case, they get us from, you know, I'll say these early adopters that we're getting seeing the benefits of agents for coding into really how knowledge workers are going to start to use agents to help them do their jobs in a much more productive and efficient way.

As somebody who sits more on the knowledge worker than software developer side of the fence myself, getting me more towards that and away from vibe coding is probably a good idea, but that's just my own sort of personal use case there.

But that does make me want to double click a little bit on, you know, on security and governance and things like this, which, you know, I think Chris, you mentioned at the top with the advent of, I mean, joking side with the advent of, you know, coding tools, vibe coding tools, and these more advanced, gentle coding tools in the hands of anybody, including folks like me, it's easy to spin something up.

I don't know that it has, you know, a hole in it waiting for a prop injection attack or whatever the case may be, right, and get into that shadow IT world Chris, you were talking about.

So I want to ask you both and just and I'll start with you because you were talking about it a little bit just now.

When you talk about planning and building an AI factory, what are the non-negotiable capabilities that have to build be built in that the enterprise must have to move from, you know, kind of first experiments and prototypes with AI to production, getting into industrial scale production, AI use cases with confidence.

And, you know, just in you mentioned some of these, but there's security, there's governance, reliability, obviously moving to scale.

You talk a little bit about some of these factors.

Yeah, and I think I'll say in the software development where I'm really good at separating the notion of development versus production.

And I think that's obviously the best best practice is enterprises get going is to separate the two.

And on the one hand, you want to help your internal, I'll call it AI development teams, do discovery in a development environment, but separate that access control from, you know, production data until you've basically, you know, proven the verification or done the functional verification of the the outcome that you're trying to get to, you've curated, you pen tested, it's got things like role-based access control so that if users use that agent, it inherits their permissions to access business systems and you're going to promote, you know, the agent from this development environment into a prod in that way.

And so I think, you know, I think the worst thing an enterprises can do is over-analyze this though and try and get to, like how do I get to the, how do I prove the TCO up front before I start to make the investment?

You've got to, you know, believe that AI is this new frontier and the companies that are able to harness it and put it so work for them are going to have a massive competitive advantage.

And so the sooner you get going, the better, and you can start in this, this dev environment with, I'll call it narrow use cases that are aligned to your core business goals and then, you know, scale as you start to see success.

But to your point, so you, you want to make sure these agents, you know, have, you know, there's a clear set of governance, there's, you know, clear ability to trace like the data systems that they access and that you can continuously evaluate them against known business outcomes that you're, you're trying to achieve.

And then that accuracy against certain use cases is what allows you to promote it then in, in their production.

Chris, how can I ask you how things like, well, inference, obviously we did an episode recently about, it was energy focus, but talking about the coming wave of inference and, you know, the shift of the load moving to some extent from training to inference, you know, maybe in this calendar year or whatever kind of the next wave is.

But talking about things like high performance inference and also high-breed cloud agility, how does the AI factory sort of figure in and support these two things in particular?

Simply put, inference is your production environment.

So training, whether it's pre-training or post-training, those are things that have a pre-production and inference is where you're bringing this intelligence to life.

So scale, efficiency, security, you know, robustness, reliability, compliance with policy, compliance with SLAs or SLAs.

These are like the table stakes.

And an AI factory is a significant investment for an enterprise.

The expectations are producers significant business outcomes.

And so we're focused on optimizing that production of outcomes, which you could back up and say those are, you know, business intelligence, or you could back up a little bit more and say it's simply tokens, optimize that, that throughput.

Of tokens in the context of cost and the context of power consumption because we're also power constrained.

And so how do we do that?

That's through this scaled-out inference scene, which is part of the AI factory.

It's really the core underlying platform that you're running all of your models and above that the agents and AI applications on top of.

So to me, it's the critical substrate and the agility that comes with flexibility and choice of where and how you deploy your models or your workloads, that notion of pre-production environments and production environments and where production data versus non-production data is used.

You get some choice and where you deploy.

And that to me is really the hybrid cloud.

You have your optionality, there's cloud environments, there's enterprise environments, there's even edge environments where you may want to deploy your workloads and taking advantage of all of that with a consistent footprint.

Like we're building with this AI factory, it gives you the best of all of your alternatives.

And so, you know, I think we're, we're bringing in the efficiency, we're bringing the flexibility, we're ensuring that we have those confindments where there's confidential computing or guardrails or any kind of sandbox technology that I think becomes really critical as we're building and delivering these new capabilities.

And if you go back in time before the focus on AI, we developed through decades of experience with Justin highlighted that pre-production, you know, dev tests, prod, kind of best practices, there's a whole set of learnings and rigor and discipline that we built in building and delivering applications and production that we're bringing as part of an AI factory for building and delivering AI applications into production.

I'm speaking with Chris Wright of Red Hat and Nvidia's Justin Boytano and we're talking about the AI factory and how enterprises can build AI factories that they can go to production with with confidence and can scale up to the future and really help transform companies into AI natives as we've been talking about.

Wanting to get into a little bit about specific infrastructure and software and platform components and Justin, I'll start with you for customers who are thinking about an initial AI factory footprint.

And might want to start small but have that ability to scale as they scale.

How should those customers think about sizing and selecting Nvidia infrastructure and software?

Yeah, I think it was the customer starts to train build the AI factory.

They got to think through the five-layer cake that I mentioned previously.

So where do I have data center power?

What is the power density of the data center?

Do I want to run air cooling or liquid cooling?

That seems to be a decision point right now.

A lot of enterprises still run air cooled data centers.

And so platforms like RTX 6000 give you very good price performance.

That's kind of a general purpose GPU to do experimentation with.

So if you don't know where to start, that kind of gives you a great platform for many different use cases.

And then from there, you start to ask yourself, well what's the orchestration management platform that want to run my business on?

And that's why we worked very closely with Red Hat Red Hat say I factory takes care of really the next few layers of the technology stack from software orchestration and management model delivery.

All the commercial security patching lifecycle management of all of that open source software so that you can run it with confidence and kind of get the factory up and running.

And then you get up into the application layers.

In the application layers, the way we try and make it easy for customers to start is we provide reference blueprints, which are examples of proven use cases that even we run on RAF factories at Nvidia for things like enterprise search that make it easy to then connect into your enterprise documents and do document ingestion and then start to provide benefits to your users.

And then from there, you can start to expand into your own developed use cases and such.

But that thinking through that full stack is really the easiest way to get going.

And then I think taking some of these proven examples is like kind of the the quick way to get it in early win with kind of your executive leadership team.

Yeah, yeah, with them the benefits.

And then from there, usually pivot into, you know, what's the most important business outcome for the company to be competitive?

They got asked yourself for Nvidia, where chip company were software company and were supply chain company when you really boil it down.

And so we then go super deep into those use cases to make sure that we're enabling, you know, tens of thousands of chip designers or software engineers or all the people dealing with all that components that allow us to have supply and availability of building this rack scale infrastructure and have a world class that those then we can be world class market.

And I think that's generally how companies should think about it.

Chris, on the red hat side, what are the key platform components that you see as foundational for this first AI factory deployment?

Thinking about things like open shift, red hat AI enterprise, AI factory within video, you know, when thinking about this first enterprise AI deployment, what are the key platform elements to start with?

And also how should customers think about sequencing them?

Yeah, I think for us the stack starts with hardware, hardware enablement, and then the distributed nature of Rax CLR architecture, how do you get access to that whole distributed system?

And then, you know, going up from there, we start getting to the specifics of models and, and agentic applications and AI enable applications.

So the bottom of the stack, very clearly, that's, that's the world of Linux, right, hardware enablement, device drivers, low-level systems software, and, you know, near and dear to our hearts, we spend a lot of time in that space and making sure that we work closely together within video to do that first phase, you know, right against the metal enablement.

The next layer above that is the distributed layer, bringing that Rax CLR architecture to life includes a distributed system like Kubernetes, Kubernetes is tried and true in the application space, and it's supporting well delivery of agents or models or other content as containers on this distributed system with access to all the accelerators down at the bottom of the stack.

And then that red hat AI enterprise layers on top.

And this is where we start to integrate directly with some of the key capabilities that Nvidia brings like optimized models like Nemotron or some of the nims.

And that's where we bring that distributed inferencing stack that is the foundation for intelligence for the business.

So, you know, we sometimes call this the metal to agent stack and starting, you know, with that layer, right above the hardware, building up through inferencing and then supporting the models, it is what we're building together to enable those key reference architectures that that Justin highlighted or the validated blue prints or the reproducible plays that you want to bring into the enterprise, because I think it's important to have those early wins that Justin highlighted that it's important to have those early wins.

It's an interesting tension, perfection is the enemy of good enough.

So, if you have this like perfect view of your future world where you've normalized all your data and everything is well-defined, you'll spend all of your time doing that and you'll never be able to get to showing some business value.

But if you over rotate to the easiest thing to do, the flashiest thing I can show, it might not have much business value.

So, picking those right first key use cases and also having in parallel this long-term mindset of it's a pretty fundamental shift in how we operate, you know, living in that duality, that's the future on building the right stack to support rapid movement, consistent, reproducible or replayable plays and, you know, building from infrastructure that IT operations teams already understand, they know Linux, they know Kubernetes, they're, you know, they're learning a lot of new things in this context, so we'll give them as much stability as we can along the way.

Yeah, no, it makes sense.

Going along those lines of the, uh, getting those first wins, right, which is a great strategy for, for lots of workplace projects to take on, but I think talking about such a big shift, you know, to the AI way of working, if you will, let's look at those first 90 days then and can you lay out some kind of practical first steps we've got, the elements of the joint stack laid out, the hardware, the software that, you know, metal to agents as you call to Chris, what are some practical things that folks listening to the podcast enterprise leaders can do and kind of structure their first 90 days to get some wins and really start building that AI factory that can grow?

Yeah, I'll assume the data center infrastructure is built out.

Let's assume these in this built-in infrastructure is built out, so, you know, what we, what we publish is what we call validated designs that sort of walk you through a lot of the design decision points of the software and, you know, you, you have to think of like, how do I want to, you know, how do I bring all of my software into this factory?

How do I make sure I do, you know, security scanning, you know, if I'm going to want to rescan everything and, and operate it.

How do I have automation to stand it up?

And then, you know, quickly, how do I get these first we call them blueprints, but think of them as like Kubernetes services that you deploy on the clusters to then get users on the system.

And then ultimately, what we do is we have, we call them like user acceptance test teams that we will roll an application out too to have them use the application.

So they can start, you can start to survey them and understand how are they doing work now versus how do they do it before, how much time are they saving versus how they did it before.

And really that time savings is the productivity gain that you're going after and you can really quickly get to, you know, from time savings across a user group to productivity, you know, gains.

And so if you can get a two X productivity gain, you know, across a big population of users, then you know, you're, you're onto something really big.

Absolutely.

Yeah.

Chris, I think that the learning that you'll gather along the way, I think is really important.

And so the notion of starting with a focused, you know, is have a hypothesis and a focused outcome and also iterating as you go.

So it's about how quickly can you move forward?

I think that's really important.

Our experience internally is reinforcing that.

And we started with some really focused examples of data that we want to bring together within, within Red Hat, the research we wanted to do across that data.

And having e-vows, I can't understand the importance of e-vows.

I think that it's an off, often overlooked part of the part of the sack, because they help you ensure the quality of what you're trying to produce.

And so, you know, building iteratively towards improving your e-vows, we see this in the public with frontier labs focused on benchmarks and e-vows.

But they're just as important within, within the enterprise.

And that iterative process of refining any portion of the stack, it could be your prompting, it could be how you're managing the data sourcing, it could be even the scoping of the problem that you're trying to solve.

I think that's, that's really important.

And that notion of picking something that's real.

So it's not so artificial that you can just show it.

It's flashy.

You get, you get high fives all around, but it doesn't really change anything internally.

I don't think that's particularly useful.

So focusing on those things that are real, but again, not making it too big.

So, right.

So, the right sizing and the iterative process of learning as you go is how you start building the thing that ultimately is quite big.

But, yeah, I think it's starting small and iterating, which we do a lot of no-pensource.

We do all software development.

And having a little bit of diversity, we have touchpoints across every different function in our organization.

There's different personas, but there's also different use cases.

It's more software development oriented, it's more finance oriented, it's more sort of sales and pipeline oriented.

Each of these brings a little different dimension that is, again, is helping you flesh out your end-to-end view of what's needed to go through whole-scale AI transformation and be operating with a full-till AI factory powering your business.

Yeah.

And as you get these first projects going and not to sort of skip all the hard work in between, but as you mentioned thinking about getting something going with an eye towards building out to scale and transforming the whole org, looking at it from the other perspective, what kinds of guardrails would be not just could people put in place, but what kinds of guardrails would you recommend would be appropriate kind of from the get-go to make sure that as things scale, as things expand, as you know, winds are one and people get excited and want to use the stuff more and go faster.

Where do the things you can lay down kind of from the beginning to make sure that, you know, technical and process and governments and, you know, the guardrails are in place through these kinds of things.

You know, I think, so one thing that we did upfront was we made sure obviously our security teams were deeply involved as we did this just to make sure, I think you learn a lot about your organization as you start to put AI to work and you'll find AI is really good at doing discovery and business systems that it has access to and you might realize, you know, you've got user permissions over scoped in areas and so having the security teams understand, you know, are we allowing two broad of access to what we want to keep confidential within the organization, you'll discover as you start to connect agents into your business systems.

But, you know, there's all kinds of techniques for, you know, guardrailing data access once you do find systems that I might have access to, you're going to realize that you've got to, you know, change permissions of many different business systems.

You know, ultimately what you're going to want to do is scope the agents potentially as users.

You think of digital employees.

So a lot of people start as they scope them to the user that's using the app, their permissions.

So they see access to the information that they've been granted as an employee in the organization.

But as we go forward, you know, these agents are going to start to work more more autonomously and we're going to have to treat them almost like contractors we bring in.

You give them least privilege access into your business systems and then they got to come back to you and check in with you and ask you for access to more business systems and you're going to have to have a process in place where you can, you know, slowly get grant them more access to do the job and fully onboard them into the job that we're asking them to do.

Chris, I'm going to turn this one to you first.

But, you know, Justin, you can be thinking in the background about your answer.

We like to end these.

So the more, the more time passes, the more I feel like it's an unfair thing to ask at the end of the AI podcast, what's the future going to look like, right?

For obvious reasons.

But if we look ahead, Chris, a year or two, maybe three years down the line, if you're feeling really bold, what does the AI factory look like?

As, you know, agente AI develops and models keep developing in the infrastructure, keeps developing.

But especially as, you know, more enterprises put these systems, build these factories and use them and put them to use solving real problems and driving new ways of working.

What do you think the AI factory looks like a couple of years hence?

And think you'd take it from a few different points of view that the one angle would be the layer cake picture.

And that one, we have a pretty good understanding of the layer cake.

So while there might be some subtleties, certainly in terms of specific tools that will come and go over time, that layering of what we're describing from hardware up through AI enabled applications, I don't think it's something that will fundamentally change.

So you look a few years ahead, we'll see something that looks quite similar.

How it's used by the enterprise, I think is what's going to shift completely in that timeframe.

Today, a more sophisticated enterprise has some agents in production, but they're not entirely agente and it's not translated into the core of their operations, essentially.

And so that, to me, is the shift that we should anticipate the autonomous nature of agents and the scoping of tasks will continue to grow.

So initially, it was, you know, the simple chatbot, which is just essentially fetching information, then you, you got a little more sophisticated with stronger and stronger recommendations.

You could call that some kind of an assistant, the doing phase of agents and total autonomy, where we're seeing that time horizon just stretch, it feels like almost daily, stretch out to be longer and longer.

So you can, you know, the coding context.

You can give coding agents very sophisticated tasks and they will spend hours and hours for different areas to get a code as a result.

That's just the coding example.

It's, you know, it's language as well structured.

It's a good template for how we should think about the, the breadth of the enterprise.

And so in the end, the AI factory, you know, the layers look similar.

The sophisticated sophistication of the tasks grows and it becomes the core of the business.

It becomes the place where we do our, we build our, our operational practices around.

And so in the end, it's, it's not that we're going to go through and kind of augment each of today's processes, because if you just think of it like that, you, you take a bunch of questionable in some places, even stupid processes and automate them and then you get an automated, stupid process.

It's really redefining how we work together completely and to end and where agents take on critical tasks in the business that I think is that that future view, which again, you, you put some timeframes on it.

We're not talking decades.

We're talking quarters away, which is right, self kind of phenomenal, but yeah, I think that's, to me, that's the, that, that future outlook.

Well said, Justin, your thoughts?

Yeah, I think the way Chris framed it is right is, you know, software development, even I'm saying in the last six months has evolved, where you can give AI, I'll say, almost like a design document, and let it go off and think and produce the code, and then do, I'll call it functional verification of that code to make sure that it's, it's accomplished, it's task, right, before it comes back to you.

And so it's doing very long-running thinking and work, you know, that is the work of, you know, many, many, many, many software engineers.

I'll just say, you know, I think in the software engineering world, like I said, we've seen this product market fit where, you know, we're seeing a two to three-act productivity gain with software engineers that can use these long-running agents.

And if you extrapolate that out, you know, the productivity gains, you know, for the whole software industry is massive.

But we're now seeing that move into this knowledge work or world, and, you know, CAD designers, engineers across every industry, where they can do the same thing, where they can start to give design document goals to long-running agents, where they can basically explain the exit criteria and give the agent the tools to do the functional verification and say, come back when you're done.

And so I think that's what the future of work is going to look like in two to three years.

You're going to have different agents working for you that you give these more structured, long-running tasks to.

They go off and think and do the work and then they come back to check in in a period of time.

And, you know, that will make us all, you know, infinitely more productive than we are today in, you know, searching through UIs, trained to find information on one side thing.

You know, we're going to live through a big, you know, change in how we work in the next couple of years, but every company across every industry and every job function will, will really be transformed with the use of any eye factor.

Perfect place to leave it.

Chris, for listeners who would like to learn more about your work, the work that Red Hat is doing, places online, they can go obviously website, social media, technical blog, other places where would you direct a listener to learn more about what Red Hat is doing with AF Actors?

The easiest one would be learn more about the Red Hat AF Actors in video.

So that's sort of an easy thing to search.

Chris, you'll find information from Red Hat.com, you'll find more information together with this video on the video website.

And that's a really easy place to start digging into the Red Hat view on, on all this content.

Fantastic.

Chris Wright, Red Hat, Justin Boytino of Nvidia again.

Thank you both so much for taking the time to come on the pod and talk about AF Actors and really the future of work is we landed on Justin.

It's exciting time to be alive.

Thank you guys.

Thanks to one.

Thank you.