
Latent Space · 2025-04-03
The Creators of Model Context Protocol — Anthropic's MCP Origin Story, Design, and Roadmap
Hosts: Alessio (Partner and CTO at Decibel), Sowix (Founder of Small.AI)
Guests: Justin Spahr-Summers (Anthropic, MCP co-creator, ex-Facebook/React), David Soria Parra (Anthropic, MCP co-creator)
Why it matters
MCP vs OpenAPI: Anthropic positions them as complementary.
Key claims
- MCP is Anthropic's client-server protocol for extending AI applications (not models directly), inspired by LSP and built on JSON-RPC, started July 2024, released November 25, 2024.
- Three core primitives: Tools (model-controlled actions), Resources (app/user-controlled data, URI-addressed, ideal for RAG and attachments), and Prompts (user-initiated, supports multi-step/macro use). Anthropic says the ecosystem over-indexes on tool calling.
- New 'Streamable HTTP' transport replaces SSE — plain HTTP POST by default with optional SSE streaming and session resumption, balancing statefulness with operational simplicity for remote servers.
- MCP vs OpenAPI: Anthropic positions them as complementary; OpenAPI is too granular and lacks AI-specific primitives; community bridges already translate between the two.
Episode summary
Summary
In this Latent Space episode, Anthropic engineers Justin Spahr-Summers and David Soria Parra — the co-creators of MCP — give the first major podcast deep-dive on the Model Context Protocol, which they frame as the "USB-C port of AI applications" and the winner of the agent open-standard wars of 2023–2025. MCP is a client-server protocol that solves the M×N integration problem for AI apps, borrowing JSON-RPC and bidirectionality from LSP but deliberately diverging in key design choices. Anthropic positioned the protocol around application developers (not models directly), emphasizing that the client app and user remain in full control of what is exposed and invoked.
The conversation digs into MCP's three core primitives — tools (model-initiated actions), resources (application- or user-controlled data/context, URI-addressed), and prompts (user-initiated templates/multi-step interactions) — and argues the ecosystem over-relies on tool calling. The hosts discuss MCP vs OpenAPI (complementary, not competing), the move from SSE to the new "Streamable HTTP" transport to enable horizontal scaling while preserving statefulness, draft OAuth 2.1 authorization, and registry/trust concerns (advocating the MCP Inspector over social-proof-based vetting). They also explore composability — recursive servers that are also clients, building agent graphs/DAGs out of MCP — while warning against protocol bloat ("don't try to do everything").
On governance, both guests stress MCP must remain an open, multi-company project (SDKs already maintained by Microsoft, JetBrains, Spring AI, Pedantic, etc.) but are wary of formal standards bodies slowing iteration. They highlight the AI Engineer World's Fair MCP track (June 3–5, 2025) and call for more full-spec clients (especially sampling clients), richer uses of resources and prompts, and integrations like Godot/Blender. Notable admissions: several reference servers were largely written by Claude itself, illustrating Anthropic's dogfooding of its own model.
- MCP is Anthropic's client-server protocol for extending AI applications (not models directly), inspired by LSP and built on JSON-RPC, started July 2024, released November 25, 2024.
- Three core primitives: Tools (model-controlled actions), Resources (app/user-controlled data, URI-addressed, ideal for RAG and attachments), and Prompts (user-initiated, supports multi-step/macro use). Anthropic says the ecosystem over-indexes on tool calling.
- New 'Streamable HTTP' transport replaces SSE — plain HTTP POST by default with optional SSE streaming and session resumption, balancing statefulness with operational simplicity for remote servers.
- MCP vs OpenAPI: Anthropic positions them as complementary; OpenAPI is too granular and lacks AI-specific primitives; community bridges already translate between the two.
- Authorization spec uses OAuth 2.1, scope discussions deferred until driven by concrete use cases; deliberate minimalism to avoid locking in bad decisions.
- Recursive composability: an MCP server can also be an MCP client, enabling DAGs of agents and server multiplexing — but Anthropic intentionally resists scope creep to keep MCP focused.
- Trust/vetting: Anthropic warns against download-count-style registries, recommends the MCP Inspector for inspecting traffic; many reference servers were written by Claude itself.
- Governance: open multi-company project already (Microsoft/C#, JetBrains/Kotlin, Spring AI/Java, Pedantic/Python), but no formal foundation yet — Anthropic fears standards-body process would slow iteration in fast-moving AI.
Source material
Transcript
welcome back.
MCPMCPMCPMCP After the NYC Summit, we wrote a popular piece explaining why we think the Model Context Protocol from readable seems to have won the agent open standard wars of 2023 to 2025.
It seems everyone is now jumping on the MCP bandwagon, from cursor and windsurf to OpenAI to Google DeepMind.
Our AI Engineer community is hungry for more, so we are doing two things to explore the MCP phenomenon.
First, today's guests, Justin Spa Summers and David Soria Parra, are the co-creators of the Model Context Protocol, who were kind enough to do their first ever podcast with us about the origin, challenges and future of MCP, and gamely indulged all the questions that we asked from the latent space community.
Second, Sowix has announced that there will be a dedicated MCP track at the 2025 AI Engineer World's Fair, taking place June 3rd to 5th in San Francisco, where the MCP core team and major contributors and builders will be meeting.
Join us and apply to speak or sponsor at AI.engineer.
Watch out and take care.
Hey everyone, welcome back to Latent Space.
This is Alessio, partner and CTO at Decibel, and I'm joined by my co-host Sowix, founder of Small.AI.
Hey, morning, and today we have a remote recording, I guess, with David and Justin from Anthopik over in London.
Welcome.
Hey, good to be here.
Welcome.
You guys have created a storm of hype because of MCP, and I'm really glad to have you on, and thanks for making the time.
What is MCP?
Let's start with a crisp, what definition of MCP is it?
From the horse's mouth, and then we'll go into the origin story.
But let's start off right off the bat.
What is MCP?
Sure.
So Model Context Protocol, or MCP for short, is basically something we designed to help AI applications extend themselves or integrate with ecosystem of plugins, basically.
The terminology is a bit different.
We use this client server terminology, and we can talk about why that is and where that came from.
But at the end of the day, it really is that, is extending and enhancing the functionality of AI application.
David, would you add anything?
Yeah, I think that's a really good description.
I think there's a lot of different ways for how people are trying to explain it, but at the core, I think what Justin said is extending AI applications is really what this is about.
And I think the interesting bit here that I want to highlight is AI applications and not models themselves that this is focused on.
That's a common misconception that we can talk about a bit later.
But yeah.
Another version that we've used and gotten to like is like, MCP is kind of like the USB-C port of AI applications and that it's meant to be this universal connector to a whole ecosystem of things.
Yeah.
Specifically, an interesting feature is, like you said, the client and server, and it's also sort of two-way, right?
In the same way that it's set up, USB-C is two-way, which could be super interesting.
Yeah.
Let's go into a little bit of the origin story.
There's many people who've tried to make standards around agents.
Many people try to build open source.
I think there's an overall also my sense is that Anthropic is going hard after developers in the way that other labs are not.
And so I'm also curious if there was any external influence or was it just you two guys just in a room somewhere riffing?
It is actually mostly like us two guys in a room, I think.
So this is not part of a big deal.
If you roll back time a little bit and go into July 2024, I was like started at Anthropic three months earlier, two months earlier, and I was mostly working on internal developer tooling, which is what I've been doing for years and years before.
And as part of that, I think there was an effort of how do I empower more employees at Anthropic to integrate really deeply with the models we have because we've seen just how good it is, how amazing it will become even in the future.
And of course, you know, just dog food your own model as much as you can.
And as part of that, from my development tooling background, I quickly got frustrated by the idea that, you know, on one hand side I have Cloud Desktop, which is this amazing tool with artifacts, which I really enjoyed, but it was very limited to exactly that feature set.
And it was there was no way to extend it.
And on the other hand side, I like work in IDEs, which could greatly like act on like the file system and a bunch of other things.
But then they don't have artifacts or something like that.
And so what I constantly did was just copying things back and forth on between Cloud Desktop and the IDE.
And that quickly got me honestly just very frustrated.
And part of that frustration wasn't like, how do I go and fix this?
What do we need?
And back to like this development developer like focus that I have, I really thought about like, well, I know how to build all these integrations, but what do I need to do to let these applications let me do this.
And so it's very quickly that you see that this is clearly like an M times N problem, like you have multiple like applications and multiple integrations you want to build and like what there is better there to fix this than using a protocol.
And at the same time, I was actually working on an LSP related thing internally that didn't go anywhere.
But you put these things together in someone's brain and let them wait for like a few weeks and out of that comes like the idea of like, let's build some protocol.
And so back to like this little room, like it was literally just me going to a room with Justin and go like, I think we should build something like this.
This is a good idea.
And Justin, lucky for me just really took an interest in the idea and took it from there to like to build something together with me.
That's really the inception story is like, it's asked to from then on just going and building it over over the course of like, like a month and a half of like building the protocol, building the first integration like Justin did a lot of the, like the heavy lifting of the first integrations and cloud desktop.
I did a lot of the first proof of concept of how this can look like in an IDE.
And if you, we could talk about like some of the tidbits you can find way before the inception of like before the official release, if you were looking at the right repository at the right time.
But there you go.
That's like some of the rough story.
What was the timeline?
I know November 25th was like the official announcement date.
When did you guys start working on it?
Justin, when did we start working on that?
I think it was around July.
I think.
Yeah, as soon as David pitched this initial idea, I got excited pretty quickly and we started working on it.
I think almost immediately after that conversation.
And then I don't know, it was a couple, maybe a few months of building the really unrewarding bits, if we're being honest, because for, for establishing something that's like this communication protocol has clients and servers and like SDKs everywhere, there's just like a lot of like laying the groundwork that you have to do.
So it was a pretty, that was a pretty slow couple of months.
But then afterward, once you get some things talking over that wire, it really starts to get exciting and you can start building all sorts of crazy things.
And I think this really came to a head.
I don't remember exactly when it was maybe like approximately a month before release, there was an internal hackathon where some folks really got excited about MCP and started building all sorts of crazy applications.
I think the coolest one of which was like an MCP server that can control a 3d printer or something.
And so like suddenly people are feeling this power of like cloud connecting to the outside world in a really tangible way.
And that really added some, some juice to us into the release.
Yeah.
And we'll go into the technical details, but I just want to wrap up here.
You mentioned you could have seen some things coming if you were looking in the right places.
We always want to know what are the places to get alpha, how, how to find MCP early.
I'm a big ZED user.
I liked the ZED editor.
The first MCP implementation on an IDE wasn't that it was written by me and it was there like a month and a half before the official release just because we needed to do it in the open because it's an open source project.
And so it was, it was not that was named slightly differently because we were not set on the name yet, but it was there.
I'm happy to go a little bit enthopic.
Also had some preview of a model with ZED, right?
Some kind of fast editing model.
I, I'm, I confess, you know, I'm a cursor windsurf user.
Haven't tried ZED.
What's, what's your, you know, unrelated or unsolicited two second pitch for, for ZED?
That's a good question.
I, it really depends what you value in editors.
And for me, I wouldn't even say I like, I love ZED more than others.
I like them all like complimentary in a way, another like I do use windsurf, I do use ZED.
But I think my, my main pitch for ZED is low latency, super smooth experience editor with a decent enough AI integration.
I mean, and maybe, you know, I think that's, that's all it is for a lot of people.
I think a lot of people obviously very tied to the VS code paradigm and the extensions that come along with it.
Okay.
So I wanted to go back a little bit, you know, on, on, on some of the things that you mentioned, Justin, which was building MCP on paper.
You know, obviously we only see the end result.
It just seems inspired by LSP.
And I think both of you have acknowledged that.
So how much is there to build?
And when you say build, is it a lot of code or a lot of design?
Because I felt like it's a lot of design, right?
Like you're picking JSON RPC, like how much did you base off by LSP and, and you know, what, what was the sort of hard, hard parts?
Yeah, absolutely.
I mean, we definitely did take heavy inspiration from LSP.
David had much more prior experience with it than I did the work on developer tools.
So, you know, I've mostly worked on products or sort of infrastructural things, LSP was new to me, but as a, as a like, or from design principles, it really makes a ton of sense because it does solve this M times N problem that David referred to where, you know, in the world before LSP, you had all these different IDEs and editors, and then all these different languages that each wants to support or that their users want them to support.
And then everyone's just building like one off integrations.
And so like you use VIM and you might have really great support for like, honestly, I don't know, C or something.
And then like you switch over to JetBrains and you have the Java support, but then like, you don't get to use the great JetBrains Java support in VIM and you don't get to use the great C supported JetBrains or something like that.
So LSP largely, I think solved this problem by creating this common language that they could all speak.
And then, you know, you can have some people focus on really robust language server implementations.
And then the IDE developers can really focus on that side and they both benefit.
So that was like RTP takeaway for MCP is like that same principle and that same problem in the space of AI applications and extensions to AI applications.
But in terms of like concrete particulars, I mean, we did take JSON RPC and we took this idea of bidirectionality, but I think we quickly took it down a different route after that.
I guess there is one other principle from LSP that we try to stick to today, which is like this focus on how features manifest more than the semantics of things, if that makes sense.
David refers to it as being presentation focused, where like basically thinking and like offering different primitives, not because necessarily the semantics of them are very different, but because you want them to show up in the application differently.
Like that was a key sort of insight about how LSP was developed.
That's also something we try to apply to MCP.
But like I said, then from there, like, yeah, we spent a lot of time, really a lot of time and we could go into this more separately, like thinking about each of the primitives that we want to offer an MCP and why they should be different.
Like why we want to have all these different concepts.
That was a significant amount of work.
That was the design work as you allude to.
But then also already out of the gate, we had three different languages that we wanted to at least support to some degree, that was TypeScript Python.
And then for the Zed integration, it was Rust.
So there was some SDK building work in those languages, a mixture of clients and servers to build out, to try to create this like internal ecosystem that we could start playing with.
And then yeah, I guess just just trying to make everything like robust over like, I don't know, this whole like concept that we have for local MCP where you like launch sub processes and stuff and making that robust took some time as well.
Yeah.
Maybe adding to that, I think the LSP inference goes even a little bit further.
Like we did take actually quite a look at criticisms on LSP, like things that LSP didn't do right and things that people felt they would love to have different and really took that too hard to like see, you know, what are some of the things that we wish, you know, we should do better.
We took a, you know, like a lengthy, like look at like their very unique approach to JSON RPC I may say.
And then we decided that this is not what we do.
And so there's like these differences, but it's clearly very, very inspired because I think when you're trying to build and focus, if you're trying to build something like MCP, you kind of want to pick the areas you want to innovate in, but you kind of want to be boring about the other parts in pattern matching LSP to the problem allows you to be boring in a lot of the core pieces that you want to be boring in.
Like the choice of JSON RPC is very noncontroversial to us because it's just like, it doesn't matter at all.
Like what you, what the action like bites on the bar that you're speaking, it makes no difference to us.
The innovation is on the, the primitives you choose and these types of things.
And so we, there's way more focus on that that we wanted to do.
So having some prior art is good there basically.
It does.
I wanted to call to double click.
I mean, there's so many things you can go into.
Obviously I am passionate about protocol design.
I wanted to show you guys this.
I mean, I think you guys know, but you know, you already referred to the M times N problem and I can just share my screen here about anyone working in developer tools has faced this exact issue where you see the God box basically like the fundamental problem and solution of all infrastructure engineering is you have things going to end things and then you put the God box and they'll all be better.
Right.
So here is one problem for Uber, one problem from GraphQL, one problem from a temporal where I used to work at.
And this is from reacts.
And I was just kind of curious, like, you know, did you solve M times N problems at Facebook?
Like I, it sounds like David, you did that for a living, right?
Like this is just M times N for a living.
Yeah.
Yeah.
To, to, to some, some degree for sure.
I did.
God, what a good example of this, but like I did a bunch of this kind of work on like source control systems and these type of things.
And so there were, there were a bunch of these type of problems as well.
And so you just shove them into something that everyone can read from and ever can write to and you build that God box somewhere and it works.
Yeah.
But yeah, this is, it's just in developer tooling.
You absolutely right in developer tooling.
This is everywhere, right?
And that, you know, it shows up everywhere.
And what was, what's interesting is I think everyone who makes the God box then has the same set of problems, which is also, you now have like composability off and the remotes versus local, you know, that there's, there's this very common shared approach set of problems.
So I kind of want to take a meta lesson on how to do the God box, but we, you know, we can, we can talk about the, the sort of development stuff later.
I wanted to double click on again, the presentation that Justin mentioned of like how features manifest and how you said some things are the same, but you just want to reify some concepts so they show up differently.
And I had that sense, you know, when I was looking at the MCP docs, I'm like, why do these two things need to be the difference in other paradigms?
They're basically the same.
I think a lot of people treat tool calling as the solution to everything.
Right.
And sometimes you can actually sort of view kinds of different kinds of tool calls as different things.
And sometimes they're resources.
Sometimes they're actually taking actions.
Sometimes they're something else that I don't really know yet, but I just want to see like, what are some things that you sort of mentally group as adjacent concepts and why were they important to you to emphasize?
Yeah, I can chat about this a bit.
I think fundamentally we, we, every sort of primitive that we thought through, we thought from the perspective of the application developer first like if I'm building an application, whether it is an IDE or, you know, call a desktop or some agent interface or whatever the case may be, what are the different things that I would want to receive from like an integration?
And I think once you take that lens, it becomes quite clear that that tool calling is necessary, but very insufficient.
Like there are many other things you would want to do besides just get tools and plug them into the model.
And you want to have some way of differentiating what those different things are.
So the kind of core primitives that we started MCP with, we've since added a couple more, but the core ones are really tools, which we've already talked about is like adding tools directly to the model or function calling is sometimes called resources, which is basically like bits of data or context that you might want to add to the context.
So excuse me, to the model context.
And this, this is the first primitive where it's like, we decided this could be like application controlled.
Like maybe you want a model to automatically search through and find relevant resources and bring them into context, but maybe you also want that to be an explicit UI affordance in the application where the user can like, you know, pick through a dropdown or like a paperclip menu or whatever, and find specific things and tag them in.
And then that becomes part of like their message to the LLM.
Like those are both use cases for resources.
And then the third one is prompts, which are deliberately meant to be like user initiated or like user substituted text or messages.
So like the analogy here would be like, if you're an editor, like a slash command or something like that, or like an at, you know, auto completion type thing where it's like, I have this kind of macro effectively that I want to drop in and use.
And we have sort of expressed opinions through MCP about the different ways that these things could manifest, but ultimately it is for application developers to decide, okay, you get these different concepts expressed differently.
And it's very useful as an application developer, because you can decide the appropriate experience for each.
And actually, this can be a point of differentiation too.
Like we were also thinking, you know, from the application developer perspective, they, you know, application developers don't want to be commoditized.
They don't want the application to end up the same as every other AI application.
So like, what are the unique things that they could do to like, create the best user experience even while connecting up to this big open ecosystem of integration?
Yeah.
And I think to add to that, the, I think there are two aspects to that that I want to mention.
The first one is that interestingly enough, like while nowadays tool calling is obviously like probably like 95% plus of the integrations.
And I wish there would be, you know, more clients doing tool resources, doing prompts.
The very first implementation in Z is actually a prompt implementation.
It doesn't deal with tools.
And, and we found this actually quite useful because what it allows you to do is, for example, build an MCP server that takes like a backtrace from Sentry or any other like online platform that that tracks your, your crashes and just lets you pull this into the context window beforehand.
And so it's quite nice that way, that it's like a user driven interaction that you does a user decide when to pull this in and don't have to wait for the model to do it.
And so it's a great way to craft the prompt in a way.
And I think similarly, you know, I wish, you know, more MCP servers today would bring prompts as examples of like how to even use the tools that they're providing at the same time.
The resources bits are quite interesting as well.
And I wish we would see more usage there because it's very easy to envision, but yet nobody has really implemented it in a system where like an MCP server exposes, you know, a set of documents that you have your database, whatever you might want to as a set of resources.
And then like a client application would build a full RAG index around this, right?
This is definitely an application use case we had in mind as to why these are exposed in such a way that they're not model driven because you might want to have way more resource content than is, you know, realistically usable in a context window.
And so I think, you know, I wish applications and I hope applications will do this in the next few months, use these primitives, you know, way better because I think that there's way more rich experiences to be created that way.
Yeah, I completely agree with that.
And I would also add that.
I'll turn to it if I have it.
I think that's a great point.
And everybody just, you know, has a hammer and wants to do tool calling on everything.
I think a lot of people do tool calling to do a database query that are use resources for it.
What are like the, I guess, maybe like pros and cons or like when people should use a tool versus a resource, especially when it comes to like things that do have an API interface.
Like for a database, you can do a tool that does a SQL query versus when should you do that or a resource and stand with the data.
So like the way we separate these is like tools are always meant to be initiated by the model.
It's sort of like after models discretion that it will like find the right tool and apply it.
So if that if that's the interaction you want as a server developer where it's like, okay, this, you know, suddenly I've given the LLM the ability to run a SQL queries, for example, that makes sense as a tool, but resources are more flexible, basically.
And I think to be completely honest, the story here is practically a bit complicated today because many clients don't support resources yet.
But like, I think in an ideal world where all these concepts are fully realized and there's like full ecosystem support, you would do resources for things like the schemas of your database tables and stuff like that as a way to like either allow the user to say like, okay, now, you know, Claude, I want to talk to you about this database table.
Here it is.
Let's have this conversation or maybe the particular AI application that you're using, like, you know, it could be something agentic like Claude code is able to just like agentically look up resources and find the right schema of the database table you're talking about.
Like both those interactions are possible.
But I think like anytime you have this sort of like, you want to list a bunch of entities and then read any of them, that makes sense to model as resources.
Resources are also they're uniquely identified by a URI always.
And so you can also think of them as like, you know, sort of general purpose transformers, even like if if you want to support an interaction where a user just like drops a URI in, and then you like automatically figure out how to interpret that you could use mcp servers to do that interpretation.
One of the interesting site notes here back to the Z example of resources is that has like a prompt library that you can do that you people can interact with.
And we just exposed a set of default prompts that we want everyone to have.
And as part of that prompt library, via resources for a while so that like you would have said and that will just populate the prompt library was from an mcp server, which was quite a cool interaction.
And that was again, a very specific like both sides needed to agree upon the the URI format and and the underlying data format and but that was a nice and kind of like neat little application of resources.
There's also going back to that perspective of like, as an application developer, what are the things that I would want, we also apply this thinking to like what existing features of applications could conceivably be kind of like factored out into mcp servers if you were to take that approach today.
And so like basically any ID where you have like an attachment menu, that I think naturally models as resources, it's just you know, those implementations already existed.
Yeah, I think the immediate like, you know, when you introduced it for cloud desktop, and I saw the add sign there, I was like, Oh, yeah, that's what cursor has.
But this is for everyone else.
And you know, I think like that that is a really good design target, because it's something that already exists and people can map on pretty neatly.
I was actually featuring this chart from my hashes workshop that presumably you guys agreed on.
I think this is so useful that it should be on the front page of the docs.
Like, probably should be I think that's the suggestion.
Do you want to do a PR for this?
I love it.
Yeah, do a PR.
I've done a PR for just my hashes workshop in general, just because I'm like, you know, I know I approve.
Yeah.
Thank you.
Yeah, I mean, like, but you know, I think for me as a developer relations person, I always insist on having a map for people.
Here are all the main things you have to understand.
We'll spend the next two hours going through this.
So some one image that kind of covers all this, I think it's pretty helpful.
And I like your emphasis on prompts.
I would say that it's interesting that like, I think, you know, in the earliest early days of like chat GPT and Claude, people often came up with Oh, you can't really follow my screen, can you?
In the early days of chat of chat GPT and all that, like a lot of people started like, you know, GitHub for prompts, like we'll do prompt manager libraries and, and like those never really took off.
And I think something like this is helpful and important.
I would say like, I've also seen prompts file from human loop, I think as other ways to standardize how people share prompts.
But yeah, I agree that like, there should be more innovation here.
And I think probably people want some dynamicism, which I think you, you afford, you allow for.
And I like that you have multi step that this was, this is the main thing that got me like, like these guys really get it.
And I think you maybe have published some research that says like, actually, sometimes to get to get the model working the right way, you have to do multi step prompting or jail breaking to behave the way that you want.
And so I think prompts are not just single conversations, they're sometimes chains of conversations.
Yeah.
Another question that I had when I was looking at some server implementations, the server builders kind of decide what data gets eventually returned, especially for tool calls, for example, the Google Maps one, right?
If you just look through it, they decide what, you know, attributes kind of get returned.
And the user cannot override that if there's a missing one, that has always been my gripe with like SDKs in general, when people build like API wrapper SDKs, and then they miss one parameter that maybe is new.
And then I cannot use it.
How do you guys think about that?
And like, yeah, how much should the user be able to intervene in that versus just letting the server designer do all the work?
I think we probably bear responsibility for the Google Maps one, because I think that's one of the reference servers we've released.
I mean, in general, for things like for tool results in particular, we've actually made the deliberate decision, at least thus far for tool results to be not like sort of structured JSON data, not matching a schema really, but as like a text or images or basically like messages that you would pass into the LLM directly.
And so I guess the corollary from that is you really should just return a whole agenda of data and trust the LLM to like sort through it and sift and like, you know, extract the information it cares about because that's what that's exactly what they excel at.
And we really try to think about like, yeah, how to, you know, use our lunch to their full potential and not maybe over specify and then end up with something that doesn't scale as LLMs themselves get better and better.
So really, yeah, I suppose what should be happening in this example server, which again, we'll request welcome, it'd be great.
It's like, if all these result types were literally just passed through from the API that it's calling, and then the LLM can can do whatever it wants with the data.
Yeah, yeah, that to me is like the USB C part of this, you know, which is like, Hey, this is kind of the file, so to speak on this on the server, which is what the API returns.
And then you're kind of funneling it through without doing too much in the middle.
Yeah, but at the same time, it's like you need to do some work on some of the pieces, because sometimes they have like weird, you know, encoding or like all these different things that maybe the server should handle.
But yeah, it's a hard, it's a hard design decisions on where to draw the line.
Maybe throw AI under the bus a little bit here and just say that Claude wrote a lot of these example servers.
No surprise at all.
But I do think, sorry, I do think there's an interesting point in this that I there, I do think people at the moment still to mostly still just apply their normal software engineering API approaches to this.
And I think we're still need a little bit more relearning of how to build something for LLMs and and trust them, particularly, you know, as they are getting significantly better a year to year, right.
And I think, you know, two years ago, maybe that approach would have been very valid.
But nowadays, just like, just throw data at that thing that is really good at dealing with data is a good approach to this problem.
And I think this is like, I'm learning like 20, 30, 40 years of software engineering practices that go a little bit into this to some degree.
If I could add to that real quickly, just one framing as well for MCP is thinking in terms of like how crazily fast AI is advancing.
I mean, it's exciting.
It's also scary, like thinking, us thinking that like, the biggest bottleneck to, you know, the next wave of capabilities for models might actually be their ability to like interact with the outside world to like, you know, read data from outside data sources or like take stateful actions.
Working at a topic, we absolutely care about doing that safely and with the right control and alignment measures in place and everything.
But also as AI gets better, people will want that that will be key to like becoming productive with AI is like being able to connect them up to all those things.
So MCP is also sort of like a bet on the future and where this is all going and how important that will be.
Yeah, yeah, I would say any API attribute that says formatted underscore should kind of be gone.
And we should just get the raw data from all of them.
Because why, you know, why are you formatting for me?
The model is definitely smart enough to format an address.
So I think that should go to the end user.
Yeah, I have I think Alessio is about to move on to like server implementation.
I wanted to I think we were talking, we're still talking about sort of MCP design and goals and intentions.
And we've I think we've indirectly identified like some problems that MCP is really trying to address.
But I wanted to give you the spot to directly take on MCP versus open API.
Because I think obviously there's this is a top question.
I wanted to sort of recap everything we just talked about and give people a nice little segment that people can say, like, this is a definitive answer on MCP versus open API.
Yeah, I think fundamentally, I mean, open API specifications are a very great tool.
And like I've used them a lot in developing API's and consumers of API's I think fundamentally or we think that they're just like too granular for what you want to do with LLMs.
Like they don't express higher level AI specific concepts like this whole mental model that we've talked about with the primitives of MCP and thinking from the perspective of the application developer, like you don't get any of that when you encode this information into an open API specification.
So we believe that models will benefit more from the purpose built or purpose design tools, versus prompts and the other primitives then just kind of like here's our REST API go wild.
I do think there's another aspect I think that I'm not an open API expert.
So I might everything might not be perfectly accurate.
But I do think that we're like, there's been and we can talk about this a bit more later, there's a deliberate design decision to make the protocol somewhat stateful, because we do really believe that AI applications and AI like interactions will become inherently more stateful.
And that we're the current state of like, like need for statelessness is more a temporary point in time that will be you know, to some degree that will always exist.
But I think like more statefulness will become increasingly more popular, particularly when you think about additional modalities that go beyond just pure text based, you know, interactions with models, like it might be like video, mod audio, whatever other modalities exist and out there already.
And so I do think that like having something a bit more stateful is just inherently useful in this interaction pattern, I do think they're actually more complimentary open API and MCP than people wanted to make it out.
Like people look for these like, you know, a versus B and like, you know, have have all the all the developers of these things go in a room and fist fight it out.
But that's rarely what's going on.
I think it's actually they're very complimentary and they have their little space where they're very, very strong.
And I think, you know, just use the best tool for the job.
And if you want to have a rich interaction between an AI application, it's probably like it's probably MCP, that's the right choice.
And if you want to have like an API spec somewhere that is very easy and like a model can read and interpret and that's what what worked for you then open API is the way to go.
One more thing to add here is that we've already seen people I mean, this happened very early people in the community builds like bridges between the two as well.
So like, if what you have is an open API specification, and no one's you know, building a custom MCP server for it, there are already like translators that will take that and re expose it as MCP and you could do it the other direction too.
Awesome.
Yeah, I think there's the other side of MCPs that people don't talk as much about because it doesn't go viral, which is building the servers.
So I think everybody does the tweets about like on that, they call desktop to XMCP.
It's amazing.
How would you guys suggest people start with building servers?
I think the spec is like so there's so many things you can do that it's almost like how do you draw the line between being very descriptive as a server developer versus like going back to our discussion before like just take the data and then let the model manipulate it later.
Do you have any suggestions for people?
I think there are a few suggestions.
I think that one of the best things I think about MCP and something that we got right very early is that it's just very, very easy to build like something very simple that might not be amazing, but it's pretty it's good enough because models are very good and get this going within like half an hour.
You know, and so I think that the best part is just like pick the language of you know, of your choice that you love the most pick the SDK for it if there's an SDK for it and then just go build a tool of the thing that matters to you personally and that you want to see the model like interact with build the server through the tool and don't even worry too much about the description just yet like do a bit of like write your little description as you think about it and just give it to the model and just throw it to standard IO protocol transport wise into like an application that you like and see it do things and I think that's part of the magic that or like, you know empowerment and magic for developers to get so quickly to something that the model does something that you care about that I think really gets going and gets you into this flow of like, okay, I see this thing can do cool things.
Now I go and and can expand on this and now I can go and like really think about like which are the different tools I want which are the different resources and prompts I want.
Okay, now that I have that okay now do I what what do my evils look like for how I want this to go how do I optimize my prompts for the evils using like tools like that.
This is infinite depth that you can do but just start as simple as possible and just go go build a server in like half an hour in the language of your choice and and how the model interacts with the things that matter to you and I think that's that's where the fun is that and I and I think people I think a lot of what MCP makes and what likes greatest it just adds a lot of fun to to the development piece to just go and go and have models to things quickly.
I also I'm quite partial again to using AI to help me do the coding like I think even even during the initial development process we realized it was quite easy to to basically just take all the SDK code again you know what David suggested like you know pick the language you care about and then pick the SDK and once you have that you can literally just drop the whole SDK code into an LLMs context window and say okay now that you know MCP build me a server that does this this this and it like the results I think are astounding like I mean it might not and I'll be perfect around every every single corner or whatever and you can refine it over time but like it's a great way to kind of like one-shot something that basically does what you want and then you can iterate from there and and like David said there has been a big emphasis from the beginning on like making servers as easy and simple to build as possible which certainly helps with LLMs doing it too we often find that like really getting started is like you know 102 outlines of code in the language of your choice it's really quite easy yeah but if you don't have an SDK again give that like give the subset of the spec that you care about to the model and like another SDK and just have it build you an SDK and it usually works for like that subset building a full SDK is a different story but like to get a model to tool call in Haskell or whatever like language you like it's probably pretty straightforward yeah I sorry no I was gonna say I co-hosted a hackathon at the AGI house on personal agents and one of the personal agents somebody bill was like an MCP server builder agent where they will visit you put the URL of API spec and it will build an MCP server for them do you see that today it's kind of like yeah most servers are just kind of like a layer on top of an existing API without too much opinion and how yeah do you think that's kind of like how it's gonna be going forward just like AI generated exposed the API that already exists or are we gonna see kind of like nothing you MCP experiences that you couldn't do before I think go for it I think both like I think there there will always be value in like oh I have you know I have my data over here I want to use some connector to bring it into my application over here that use case will certainly remain I think you know this kind of goes back to like I think a lot of things today are maybe default thing to tool use when some of the other primitives would be maybe more appropriate over time and so it could still be that connector it could still just be that sort of adapter layer but could like actually adapted on to different primitives which is one one way to add more value but then I also think there's plenty of opportunity for use cases which like do you know or for MCP servers that kind of do interesting things in and out themselves and are just adapters some of the earliest examples of this were like you have a memory MCP server which gives the LLM the ability to remember things across conversations or like someone is a close co-worker built the Irish not a close close someone built the sequential thinking MCP server which gives a model the ability to like really think step by step and get better at its reasoning capabilities this is something where it's like it really isn't integrating with anything external it's just providing this sort of like way of thinking for a model I guess either way though I think AI authorship of the servers is totally possible like I've had a lot of success in prompting just being like hey I want to build an MCP server that like does this thing and even if this thing is not adapting some other API but is doing something completely original it's usually able to figure that out too yeah I do I do think that the to add to that I do think that good part of what MCP servers will be will be these like just API wrapper to some degree and that's good and valid because that works and it gets you very very far but I think we're just very early like in in exploring what you can do and I think as client support for like certain primitives get better like we can talk about sampling this might be my favorite and topic and greatest frustration at the same time I think you can just see it very easily see like way way way richer experiences and we have we have built them internally for as prototyping aspects and I think you see some of that in the community already but there's just you know things like hey summarize my you know my my my my favorite subreddits if for the morning MCP server that nobody has built yet but it's very easy to envision and the protocol can totally do this and these are like slightly richer experiences and I think as people like go away from like the oh I just want to like I'm just in this new world where I can hook up the things that matter to me to the LLM to like actually want a real workflow a real like like more richer experience that I really want exposed to the model I think then you will see these things pop up but again that's a there's a little bit of a chicken and egg problem at the moment was like what a client support versus you know what service like all this when I do yeah that that was kind of my next question on composability like how do you guys see that the plans for that what's kind of like the import of MCPs so to speak and so another MCP like if I want to build like the subreddit one there's probably gonna be like the reddit API MCP and then the summarization MCP and then how do I how do I do a super MCP yeah so this is an interesting topic and I think there so there are there two aspects I think the one aspect is like how can I build something a genetically that you requires an LLM call and like a one form of fashion like for summarization or so but I'm staying model independent and for that that's where like part of this by directionality comes in and this marriage experience where we do have this facility for servers to ask the client again who owns the LLM interaction right like we talk about cursor like runs the loop with the LLM for you there that for the server author to ask the client for a completion and basically have it like summarize something for the server and return it back and so now what model summarizes this depends on which one you have selected in cursor and not depends on what the author brings the author doesn't bring an SDK doesn't have you had an API key it's completely model independent how you can build this this is one aspect of that the second aspect to building richer original systems with MCP is that you can easily envision MCP server that serve something to like something like cursor or Windsor for a cloud desktop but at the same time also is an MCP client at the same time and itself can use MCP servers to create a rich experience and now you have a recursive property which we actually quite carefully into design principles try to retain you know you see it all over the place and authorization other aspects to the spec that retain this like recursive pattern and now you can think about like okay I have this little bundle of applications both the server and a client and I can add these in chains and build basically graphs like DAGs out of MCP servers that can just richly interact with each other a gigantic MCP server can also use the whole ecosystem of MCP servers available to themselves and I think that's a really cool environment cool thing you can do and people have experimented with this and I think you see hopefully more of this particularly when you think about like auto selecting auto installing there's a bunch of these things you can do that make and make a really fun experience I think practically there are some nice things we still need to add to the SDKs to make this really simple and like easy to execute on like this kind of recursive MCP server that is also a client or like kind of multiplexing together the behaviors of multiple MCP servers into one host as we call it these are things we definitely want to add we haven't been able to yet but like I think that would go some way to showcasing these things that we know are already possible but not necessarily picking up that much yet okay this is very exciting and very I'm sure I'm sure a lot of people get very very a lot of ideas and inspiration from this is an MCP server that is also a client is that an agent what's an agent there's a lot of definitions of agent because like in some ways you're requesting something and it's going off and doing stuff that you don't necessarily know it's like a layer of instruction between you and the ultimate raw source of the data you could spew that I just I don't know if you have a hot take on agents I do think I do think that you can build an agent that way for me I think you need to define the difference between an MCP server plus client that is just a proxy versus an agent I think there's a difference and I think that difference might be in you know for example using a sample loop to create a more rich experience to to to you have a model call tools while like inside that MCP server through these clients I think then you have a an actual like agent yeah I do think it's very simple to build agents that way yeah I think there are maybe a few paths here like it definitely feels like there's some relationship between MCP and agents one possible version is like maybe MCP is a great way to represent agents maybe there are some like you know features or specific things that are missing that would make the ergonomics of it better and we should make that part of MCP that's one possibility another is like maybe MCP makes sense it's kind of like a foundational communication layer for agents to like compose with other agents or something like that or there could be other possibilities entirely maybe MCP should specialize and narrowly focus on kind of the AI application side and not as much on the agent side I think it's a very live question and I think there are sort of trade-offs in every direction going back to the analogy of the god box I think one thing that we have to be very careful about in designing a protocol and kind of curating or shepherding an ecosystem is like trying to do too much I think it's it's a very big yeah with you know you don't want a protocol that tries to do absolutely everything under the Sun because then it'll be bad at everything too and so I think the key question which is still unresolved is like to what degree are agents really naturally sitting into this existing model and paradigm or to what degree is it basically just like orthogonal it should be something I think once you enable two-way and once you enable client server to be the same and delegation of work to another MCP server it's definitely more agentic than not but I appreciate that you keep in mind simplicity and not trying to solve every problem on this and cool I'm having to move on there I mean I'm gonna double click on a couple of things that I marked out because they coincide with things that we wanted to ask you anyway so the first one is it's just a simple how many MCP things can one implementation support you know so this is the the sort of wide versus deep question and this this is direct relevance to the nesting of MCPs that we just talked about in April 2024 when Claude was launching one of its first context the first million token context example they said you can support 250 tools and you know so to me that's wide in the sense that you you don't have tools that call tools you just have the model and a flat hierarchy of tools but then obviously you have tool confusion it's going to happen when tools are adjacent you call the wrong tool you're gonna get the bad result right do you have a recommendation of like a maximum number of MCP servers that are enabled at any given time I think to be honest like I think there's not one answer to this because to some extent it depends on the model that you're using to some extent it depends on like how well the tools are named and described for the model and stuff like that to avoid confusion I mean I think that the dream is certainly like you just furnish all this information to the LLM and it can make sense of everything this kind of goes back to like the future we envision with MCP is like all this information is just brought to the model and it decides what to do with it but today the reality or the practicalities might mean that like yeah maybe you maybe in your client application like the AI application you do some filtering over the toolset or like maybe you run like a faster smaller LLM to like filter to what's most relevant and then only pass those tools to the bigger model or you could use an MCP server which is a proxy to other MCP servers and does some filtering at that level or something like that I think hundreds as you referenced is still a fairly safe bet at least for Clauda can't speak to the other models but yeah I don't know I think over time we should just expect this to get better so we're wary of like constraining anything and preventing that sort of long yeah it highly it highly depends on the overlap of the description right like if you if you have like very separate servers that do very separate things and the tools have very clear unique names very clear well-written descriptions you know your mileage might be more higher than if you have a GitLab and a GitHub server at the same time in your context and and then the overlap is quite significant because they look very similar to the model and confusion becomes easier there's different considerations too depending on the AI application if you're if you're trying to build something very agentic maybe you are trying to minimize the amount of times you need to go back to the user with a question or you know minimize the amount of like configurability in your interface or something but if you're building other applications you're building an IDE or you're building a chat application or whatever like I think it's totally reasonable to have affordances that allow the user to say like at this moment I want this feature set or at this different moment I want this different feature set or something like that and maybe not treated as like always on the full list always on all the time yeah that's where I think the concepts of resources and tools get to blend a little bit right because now you're saying you want some degree of user control right or application control and other times you want the model to control it right so now we're choosing just subsets of tools I don't know yeah I think it's a fair point or a fair concern I guess the way I think about this is still like at the end of the day and this is a core mcp design principle is like ultimately the client application and by extension the user ultimately they should be in full control of absolutely everything that's happening via mcp when we say that tools are model controlled what we really mean is like tools should only be invoked by the model like there really shouldn't be an application interaction or user interaction where it's like okay as a user I now want you to use this tool I mean occasionally you might do that for prompting reasons but like I think that shouldn't be like a UI affordance but I think the client application or the user deciding to like filter out things that mcp servers are offering totally reasonable or even like transform them like you could imagine a client application that takes tool descriptions from an mcp server and like enriches that makes them better we really want the client applications to have full control in the mcp paradigm that in addition though like I think there one thing that that's very very early my thinking is there might be addition to the protocol where you want to give the server off of the ability to like logically group certain primitives together potentially to inform that because they might know some of these logical groupings better and that could like encompasses prompts resources and tools at the same time I mean personally we can have a design discussion on there I mean personally my take would be that those should be separate mcp servers and then the user should be able to compose them together but we can figure it out is there gonna be like a mcp standard library so to speak of like hey these are like the canonical servers do not build this we're just gonna take care of those and those can be maybe the building blocks that people can compose or do you expect people to just rebuild their own mcp servers for like a lot of things I think we will not be the prescriptive in that sense I think there will be inherently you know there's a lot of power well let me rephrase it like I have a long history in an open source and I feel the bizarre approach to this problem is is somewhat useful right and I think so that the best and most interesting option wins and I don't think we want to be very prescriptive I do will definitely foresee and this already exists that they would like 25 github servers and like 25 you know posh servers and whatnot and that's all cool and that's good and and I think they all add in their own way but effectively eventually over months or years ecosystem will converge to like a set of very widely used ones who basically I don't know if you call it winning but like that will be the most used ones and I think that's completely fine because be prescriptive about this I don't think it's any useful any use I do think of course that there will be like mcp servers and you see them already that are driven by companies for their products and of you know they will inherently be probably the canonical implementation like if you want to work with cloudflow workers and use an mcp server for that you'll probably want to use the one developed by cloudfare yeah I think there's maybe a related thing here too just about like one big thing worth thinking about we don't have any like solutions completely ready to go is this question of like trust or like you know vetting is maybe a better word like how do you determine which mcp servers are like the kind of good and safe ones to use regardless of if there are an implementations of github mcp servers that could be totally fine but you want to make sure that you're not using ones that are really like sus right and so trying to think about like how to kind of endow reputation or like you know if hypothetically anthropic is like we've vetted this it meets our criteria for for secure coding or something how can that be reflected in kind of this open model where everyone in the ecosystem can benefit don't really know the answer yet but that's very much talking about but I think that's like a great design choice of mcp switches like language agnostic like already and there's not to my knowledge a anthropic official ruby stk nor an open AI stk and alex rudal that are great that's a great job building those but now what mcp is like you don't actually have to translate an stk to all these languages you just do one one interface and kind of bless that interface as as anthropic so yeah that was that was nice I have a quick answer to this thing so like obviously there's like five or six different registries already popped up you guys announced your official registry that's gone the way and a registry is very tempting to offer download counts likes reviews and some kind of trust thing I think it's kind of brittle like no matter what kind of social proof or other thing you can you can offer the next update can compromise a trusted package and actually that's the one that does the most damage right so abusing the trust system is like setting up a trust system creates the damage from the trust system and and so I actually want to encourage people to try out mcp inspector because all you got to do is like actually just look at the traffic and like I think that's that goes for a lot of security issues yeah absolutely I think it's like this very classic just supply chain problem that like all registries effectively have and the you know there are different approaches to this problem like you can take the app or approach like that things and like have a like an army of of those on the system and review teams to do this and then you effectively build an app store right that's that's one approach of this type of problem it kind of works in you know in a very certain set of ways but I don't think it works in an open source kind of ecosystem for which you always have a registry kind of approach like similar to npm and packages and and pied by and they all have inherently these like these these supply chain attack problems right yeah yeah totally quick time check I think we're gonna go for another like 20 25 minutes is that okay for you guys okay awesome cool I wanted to double click take the time so I'm gonna sort of we previewed a little bit on like the future coming stuff so I want to leave the future coming stuff to the end like registry the the stateless servers and remote servers all the other stuff but I wanted to double click a little bit more on the launch the core servers that are part of the official repo and some of them are special ones like the like the ones we already talked about so let me just pull them up already so for example you mentioned memory you mentioned sequential thinking I think I really really encourage people should look at these what I call special servers like they're not normal servers in the in the sense that they they wrap some API and it's just easier to interact with those than to work with the API's and so I'll highlight the memory one first just because like I think there are there are a few memory startups but actually you don't need them if you just use this one it's also like 200 lines of code it's super simple and obviously then if you need to scale it up you should probably do some some more battle tested thing but if you're interested if you're just introducing memory I think this is a really good implementation I don't know if there's like special stories that you want to highlight with with some of these I think no I don't think there's a particular special I think a lot of these not all of them but a lot of them originated from that hackathon that I mentioned before where folks got excited about the idea of MCP people internally inside anthropic who wanted to have memory or like wanted to play around with the idea could quickly now prototype something using MCP in a way that wasn't possible before someone who's not like you know you don't have to become the the end-to-end expert you don't have access you don't have to have access to this like private you know proprietary code base you can just now extend quad with this memory capability so that's how a lot of these came about and then also just thinking about like you know what is the breadth of functionality that we want to demonstrate at launch totally and I think that is partially why it made your launch successful because you launch with a sufficiently spanning set of here's examples and then people just copy paste and expand from there I would also highlight the file system MCP server only because it has edit file and basically I think people were very excited when we had Eric who built your sort of sweet bench projects on the podcast as well and people were very interested in this sort of like file editing tool that is basically open source via this project and I think a lot of there's some libraries out there there's some other implementations that like you know this is core IP for them and now it's just you guys just put it out there it's just really cool yeah I really I mean honestly the file system server is one of my favorites because I think it really speaks to like a limitation that I was feeling you know I was like hacking on a game as a side project and really wanted to connect it to like cloud and artifacts like David talked about before just giving Claude or like suddenly being able to give Claude the ability to like actually interact with my local machine was huge I really love that sort of capability yeah I mean this is this is the classic example of like this server directly comes out of the frustration that both created MCP and that server there was a very clear direct path of like here's the frustration we're currently having to MCP plus this server that that we both failed and Justin in particular so that regard is close to our heart as like as as a spiritual inception point of the protocol itself so okay and then I think the last thing I'll highlight is sequential thinking which already talked about this is this gives like branching which is kind of interesting it gives us sort of you know I need more need more space to write which is kind of super interesting and I think one thing I also wanted to clarify was anthropic this week well this past week put out a new engineering blog with a think tool and there's a bit of community confusion how sequential thinking overlaps with the thing tool I just think that is there's different teams doing similar things in different parts of the world but I just wanted to let you guys clarify I think I mean there's there's definitely like sorry let me start over as far as I know there is no common lineage between these two things but I think it just speaks to a larger thing that like there are many different strategies to get an L of them to be more thoughtful or hopelessly less or whatever it might be to kind of like express these different dimensions more fully or more reliably and I don't know I think this is like the power of MCP that like you could build different servers that do these different things or have like you know different prompts or different tools within the same server that do these different things and like ask the LLM to apply a particular like mental model or thinking pattern or whatever for different results so I don't know I think I I guess don't know that there will be like one ideal prescribed method like LLM here is how you should think all the time I think there will be different applications for different purposes and MCP allows you to do that right yeah I think in addition there's also like the the way that the approach of this that some of the MCP servers they're filling a gap that existed at the at a point in time that the models later catch up to by themselves because you know they have there's training time and preparation and least research that goes into making models to things natively so to speak and you can get a lot of mileage of something as simple as a sequential thinking tool like server is not simple but it's like it's doable within a few days which is definitely not the time frame you look at adding thinking to a model natively I guess to come up with an example on the fly like I could imagine building you know if I'm working with a model that is not particularly reliable or you know maybe someone considers the generation today overall not particularly reliable like I could imagine building an MCP server that gives me like best of three you know tries like three times to answer a query with the model and then picks the best one or something like that like you could get this kind of like recursive and composable LLM interactions with MCP awesome okay cool I think so you know we sorry thanks for indulging on on like some of the the service we just wanted to double click on these I think we have time for just like future roadmap things people were most excited about this recent update moving from stateful to stateless servers you guys picked SSE as your sort of launch protocol and transport transport and obviously transport is pluggable the behind the scenes of that like was it Jared Palmer's tweet that caused it or were you already working on it no we have we have github discussions going back like you know in public going back months really talking about this this dilemma and that the trade-offs involved you know we do believe that like the future of AI applications and ecosystem and agents all of these things I think will be stateful or will be more in the direction of statefulness so we had a lot of I think honestly this is one of the most contentious topics we've discussed as like the core MCP team and like gone through multiple iterations on and back and forth but ultimately just came back to this conclusion that like if the future looks more stateful we don't want to move away from that paradigm completely now we have to balance that against it's it's been operationally complex or like it's hard to deploy an MCP server if it requires this like long-lived persistent connection this this is the original like SSE transport design is basically you deploy an MCP server and then a client can come in and connect and then basically you should remain connected indefinitely which is that's like a tall order for anyone operating at scale it's just like not a deployment or operational model you really want to support so we were trying to think like how can we balance the belief that statefulness is important with sort of simpler operation and maintenance and stuff like that and the news sort of we're calling it the streamable HTTP transport that we came up with still has SSE in there but it has a more like a gradual approach where like a server could be just plain HTTP like you know have one endpoint that you send HTTP post to and then you know get a result back but then you can like gradually enhance it with like okay now I want the results to be streaming or like now I want the server to be able to issue its own requests and as long as the server and client both support the ability to like resume sessions like you know to disconnect and come back later and pick up where you left off then you get kind of the best of both worlds where it can still be this stateful attraction and stateful server but allows you to like horizontally scale more easily or like deal with spotty network connections or whatever the case may be.
Yeah, yeah and you had a as you mentioned session ID how do you think about auth going forward for some MCPs I just need to like paste my API key in the command is there kind of like a yeah what do you see as the feature of that is there going to be like the .M equivalent of like a for MCPs or yeah we do have we do have authorization as a specification in the current draft of the next supervision of the protocol it's mostly at the moment focused on user to server authorization using like OAuth 2.1 or like you know a subset of modern OAuth basically and I think that has that has seems to be working well for people and people building on top of that and that will solve a lot of these issues because you don't really want to have people bring API keys in particular when you have like when you think about a world which which I truly believe will happen where the majority of servers will be remote servers so you need some sort of authorization with that server now for the local case because the authorization is defined on on the transport layer and so requires framing which means like headers effectively this does not work in Sanio but in Sanio you run locally and you can do whatever you want anyway and you might just pop open a browser and deal with it that way and then there's also like some thinking that is somewhat somewhat not fully decided on about you know even using HTTP locally which would solve that problem and Justin is laughing because he's very much in favor of this where I'm very much not in favor of this so there's some debate going on there but like authorization I think you know we have something I think it's like it's as everything in the protocol is like fairly minimal likes trying to solve a very practical problem in tries to be very minimal in what it what it does and then we go from there and add based on practical pain points people have on top of the protocol and don't try to over design it from the beginning so we'll just see how far our current aspect gets us basically yeah I want to build on that a bit because I think that last point is really important and like you know when when you're designing a protocol you have to be extremely conservative because if you make a mistake you basically can't undo that mistake or you break backwards compatibility so it's far easier to like only accept things or like only add things that you're extremely certain about and let people kind of do ad hoc extensions until maybe there's more like consensus that something is worth with adding to the core thing and like supporting indefinitely going forward and with all that particular and this example of API keys I think this is really illustrative because we we did a lot of this sort of like brainstorming like okay if I have this use case could I accomplish that with this version of off and I think the answer is yes for like the API key example like you can have an MCB server which is an OAuth authorization server and at the like slash authorize web page it just has like a text box for you to put in an API key like that would be a totally valid OAuth flow for the MCB server maybe not the most ergonomic or not what people would ideally like but because it does fit into the existing paradigm and as possible today we're worried about like adding too much other too many other options that both then clients and servers need to think about yeah have you guys gave scopes any thought if it's like we had an episode with Dermash Shaw yesterday from agent A.A.
and HubSpot and he was given the example of like email like he has all of his emails and you know he would like to have more granular scope for hey you can only access these types of emails like emails to this person today most scopes are like rest driven basically it's like what endpoints can you access do you see a future in which the model kind of access like the scope layer so to speak and kind of dynamically limits the data that passes through I think the I think there is a potential need for scopes that goes back to like we have discussions around this but what we're currently trying to do is just like rooting them in very specific example and like have a good set of like these are actual problems that you cannot currently solve with the current implementations and that's like the bar we set to add to the protocol and I think that and then you know based on that prototype using that extensibility that we have at the moment where every structure that's returned is extensible and then build on top of that and prove that you that this will have a good user experience and then we put it to the protocol that's usually been for the most part the case it's actually been not quite the case for authorization in general that's been a bit more top down but I can't totally see why people want it it's just a matter of like showcasing the specific examples and like what the what the potential solutions would be so that we don't accidentally run into this like yeah the soft this into this approach where like it sounds roughly right and we put it in and it was actually not really right and now you're back to this like adding it's easy removing as hard in protocol design and so we're just a little bit we're just a little bit you know careful around the so to speak that being said you know every time I hear it like in the rough description it makes sense I would love to have a very practical and trend user example of this and where it falls apart a current implementation then we can have a discussion there's a little bit of weariness from my perspective to maybe not with scope specifically I think those could make a lot of sense as long as we have the the use cases in mind but I do think you know thinking about composability and logical groupings of things I think it does often make sense for mcp servers to be quite small things and if you want lots of collection for functionality for those to be discrete servers that you kind of combine together as a user or in the application layer and so some of the pushback about off has been like well if I need to authorize with like 20 different things on the other side how can I do that it's like well maybe maybe that's not what the server should be doing and maybe it shouldn't be connecting to 20 different things maybe those should be separate servers that combine up some awesome lots of discussion there where should people go if they want to get involved in these debates is it just the specification repo discussion page that is that's a good start I want to caveat it slightly that on the internet it's very easy to to be part of a discussion and having an opinion without then actually doing the work and so I think there we were both chance and I very old school open source people that like it's it's marriage driven in the sense that if you have done work and if you if you showcase this with like practical examples and work in SDKs towards the X extensions you want to make you have a good chance that it gets in if you're just there to have an opinion you're very likely just being ignored to be frank because there's a limit to how much discussion points we can read of course we value the discussion and we want to have the discussion but we also need to manage our time and our engagement and we obviously select for the people who are doing the most work we're trying to figure out you know that honestly like I think even compared to open source work out done in the past just this year volume of conversation and notifications are an MCP stuff is extraordinary which is great on one hand but I think we do need to figure out more scalable structures to both engage with the community but also keep you know conversations high signal and like effective and I guess there's something else to be aware of related to David's point is like I do believe that a big part of running a successful open source project is sometimes making hard decisions that people will be unhappy about and you kind of just have to like you know learn to like figure out like what are the things what is like the actual vision for the project where do we as the kind of like maintainers or like shepherds or whatever believes that it's going and just commit to that and like understand that some people won't agree with that vision and that's totally fine but then maybe maybe there will be other projects that are more in line with what they're hoping for something like that.
I think that's a very interesting and quite good point is like they're like a like a product like MCP is an entry into like into a solution space of the problems in that in the general like space and it's it is one of many entries in a way and and if you do not like the direction you know that we and like people that are very close you know in the development of the protocol choose them and then we cannot then there's always place for more right that's the beauty of open search right the good old you know forget approach.
We do always want to hear the feedback and we need to make it scalable I think but also just recognition that like sometimes we will be going with our intuition about what is the right choice there might be a lot of like flame in the open source discussions about it but that's just the nature of projects like this sometimes.
Yeah fortunately neither of you are new to that I would also say there's a lot of history to be drawn from Facebook open source right and both of you if you weren't directly involved you know everyone who was directly involved I would say React we eventually started because I was obviously deeply part of the React ecosystem we eventually started working groups where it was it was open it was conducted on in discussions and each member of the working group had a voice that represented a significant part of the community but also showed that they did the work they had a significance they weren't sort of drive by people with with no skin in the game and I think that was helpful for a while I'm not sure it's like an actively managed thing because it reacts on issues with the multi-company situation that they're in the other thing that actually is to me is more interesting is GraphQL because MCP where like currently has the gripe the hype that GraphQL had and I lived through that one and eventually you know Facebook donated GraphQL to an open source foundation and I think that there's a question of like do we want to do that this trade-offs right it's not a clear yes or no I would say that most people are happy with anthropic and you guys obviously because you created it you guys being the stewards but at some point at some scale you're gonna hit some ceiling there where you're like okay like you know this is owned by one company and you know eventually people want to like you know the truly open standard is a nonprofit there's multiple stakeholders is a good governance process all of which is governed by like Linux foundation Apache whatever so I want to ask like any thoughts there I personally would say swirly you know like what are your thoughts yeah I think governance in generalism yes as your super interesting problem in an open source space I think there are two things and one is I we really we really really want to make this and have this be an open standard and open protocol and open project with you know participation from everyone who wants to be part of the partake and I think that actually is working quite well so far if you look at the pull request if you look for example a lot of the inputs on the streamable HTTP thing came from from companies like you know Shopify and others that had discussed and worked on this and and brought proposals to the table and I think that works really well the thing that we are a bit wary about is any type of official standardization particularly going through an actual standardization body or any type of like foundational work that starts having processes as part of this to stay to stay somewhat fair to everyone that can act process that in a fast moving field like AI can be detrimental to the project and that's what we worry about we worry about processes that are slowing us down and so we're trying to find this nice middle ground of like how can we have participation that we let that we luckily do have from everyone work towards everyone's you know everyone's like like problems that they have potentially with the governance model and figure the right path forward out without accidentally without accidentally slowing down the project I think that's what we're trying to do but yeah we genuinely we were very genuine in our desire to have this be an open project and like yes it's it was initiated by anthropic and David and I work at anthropic but like we don't want it to be seen as like this is an anthropic protocol I think it's very important for the whole ecosystem that this is something that like any AI lab could have a stake in or contribute to or make use of but yeah it's just it's it's balancing that against avoiding death by committee basically and so like I think there are a lot of models for doing this successfully an open source I think most of the delicacies are really around like you know sort of corporate sponsorship and corporate say and we'll kind of navigate that as it comes up but we absolutely want this to be like a community project that that being said I want to highlight this that at the moment as we speak there's plenty of people are not anthropic employees who have committed access and admin access who's the repository is right there you know some of the people from pedantic have committed access to the the python SDK because they did a lot of really good work there and we had a lot of contributions from blocking others to the to the specifications SDK is like the Java SDK and the C sharp SDK is they're completely done by different companies like the C sharp one is done by Microsoft it's a very recent edition last week and they do everything there they have full admin rights over that the same goes with JetBrains doing the Kotlin one and spring AI doing the Java one so it is actually if you really look at it it's it's already like a multi-company big project with everyone there was a lot of people beyond just us to having commit access to and rights to the to the project as is yeah awesome guys this was great just to wrap up do you have any mcp server wishlist what do you want people to build you that is not there or quiet quiet or server I want some I want more sampling clients that's all I want I want cool I want someone to build a client that assembling and as someone else that builds me a server that does summarize my reddit threats or summarize like I'm an old Eve online player summarize what happened in Eve online in the last week for me I wish that would someone would do that but for that I want a sampling client I want this model independent not because I wanted to use any other modeling cloth because cloth is by far the best but I just wanted to I just want to have a sampling client for the sake of having a sampling client just another do I'll echo that and just even broadly say like I think just more clients that support the full breadth of the spec would be amazing I mean we kind of design things so that things could be adopted incrementally anyway but like still it would be great if you know all these primitives that we put this this thought into do get manifested somehow that would be amazing but going back to you know some of my initial motivation for working on mcp and like excitement about the file system server you know like I like hacking on a game as a side project so I would really love to have an mcp client and or mcp server with like the Godot engine which I was using to build the game and just like have really easy like AI integration with that or like have you know Claude run and play test my game or something like Claude plays Pokemon who knows hey at least you have them already built to have taught already built your 3d model from now on with blender right but yeah I mean honestly even like shader code and stuff already I was just like this is not my wheel it's amazing what you can do when you enable builders yeah we're actually working on a call place Pokemon hackathon with David Hershey so to bring mcp into that I had no plans but if he wants to he can awesome guys well thank you for the time yeah keep up the good work thank you both this was fun yeah thank you really appreciate it cheers (upbeat music)