MCP Architecture: Protocol Layers, Session Lifecycle, and Message Flow

MCP Architecture

MCP architecture is easiest to understand as a layered system. A host owns the user and model experience. A client connection inside that host speaks the protocol. A server exposes capabilities and mediates access to underlying systems.

The protocol base is JSON-RPC 2.0, but the architectural value is higher-level: capability negotiation, lifecycle management, transport choice, and a clean separation between interface and execution.

A single host can maintain multiple MCP client connections at once, each connected to a focused server. That pattern is one of the reasons MCP scales well across domains like code, documentation, ticketing, and internal operations.

Protocol Lifecycle in One Pass

Client opens a transport connection to the server.
Initialization negotiates protocol version and declared capabilities.
Client discovers capabilities such as tools, resources, and prompts.
Normal operation begins: tool calls, resource reads, notifications, logging, and optional advanced features.
Connection shuts down gracefully or is recovered after transport failure.

Capability Request Path

Each layer has a distinct responsibility. Trace a failed call from left to right to find the owner of the correction.

User and Model Express intent inside the host experience
Host Selects a capability and applies approval policy
MCP Client Sends a negotiated JSON-RPC request
MCP Server Validates arguments and authorization
Backend Performs the permitted domain operation
Structured Result Returns content or a typed protocol error

Actors and Layers

The architecture page is the primary home for understanding how the protocol is assembled. The host is the application the user interacts with. The client is the protocol implementation within the host. The server is the process or service that describes capabilities and executes work safely.

This layered split is what keeps MCP from becoming just a bag of tool definitions. The host decides how and when to involve the model. The client manages session mechanics. The server decides what capability surface is actually safe and valid.

Host layer: UX, model orchestration, user approvals, policy, session state
Client layer: initialize, discover, invoke, receive notifications, track transport
Server layer: capability registration, validation, backend execution, result shaping
Backend layer: filesystems, APIs, databases, search engines, ticket systems, internal services

Session Initialization

Initialization is not a formality. It is the first contractual checkpoint of the session. The client and server agree on protocol version and declare which capabilities they support. If a feature is not negotiated, neither side should assume it exists.

This matters because the modern MCP ecosystem includes optional features such as roots, sampling, logging, prompt support, and resource templates. Production integrations stay robust by checking declared capabilities instead of assuming parity between different hosts and servers.

Initialization must happen before normal requests.
Protocol version mismatch should fail early and explicitly.
Capability checks prevent servers from requesting features the client never offered.
Change notifications let clients react when tool, prompt, or resource inventories evolve.

Initialization Followed by Capability Discovery

{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "initialize",
  "params": {
    "protocolVersion": "2025-11-25",
    "capabilities": {
      "roots": { "listChanged": true }
    },
    "clientInfo": {
      "name": "enterprise-assistant",
      "version": "2.4.0"
    }
  }
}

The client declares what it supports instead of silently assuming defaults.
Protocol version should match an actual supported revision.
Only after initialize succeeds should the client list tools, resources, or prompts.

Message Flow

During normal operation, the client sends requests such as tools/list, tools/call, resources/list, resources/read, or prompts/get. The server may also emit notifications for list changes or logging. That means MCP is not just a request-response API surface. It is a session with state and negotiated features.

For remote transports, session management matters even more. Modern Streamable HTTP deployments can run statefully or statelessly, and that choice affects resumability, routing, and horizontal scale.

Requests carry ids and expect responses.
Notifications do not expect responses.
Long-running operations should support cancellation and clear progress semantics.
Stateful remote sessions can carry session ids and resumability logic.

Multi-Server Hosts

Real hosts rarely talk to only one server. A coding assistant may need a repository server, a file server, an internal documentation server, and a ticketing server. The host can keep them separate and choose which one to involve based on user intent and policy.

That decomposition is healthy. A documentation team should not need to ship the same server process as a deployment automation team. Separate servers keep ownership, review, and blast radius smaller.

Server	Why It Exists	Typical Capabilities
Filesystem server	Expose current workspace context	read files, search files, list directories
Repository server	Work with version control systems	list PRs, read diffs, comment on issues
Docs server	Expose curated knowledge	search docs, read policies, fetch runbooks
Workflow server	Execute guarded business actions	create tickets, trigger workflows, post updates

Architecture Tradeoffs

A local stdio server is simple and low-latency because the host spawns a child process and speaks JSON-RPC over stdin and stdout. A remote server is operationally heavier but easier to centralize, monitor, and govern across many users.

The right architecture depends on ownership and trust. If the capability depends on local files, stdio is often natural. If it depends on shared enterprise systems and central identity, a remote deployment is usually cleaner.

Local servers are easier to iterate on during development.
Remote servers are easier to govern and audit centrally.
A mixed architecture is common in enterprise AI products.
Capability boundaries should follow ownership and data sensitivity, not just technical convenience.

Architecture Gate

Review MCP architecture as a sequence. First, the host knows which server configuration or endpoint to connect to. Second, the client opens a transport session. Third, initialization negotiates protocol version and capabilities. Fourth, the host discovers tools, resources, and prompts. Fifth, the host filters and presents capabilities. Sixth, calls execute with validation, authorization, and structured results.

Each step has its own failure mode. A server can fail to start, initialize with incompatible capabilities, omit an expected tool, expose a capability with a weak schema, reject authorization, return oversized data, or fail backend execution. Treating all of those as "MCP failed" makes debugging slow.

The architecture also separates ownership. The host owns user experience and overall orchestration. The client owns one protocol connection. The server owns capability definitions and backend adapters. The authorization system owns identity and token issuance for remote deployments. Backend services own final data permissions.

This layered view helps teams make safe design decisions. For example, a host may hide a discovered write tool from the model until a user enables it. A server may expose the same tool but still deny calls for users without object-level permission. A backend may reject an action even after the MCP layer approved it.

Trace architecture from transport connection through capability execution.
Debug failures at the layer where they occur.
Keep host policy, server validation, and backend authorization separate.
Use initialization and discovery metadata as contracts, not informal hints.

Draw the Session

Draw one MCP session as a sequence diagram: process or endpoint connection, initialize, capability discovery, host filtering, invocation, validation, backend call, response, and trace. Add failure points at each stage.

This diagram becomes a debugging tool. When a real integration fails, you can point to the exact stage instead of saying the server is broken. It also helps security reviewers see where authorization and consent belong.

Diagram the full message flow.
Mark discovery, invocation, and backend authorization separately.
Use the diagram during incident review.

Incident Diagram

A useful architecture diagram should show both success and failure paths, because most production MCP debugging is about finding which layer failed and who owns the fix.

Session Trace

Follow one session from transport connection through initialization, discovery, host filtering, invocation, backend execution, and shutdown. Mark the first layer that would explain each failure so architecture becomes a practical diagnostic map.

Initialization Contract

Initialization is the first protocol interaction. The client sends `initialize` with its latest supported protocol version, client identity, and client capabilities. The server returns its selected version, server identity, capabilities, and optional instructions. If the server selects a version the client cannot support, the client must disconnect rather than continuing with guessed message shapes.

After the response, the client sends `notifications/initialized`. Only then should normal discovery and invocation begin. Store the negotiated capability set with the connection, because support is session-specific. Seeing a method in documentation or an SDK does not mean this peer advertised it for this session.

Shutdown follows transport ownership. For stdio, the client normally closes input, waits briefly, and then terminates the process if needed. For HTTP, normal connection and session cleanup rules apply. Timeouts should cancel local waiting and, where supported, send protocol cancellation without assuming remote work has definitely stopped.

JSON-RPC request IDs correlate responses and may be processed concurrently after initialization. Notifications have no ID and receive no response. A client must therefore match by ID rather than arrival order, reject duplicate active IDs, and keep server-to-client requests distinct from client-to-server requests in traces.

Reject operation messages before initialization completes.
Persist the negotiated version and capabilities per client-server session.
Do not infer optional support from method names or SDK availability.
Record initialization and shutdown events in connection diagnostics.

Session Lifecycle Examples

Text Diagram: Host to Execution Path

User
  -> Host UI / Conversation Runtime
    -> MCP Client Session
      -> Transport (stdio or Streamable HTTP)
        -> MCP Server
          -> Validation and Authorization
            -> Backend System
              -> Result
            -> Structured MCP Response

This diagram is the backbone for the rest of the course.
Validation and authorization belong before backend execution, not after.

Before you move on

Architecture Review

4 checks

Separate host, client, server, and backend concerns.
Treat initialization and capability negotiation as real protocol gates.
Use multiple focused servers instead of one giant capability blob.
Pick stateful or stateless remote sessions based on deployment needs.

Architecture Questions

Yes. Many servers do, but each capability type should still have a clear job and boundary.

No. The transport changes how messages move, not the meaning of lifecycle, discovery, or capability invocation.

Previous Next

MCP Architecture: Protocol Layers, Session Lifecycle, and Message Flow