Showing posts with label MCP.

15.7.25

Anthropic Brings Canva into Claude: How MCP Integration Lets You Design by Chat

Anthropic has rolled out a new Canva plug-in for Claude that turns the popular design platform into a conversational workspace. Thanks to the Model Context Protocol (MCP), users can generate presentations, resize images, fill branded templates, or search and summarise Canva Docs without ever leaving the chat window.

How It Works

  1. Natural-language prompts — “Create a 10-slide pitch deck with a dark tech theme.”

  2. Claude translates the request into structured MCP calls.

  3. Canva’s MCP server executes the actions and streams results back as editable links.

  4. Users refine with follow-ups such as “Swap slide 3’s hero image for a blue gradient.”

Because MCP exposes Canva’s capabilities as schema-described tools and resources, Claude can also pull content from the design, for example summarising a 40-page brand guide or extracting colour codes for a new asset.
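
Under the hood, each of those conversational steps becomes a JSON-RPC request from the MCP client (here, Claude) to Canva’s MCP server. The minimal sketch below shows what a tools/call request could look like; the tool name, argument names and values are illustrative assumptions, not Canva’s published MCP schema.

    import json

    # Hypothetical MCP "tools/call" request a client might emit after the
    # prompt "Create a 10-slide pitch deck with a dark tech theme."
    # The tool and argument names are assumptions, not Canva's actual schema.
    request = {
        "jsonrpc": "2.0",
        "id": 1,
        "method": "tools/call",
        "params": {
            "name": "create_presentation",   # hypothetical tool name
            "arguments": {
                "title": "Pitch deck",
                "slide_count": 10,
                "theme": "dark tech",
            },
        },
    }

    print(json.dumps(request, indent=2))
    # A real server would return a result payload, e.g. a link to the
    # editable design, which the model then surfaces in the chat.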

What You Need

  • Claude subscription: $17 / month

  • Canva Pro or Teams: from $15 / month
    Link the two accounts once; thereafter, the bot can launch or tweak designs at will.

Why It Matters

  • Fewer tabs, faster flow: Designers and marketers iterate inside a single chat thread.

  • Multimodal productivity: Text + visual generation collapses into one agentic workflow.

  • Growing MCP ecosystem: Canva joins Microsoft, Figma, and others adopting the “USB-C of AI apps,” signalling a coming wave of tool-aware chatbots.

Early Use Cases

  • Rapid mock-ups: Marketing teams prototype social ads in seconds.

  • Live meeting edits: Change fonts or colours mid-presentation by typing a request.

  • Doc intelligence: Ask Claude to list key action items buried in a lengthy Canva Doc.

The Bigger Picture

Anthropic positions this launch as a template for future AI-centric productivity suites: instead of juggling APIs or iframed plug-ins, developers expose clean MCP endpoints and let large language models handle orchestration and chat UX. For users, that translates to creative work at conversation speed.


Claude’s Canva integration is live today for paid users, with additional MCP-powered tools, including Figma workflows, already listed in Anthropic’s new “Claude Integrations” directory.

8.7.25

Context Engineering in AI: Designing the Right Inputs for Smarter, Safer Large-Language Models

 

What Is Context Engineering?

In classic software, developers write deterministic code; in today’s AI systems, we compose contexts. Context engineering is the systematic craft of designing, organizing and manipulating every token fed into a large-language model (LLM) at inference time—instructions, examples, retrieved documents, API results, user profiles, safety policies, even intermediate chain-of-thought. Well-engineered context turns a general model into a domain expert; poor context produces hallucinations, leakage or policy violations. 


Core Techniques

  • Prompt Design & Templates: Give the model a clear role, task, format and constraints (system + user role prompts; XML / JSON schemas; function-calling specs).

  • Retrieval-Augmented Generation (RAG): Supply fresh, external knowledge just in time (vector search, hybrid BM25 + embeddings, GraphRAG).

  • Context Compression: Fit more signal into limited tokens (summarisation, saliency ranking, LLM-powered “short-former” rewriters).

  • Chunking & Windowing: Preserve locality in extra-long inputs (hierarchical windows, sliding attention, FlashMask / Ring Attention).

  • Scratchpads & CoT Scaffolds: Expose model reasoning for better accuracy and debuggability (self-consistency, tree-of-thought, DST / Directed Self-Testing).

  • Memory & Profiles: Personalise without retraining (vector memories, episodic caches, preference embeddings).

  • Tool / API Context: Let models call and interpret external systems (Model Context Protocol (MCP), JSON-schema function calls, structured tool output).

  • Policy & Guardrails: Enforce safety and brand style (content filters, regex validators, policy adapters, YAML instruction blocks).
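
To make a couple of these techniques concrete, here is a minimal, dependency-free sketch of retrieval-augmented prompt assembly: a toy keyword-overlap retriever stands in for BM25 or vector search, and a template injects the retrieved snippet alongside the role and constraints. The documents and wording are invented for illustration.

    import re

    # Toy RAG context assembly: keyword overlap stands in for BM25/vectors.
    DOCS = [
        "Refunds are available within 30 days of purchase with a receipt.",
        "Gift cards cannot be exchanged for cash except where required by law.",
        "Shipping to EU countries takes 3-5 business days.",
    ]

    def tokens(text: str) -> set[str]:
        return set(re.findall(r"[a-z0-9]+", text.lower()))

    def retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
        """Rank documents by naive keyword overlap with the query."""
        return sorted(docs, key=lambda d: len(tokens(query) & tokens(d)), reverse=True)[:k]

    def build_prompt(query: str) -> str:
        context = "\n".join(retrieve(query, DOCS))
        return (
            "You are a support assistant. Answer only from the context below; "
            "if the answer is not there, say you don't know.\n\n"
            f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
        )

    print(build_prompt("Are refunds available after purchase?"))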

Why It Matters

  1. Accuracy & Trust – Fact-filled, well-structured context slashes hallucination rates and citation errors.

  2. Privacy & Governance – Explicit control over what leaves the organisation or reaches the model helps meet GDPR, HIPAA and the EU AI Act.

  3. Cost Efficiency – Compressing or caching context can cut token bills by 50-80%.

  4. Scalability – Multi-step agent systems live or die by fast, machine-readable context routing; good design tames complexity.


High-Impact Use Cases

  • Customer Support: RAG surfaces the exact policy paragraph and recent ticket history, enabling a single prompt to draft compliant replies.

  • Coding Agents: Function calling plus repository retrieval feeds IDE paths, diffs and test logs, letting models patch bugs autonomously.

  • Healthcare Q&A: Context filters strip PHI before retrieval; clinically approved guidelines are injected to guide safe advice.

  • Legal Analysis: Long-context models read entire case bundles; chunk ranking highlights precedent sections for argument drafting.

  • Manufacturing IoT: Streaming sensor data is summarised every minute and appended to a rolling window for predictive-maintenance agents.

Designing a Context Pipeline: Four Practical Steps

  1. Map the Task Surface
    • What knowledge is static vs. dynamic?
    • Which external tools or databases are authoritative?

  2. Define Context Layers (assembled in the sketch after this list)
    • Base prompt: role, format, policy
    • Ephemeral layer: user query, tool results
    • Memory layer: user or session history
    • Safety layer: filters, refusal templates

  3. Choose Retrieval & Compression Strategies
    • Exact text (BM25) for short policies; dense vectors for semantic match
    • Summaries or selective quoting for large PDFs

  4. Instrument & Iterate
    • Log token mixes, latency, cost
    • A/B test different ordering, chunking, or reasoning scaffolds
    • Use self-reflection or eval suites (e.g., TruthfulQA-Context) to measure gains
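
As a concrete illustration of step 2, the sketch below assembles the four layers under a crude token budget. The layer contents, the priority order and the four-characters-per-token heuristic are simplifying assumptions, not a prescribed design.

    # Layered context assembly with a crude token budget (step 2 above).
    def approx_tokens(text: str) -> int:
        return max(1, len(text) // 4)  # rough heuristic, not a real tokenizer

    def assemble_context(layers: dict[str, str], budget_tokens: int = 1000) -> str:
        """Visit layers in priority order, skipping any that no longer fit."""
        priority = ["safety", "base", "ephemeral", "memory"]  # memory is dropped first
        kept, used = [], 0
        for name in priority:
            text = layers.get(name, "")
            cost = approx_tokens(text)
            if text and used + cost <= budget_tokens:
                kept.append(f"[{name}]\n{text}")
                used += cost
        return "\n\n".join(kept)

    print(assemble_context({
        "base": "You are a billing assistant. Respond in formal English, max 150 words.",
        "safety": "Never reveal card numbers. Refuse requests about other customers.",
        "memory": "The user previously asked about invoice INV-1042.",
        "ephemeral": "User query: Why was I charged twice this month?",
    }))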


Emerging Tools & Standards

  • MCP (Model Context Protocol) – an open, JSON-RPC-based standard for passing tool definitions, tool output and trace metadata to any LLM; adopted by Claude Code, Gemini CLI and IBM’s MCP Gateway.

  • Context-Aware Runtimes – vLLM, FlashInfer and Infinity Lite stream 128K-1M tokens with optimized KV caches.

  • Context Observability Dashboards – Startups like ContextHub show token-level diff, attribution and cost per layer.


The Road Ahead

As context windows expand to a million tokens and multi-agent systems proliferate, context engineering will sit alongside model training and fine-tuning as a first-class AI discipline. Teams that master it will ship assistants that feel domain-expert-smart, honest and cost-efficient—while everyone else will chase unpredictable black boxes.

Whether you’re building a retrieval chatbot, a self-healing codebase or an autonomous research agent, remember: the model is only as good as the context you feed it.

28.6.25

Google Launches Gemini CLI: An Open‑Source AI Agent for Your Terminal

 

💻 Gemini CLI Places AI Power in Developers’ Terminals

Google has unveiled Gemini CLI, a fully open-source AI agent that brings its latest Gemini 2.5 Pro model directly into developers’ terminals. Built for productivity and versatility, it supports tasks ranging from code generation to content creation, troubleshooting, research, and even image or video generation—all initiated via natural-language prompts.

🚀 Key Features & Capabilities

  • Powered by Gemini 2.5 Pro: Supports a massive 1 million-token context window, ideal for long-form conversations and deep codebases.

  • Multi-task Utility: Enables developers to write code, debug, generate documentation, manage tasks, conduct research, and create images/videos using Google’s Imagen and Veo tools.

  • MCP & Google Search Integration: Offers external context via web search and connects to developer tools using the Model Context Protocol.

  • Rich Extensibility: Fully open-source (Apache 2.0), enabling community contributions. Ships with MCP support, customizable prompts, and non-interactive scripting for automated workflows.

  • Generous Free Preview: A personal Google account grants 60 requests/minute and 1,000 requests/day, among the highest free rates available from any provider.

🔧 Seamless Setup & Integration

  • Installs easily on Windows, macOS, and Linux.

  • Requires only a Google account with a free Gemini Code Assist license.

  • Works in tandem with Gemini Code Assist for VS Code, providing a unified CLI and IDE experience.

  • Ideal for both interactive use and automation within scripts or CI/CD pipelines.
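
For the automation case, a CI step can shell out to the CLI in non-interactive mode. The sketch below assumes the gemini binary is on PATH and accepts a -p prompt flag for one-shot runs; check gemini --help in your installed version, as flag names may differ.

    import subprocess

    # Hedged sketch: calling Gemini CLI non-interactively from a CI script.
    # Assumes `gemini` is on PATH and supports a one-shot `-p` prompt flag;
    # verify the exact flags with `gemini --help`.
    result = subprocess.run(
        ["gemini", "-p", "Summarise the changes in the last git commit."],
        capture_output=True,
        text=True,
        check=False,
    )
    print(result.stdout or result.stderr)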


Why It Matters

  • Meets Developers Where They Work: Integrates AI directly into the CLI—developers' most familiar environment—without needing new interfaces.

  • Long-Context Reasoning: The 1M-token window enables handling large codebases, multi-file logic, and in-depth document analysis in one session.

  • Multimodal Power: Beyond code, it supports image and video generation—making it a fully-fledged creative tool.

  • Openness & Community: As open-source software, Gemini CLI invites global collaboration, transparency, and innovation. Google encourages contributions via its GitHub repo.

  • Competitive Edge: With elite token limits and flexibility, it positions itself as a strong alternative to existing tools like GitHub Copilot CLI and Anthropic’s Claude Code.


✅ Final Takeaway

Gemini CLI marks a generational leap for developer AI tools—offering open-source freedom, high context capacity, and multimodal capabilities from within the terminal. With generous usage, extensibility, and seamless integration with developer workflows, it emerges as a compelling entry point into AI-first development. For teams and individuals alike, it’s a powerful new way to harness Gemini at scale.

29.5.25

Mistral AI Launches Agents API to Simplify AI Agent Creation for Developers

 Mistral AI has unveiled its Agents API, a developer-centric platform designed to simplify the creation of autonomous AI agents. This launch represents a significant advancement in agentic AI, offering developers a structured and modular approach to building agents that can interact with external tools, data sources, and APIs.



Key Features of the Agents API

  1. Built-in Connectors:
    The Agents API provides out-of-the-box connectors, including:

    • Web Search: Enables agents to access up-to-date information from the web, enhancing their responses with current data.

    • Document Library: Allows agents to retrieve and utilize information from user-uploaded documents, supporting retrieval-augmented generation (RAG) tasks.

    • Code Execution: Facilitates the execution of code snippets, enabling agents to perform computations or run scripts as part of their workflow.

    • Image Generation: Empowers agents to create images based on textual prompts, expanding their multimodal capabilities.

  2. Model Context Protocol (MCP) Integration:
    The API supports MCP, an open standard that allows agents to seamlessly interact with external systems such as APIs, databases, and user data. This integration ensures that agents can access and process real-world context effectively.

  3. Persistent State Management:
    Agents built with the API can maintain state across multiple interactions, enabling more coherent and context-aware conversations.

  4. Agent Handoff Capability:
    The platform allows for the delegation of tasks between agents, facilitating complex workflows where different agents handle specific subtasks (see the conceptual sketch after this list).

  5. Support for Multiple Models:
    Developers can leverage various Mistral models, including Mistral Medium and Mistral Large, to power their agents, depending on the complexity and requirements of the tasks.
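
To illustrate how connectors and handoff fit together conceptually (see item 4 above), the sketch below wires two toy agents with stubbed connectors. It is deliberately not written against Mistral’s SDK; every class and function name is invented, so consult the official documentation for the real client calls.

    from dataclasses import dataclass, field
    from typing import Callable

    # Conceptual sketch only: NOT the Mistral Agents API. All names invented.
    def web_search(query: str) -> str:
        return f"[top web results for: {query}]"       # stub connector

    def code_execution(snippet: str) -> str:
        return f"[output of running: {snippet}]"       # stub connector

    @dataclass
    class Agent:
        name: str
        connectors: dict[str, Callable[[str], str]] = field(default_factory=dict)
        handoff_to: "Agent | None" = None

        def handle(self, task: str) -> str:
            for label, tool in self.connectors.items():
                if label in task:                      # crude routing by keyword
                    return f"{self.name} used {label}: {tool(task)}"
            if self.handoff_to:                        # delegate what we can't do
                return self.handoff_to.handle(task)
            return f"{self.name} answered directly: {task}"

    analyst = Agent("analyst", {"code_execution": code_execution})
    researcher = Agent("researcher", {"web_search": web_search}, handoff_to=analyst)

    print(researcher.handle("web_search latest EU AI Act guidance"))
    print(researcher.handle("code_execution sum([1, 2, 3])"))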

Performance and Benchmarking

In evaluations using the SimpleQA benchmark, agents utilizing the web search connector demonstrated significant improvements in accuracy. For instance, Mistral Large achieved a score of 75% with web search enabled, compared to 23% without it. Similarly, Mistral Medium scored 82.32% with web search, up from 22.08% without. (Source)

Developer Resources and Accessibility

Mistral provides comprehensive documentation and SDKs to assist developers in building and deploying agents. The platform includes cookbooks and examples for various use cases, such as GitHub integration, financial analysis, and customer support. (Docs)

The Agents API is currently available to developers, with Mistral encouraging feedback to further refine and enhance the platform.

Implications for AI Development

The introduction of the Agents API by Mistral AI signifies a move toward more accessible and modular AI development. By providing a platform that simplifies the integration of AI agents into various applications, Mistral empowers developers to create sophisticated, context-aware agents without extensive overhead. This democratization of agentic AI has the potential to accelerate innovation across industries, from customer service to data analysis.

26.5.25

The 3 Biggest Bombshells from Last Week’s AI Extravaganza

The week of May 23, 2025, marked a significant milestone in the AI industry, with major announcements from Microsoft, Anthropic, and Google during their respective developer conferences. These developments signal a transformative shift in AI capabilities and their applications.

1. Microsoft's Push for Interoperable AI Agents

At Microsoft Build, the company announced its adoption of the Model Context Protocol (MCP), a standard that lets AI agents communicate even when they are built on different large language models (LLMs). Originally developed by Anthropic in November 2024, MCP's integration into Microsoft's Azure AI Foundry enables developers to build AI agents that can seamlessly interact, paving the way for more cohesive and efficient AI-driven workflows.

2. Anthropic's Claude 4 Sets New Coding Benchmarks

Anthropic unveiled Claude 4, including its Opus and Sonnet variants, surprising the developer community with its enhanced coding capabilities. Notably, Claude 4 achieved a 72.5% score on the SWE-bench software engineering benchmark, surpassing OpenAI's o3 (69.1%) and Google's Gemini 2.5 Pro (63.2%). Its "extended thinking" mode allows for up to seven hours of continuous reasoning, utilizing tools like web search to tackle complex problems. 

3. Google's AI Mode Revolutionizes Search

During Google I/O, the company introduced AI Mode for its search engine, integrating the Gemini model more deeply into the search experience. Employing a "query fan-out technique," AI Mode decomposes user queries into multiple sub-queries, executes them in parallel, and synthesizes the results. Previously limited to Google Labs users, AI Mode is now being rolled out to a broader audience, potentially reshaping how users interact with search engines and impacting SEO strategies.
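
Google has not published the implementation, but the pattern described, splitting a query into sub-queries, running them concurrently and synthesising the results, can be sketched as follows; the decomposition rules and the stubbed search function are invented for illustration.

    import asyncio

    # Conceptual "query fan-out" sketch: decompose, run concurrently, synthesise.
    async def search(sub_query: str) -> str:
        await asyncio.sleep(0.1)                # stand-in for a real search call
        return f"result for '{sub_query}'"

    def decompose(query: str) -> list[str]:
        return [f"{query} definition", f"{query} recent news", f"{query} comparisons"]

    async def ai_mode(query: str) -> str:
        results = await asyncio.gather(*(search(q) for q in decompose(query)))
        return "Synthesised answer from: " + "; ".join(results)

    print(asyncio.run(ai_mode("agentic AI frameworks")))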

24.5.25

Microsoft's NLWeb: Empowering Enterprises to AI-Enable Their Websites

 Microsoft has introduced NLWeb, an open-source protocol designed to transform traditional websites into AI-powered platforms. Announced at the Build 2025 conference, NLWeb enables enterprises to embed conversational AI interfaces directly into their websites, facilitating natural language interactions and improving content discoverability.

Understanding NLWeb

NLWeb, short for Natural Language Web, is the brainchild of Ramanathan V. Guha, a pioneer known for co-creating RSS and Schema.org. The protocol builds upon existing web standards, allowing developers to integrate AI functionalities without overhauling their current infrastructure. By leveraging structured data formats like RSS and Schema.org, NLWeb facilitates seamless AI interactions with web content. 

Microsoft CTO Kevin Scott likens NLWeb to "HTML for the agentic web," emphasizing its role in enabling websites and APIs to function as agentic applications. Each NLWeb instance operates as a Model Context Protocol (MCP) server, providing a standardized method for AI systems to access and interpret web data.
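
NLWeb’s bet is that most sites already publish machine-readable structure. A product page, for instance, might carry Schema.org markup along these lines (shown here as a Python dict with invented field values), which an NLWeb endpoint can expose to AI agents without any redesign of the site itself.

    import json

    # Example of the kind of Schema.org structured data NLWeb builds on.
    # Field values are invented; the vocabulary (@context, @type, offers) is
    # standard Schema.org.
    product_markup = {
        "@context": "https://schema.org",
        "@type": "Product",
        "name": "Trailhead 2 Hiking Boot",
        "description": "Waterproof leather boot with a recycled sole.",
        "offers": {
            "@type": "Offer",
            "price": "129.00",
            "priceCurrency": "USD",
            "availability": "https://schema.org/InStock",
        },
    }

    print(json.dumps(product_markup, indent=2))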

Key Features and Advantages

  • Enhanced AI Interaction: NLWeb allows AI systems to better understand and navigate website content, reducing errors and improving user experience. 

  • Leveraging Existing Infrastructure: Enterprises can utilize their current structured data, minimizing the need for extensive redevelopment. 

  • Open-Source and Model-Agnostic: NLWeb is designed to be compatible with various AI models, promoting flexibility and broad adoption. 

  • Integration with MCP: Serving as the transport layer, MCP works in tandem with NLWeb to facilitate efficient AI-data interactions. 

Enterprise Adoption and Use Cases

Several organizations have already begun implementing NLWeb to enhance their digital platforms:

  • O’Reilly Media: CTO Andrew Odewahn highlights NLWeb's ability to utilize existing metadata for internal AI applications, streamlining information retrieval and decision-making processes. 

  • Tripadvisor and Shopify: These companies are exploring NLWeb to improve user engagement through AI-driven conversational interfaces. 

By adopting NLWeb, enterprises can offer users a more interactive experience, allowing for natural language queries and personalized content delivery.

Considerations for Implementation

While NLWeb presents numerous benefits, enterprises should consider the following:

  • Maturity of the Protocol: As NLWeb is still in its early stages, widespread adoption may take 2-3 years. Early adopters can influence its development and integration standards. 

  • Regulatory Compliance: Industries with strict regulations, such as healthcare and finance, should proceed cautiously, ensuring that AI integrations meet compliance requirements. 

  • Ecosystem Development: Successful implementation depends on the growth of supporting tools and community engagement to refine best practices. 

Conclusion

NLWeb represents a significant step toward democratizing AI capabilities across the web. By enabling enterprises to integrate conversational AI into their websites efficiently, NLWeb enhances user interaction and positions businesses at the forefront of digital innovation. As the protocol evolves, it holds the promise of reshaping how users interact with online content, making AI-driven experiences a standard component of web navigation.

14.5.25

MCP: The Emerging Standard for AI Interoperability in Enterprise Systems

In the evolving landscape of enterprise AI, the need for seamless interoperability between diverse AI agents and tools has become paramount. Enter the Model Context Protocol (MCP), introduced by Anthropic in November 2024. In just six months, MCP has garnered significant attention, positioning itself as a leading framework for AI interoperability across various platforms and organizations.

Understanding MCP's Role

MCP is designed to facilitate communication between AI agents built on different language models or frameworks. By providing a standardized protocol, MCP allows these agents to interact seamlessly, overcoming the challenges posed by proprietary systems and disparate data sources. 

This initiative aligns with other interoperability efforts like Google's Agent2Agent and Cisco's AGNTCY, all aiming to establish universal standards for AI communication. However, MCP's rapid adoption suggests it may lead the charge in becoming the de facto standard. 

Industry Adoption and Support

Several major companies have embraced MCP, either by setting up MCP servers or integrating the protocol into their systems. Notable adopters include OpenAI, MongoDB, Cloudflare, PayPal, Wix, and Amazon Web Services. These organizations recognize the importance of establishing infrastructure that supports interoperability, ensuring their AI agents can effectively communicate and collaborate across platforms. 

MCP vs. Traditional APIs

While APIs have long been the standard for connecting different software systems, they present limitations when it comes to AI agents requiring dynamic and granular access to data. MCP addresses these challenges by offering more control and specificity. Ben Flast, Director of Product at MongoDB, highlighted that MCP provides enhanced control and granularity, making it a powerful tool for organizations aiming to optimize their AI integrations. 
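
One concrete difference: a traditional API client is hard-coded against fixed endpoints, whereas an MCP client first asks a server what it offers via tools/list and receives JSON-schema descriptions of each tool that the model can reason over. The sketch below shows the shape of that exchange; the query_orders tool is invented for illustration.

    import json

    # Shape of MCP's discovery step: client sends "tools/list", server replies
    # with schema-described tools. The example tool below is invented.
    list_request = {"jsonrpc": "2.0", "id": 1, "method": "tools/list"}

    example_response = {
        "jsonrpc": "2.0",
        "id": 1,
        "result": {
            "tools": [
                {
                    "name": "query_orders",   # hypothetical tool
                    "description": "Look up recent orders for a customer.",
                    "inputSchema": {
                        "type": "object",
                        "properties": {"customer_id": {"type": "string"}},
                        "required": ["customer_id"],
                    },
                }
            ]
        },
    }

    print(json.dumps(list_request))
    print(json.dumps(example_response, indent=2))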

The Future of AI Interoperability

The rise of MCP signifies a broader shift towards standardized protocols in the AI industry. As AI agents become more prevalent and sophisticated, the demand for frameworks that ensure seamless communication and collaboration will only grow. MCP's early success and widespread adoption position it as a cornerstone in the future of enterprise AI interoperability.

10.5.25

Zencoder Introduces Zen Agents: Revolutionizing Team-Based AI in Software Development

 On May 9, 2025, Zencoder announced the launch of Zen Agents, a groundbreaking platform designed to transform software development by introducing collaborative AI tools tailored for team environments. Unlike traditional AI coding assistants that focus on individual productivity, Zen Agents emphasizes team-based workflows, enabling organizations to create, share, and deploy specialized AI agents across their development processes. 

Bridging the Collaboration Gap in Software Engineering

Andrew Filev, CEO and founder of Zencoder, highlighted the limitations of current AI tools that primarily cater to individual developers. He pointed out that in real-world scenarios, software development is inherently collaborative, and existing tools often overlook the complexities of team dynamics. Zen Agents addresses this gap by facilitating the creation of AI agents that can be customized for specific frameworks, workflows, or codebases, and shared across teams to ensure consistency and efficiency. 

Technical Innovation: Integration with Model Context Protocol (MCP)

A standout feature of Zen Agents is its implementation of the Model Context Protocol (MCP), a standard initiated by Anthropic and supported by OpenAI. MCP allows large language models to interact seamlessly with external tools, enhancing the capabilities of AI agents within the development lifecycle. To support this integration, Zencoder has introduced its own registry comprising over 100 MCP servers, facilitating a robust ecosystem for AI tool interaction. 

Open-Source Marketplace: Harnessing Collective Intelligence

Zen Agents features an open-source marketplace where developers can contribute and discover custom AI agents. This community-driven approach mirrors successful ecosystems like Visual Studio Code extensions and npm packages, allowing for rapid expansion of capabilities and fostering innovation. Early adopters have already developed agents that automate tasks such as code reviews, accessibility enhancements, and integration of design elements from tools like Figma directly into codebases. 

Enterprise-Ready with a Focus on Security and Compliance

Understanding the importance of security and compliance in enterprise environments, Zencoder has ensured that Zen Agents meets industry standards, boasting certifications like ISO 27001, SOC 2 Type II, and ISO 42001 for responsible AI management systems. These credentials position Zen Agents as a viable solution for organizations seeking to integrate AI into their development workflows without compromising on security. 

Flexible Pricing to Accommodate Diverse Needs

Zencoder offers a tiered pricing model for Zen Agents to cater to various user requirements:

  • Free Tier: Access to basic features suitable for individual developers or small teams.

  • $20/Month Plan: Enhanced capabilities for growing teams needing more advanced tools.

  • $40/Month Plan: Comprehensive features designed for larger organizations with complex development needs.

Looking Ahead: Enhancing Developer Productivity

Zencoder envisions Zen Agents evolving towards greater autonomy, aiming to amplify developer productivity by minimizing context-switching and streamlining workflows. By focusing on the collaborative aspects of software development, Zen Agents aspires to facilitate a "flow state" for developers, where AI agents handle routine tasks, allowing human developers to concentrate on creative and complex problem-solving.

8.5.25

Microsoft Embraces an Open Standard for Linking AI Agents: Why It Matters

In a landmark move for AI interoperability, Microsoft has adopted the Model Context Protocol (MCP), a rapidly emerging open standard designed to unify how AI agents interact across platforms and applications. The announcement reflects a growing industry consensus: the future of artificial intelligence lies not in isolated models, but in connected multi-agent ecosystems.


What Is MCP?

Originally developed by Anthropic and now backed by Google, Microsoft and others, the Model Context Protocol (MCP) is a lightweight, open framework that allows AI agents, tools, and APIs to communicate using a shared format. It provides a standardized method for passing context, status updates, and task progress between different AI systems, regardless of who built them.

MCP’s primary goals include:

  • 🧠 Agent-to-agent collaboration

  • 🔁 Stateful context sharing

  • 🧩 Cross-vendor model integration

  • 🔒 Secure agent execution pipelines


Why Microsoft’s Adoption Matters

By integrating MCP, Microsoft joins a growing alliance of tech giants, including Google, Anthropic, and NVIDIA, who are collectively shaping a more open and interoperable AI future.

This means that agentic systems built in Azure AI Studio or connected to Microsoft Copilot can now communicate more easily with tools and agents powered by Gemini, Claude, or open-source platforms.

"The real power of AI isn’t just what one model can do — it’s what many can do together."
— Anonymous industry analyst


Agentic AI Is Going Cross-Platform

As companies shift from isolated LLM tools to more autonomous AI agents, standardizing how these agents coordinate is becoming mission-critical. With the rise of agent frameworks like CrewAI, LangChain, and AutoGen, MCP provides the "glue" that connects diverse agents across different domains — like finance, operations, customer service, and software development.


A Step Toward an Open AI Stack

Microsoft’s alignment with Google on MCP suggests a broader industry pivot away from closed, siloed systems. It reflects growing recognition that no single company can dominate the agent economy — and that cooperation on protocol-level standards will unlock scale, efficiency, and innovation.


Final Thoughts

The adoption of MCP by Microsoft is more than just a technical choice — it’s a strategic endorsement of open AI ecosystems. As AI agents become more integrated into enterprise workflows and consumer apps, having a universal language for coordination could make or break the usability of next-gen tools.

With both Microsoft and Google now on board, MCP is poised to become the default operating standard for agentic AI at scale.
