Cloud Engineer Lab
Cloud Engineer Lab
Cloud Engineer Lab
Cloud Engineer Lab
© 2026
Anthropic in 2026: Claude 4, Claude Code, MCP and Everything Enterprise Teams Need to Know
AI & InnovationIntermediate

Anthropic in 2026: Claude 4, Claude Code, MCP and Everything Enterprise Teams Need to Know

Anthropic is moving faster than any AI lab in history. Claude 4 models, Claude Code, Model Context Protocol, Extended Thinking, and Computer Use are reshaping how enterprises build AI automation. Here is everything that matters and how to use it.

17 min read
Share

When Anthropic launched Claude in 2023, it was positioned as a safer, more reliable alternative to OpenAI's GPT models. In 2026, Anthropic is something else entirely — a full AI platform company whose tools are now embedded in enterprise workflows, developer pipelines, and IT automation stacks across the globe.

This article covers every major Anthropic tool, model, and capability that enterprise IT teams, automation architects, and cloud engineers need to understand — and more importantly, how to start using them in production.


Who is Anthropic and Why Does It Matter?

Anthropic was founded in 2021 by Dario Amodei, Daniela Amodei, and several key OpenAI alumni with a specific thesis: building powerful AI systems and frontier safety research are not in conflict — they require each other.

That safety-first DNA shows up in everything Anthropic ships:

  • Constitutional AI — a training methodology that teaches models to evaluate and refine their own outputs against a set of principles
  • Responsible Scaling Policy (RSP) — a commitment to slow down or stop deployment if capability thresholds require additional safety work
  • Interpretability research — published work attempting to understand what is actually happening inside large language models

For enterprise buyers, this matters. Anthropic's models are consistently ranked among the most reliable and least likely to hallucinate or produce harmful outputs in structured enterprise tasks — a direct result of this research focus.


The Claude 4 Model Family

The Claude 4 generation represents the most significant capability leap in Anthropic's history. Three models, each purpose-built for a specific enterprise use case.

Claude Opus 4 — The Frontier Reasoner

Model ID: claude-opus-4-7

Opus 4 is Anthropic's most capable model — designed for tasks that require deep reasoning, nuanced judgment, and handling genuinely difficult problems. It excels at:

  • Complex multi-step analysis and strategic planning
  • Advanced code generation and architecture design
  • Research synthesis across long documents
  • Tasks that benefit from extended thinking (more on this below)

When to use Opus 4: when quality and depth matter more than speed or cost. Complex PowerShell automation, architecture reviews, policy analysis, and anything requiring expert-level judgment.

Claude Sonnet 4 — The Production Workhorse

Model ID: claude-sonnet-4-6

Sonnet 4 is the model most enterprise teams reach for first — it delivers near-Opus quality at significantly lower cost and latency. It is the right default for:

  • API integrations and enterprise automation workflows
  • Code generation, review, and debugging
  • Document processing and classification at scale
  • Chatbots and AI assistants in internal tools
  • Anything running at volume where cost per call matters

Claude Haiku 4 — The Speed and Scale Model

Model ID: claude-haiku-4-5-20251001

Haiku 4 is Anthropic's fastest, most cost-efficient model. Built for tasks that require real-time response and high throughput:

  • Real-time classification and routing
  • Inline code completion and suggestion
  • High-volume document tagging and extraction
  • Customer-facing applications with sub-second latency requirements

Model Comparison at a Glance

CapabilityHaiku 4Sonnet 4Opus 4
Complex reasoningGoodVery GoodBest
Code generationGoodExcellentExcellent
SpeedFastestFastModerate
CostLowestMediumHigher
Context window200K tokens200K tokens200K tokens
Extended thinkingNoYesYes
Best forScale & speedGeneral productionDeep analysis

Build a tiered model strategy

Do not send every request to Opus 4. Route simple classification and extraction tasks to Haiku, standard automation and coding to Sonnet, and reserve Opus for genuinely complex reasoning tasks. A tiered strategy can reduce your AI infrastructure cost by 60–80% with minimal quality loss.


Claude Code: AI-Native Software Engineering

Claude Code is Anthropic's agentic coding tool — and it is one of the most practically useful AI tools released in 2025/2026 for engineers who write code as part of their work.

What Claude Code Actually Is

Claude Code is not a chat interface with a code block. It is a CLI-based, agentic tool that:

  • Reads and understands your entire codebase in context
  • Edits files directly with your review
  • Runs commands, tests, and builds in your terminal
  • Commits code to git on your behalf
  • Works across your file system with the permissions you grant

It is available as:

  • CLI (npm install -g @anthropic-ai/claude-code) — runs in any terminal
  • Desktop app — macOS and Windows native client
  • Web app — claude.ai/code
  • IDE extensions — VS Code and JetBrains

What Claude Code Can Do in Practice

For IT and cloud engineers, the most immediately useful applications are:

PowerShell and script automation

powershell
# Instead of writing this from scratch, describe it to Claude Code:
# "Write a PowerShell script that queries all Intune managed devices,
#  finds devices that haven't checked in for more than 30 days,
#  exports a CSV report, and sends an email summary via Graph API"
# Claude Code will write, test, and iterate on this entire script for you.

Infrastructure as Code generation Describe your infrastructure requirements in plain English. Claude Code writes the Bicep, Terraform, or ARM template, validates the syntax, and iterates until it matches your specification.

Codebase understanding and documentation Point Claude Code at an existing repository — even one you did not write — and ask it to explain the architecture, identify security issues, or generate documentation. It reads everything and synthesises across files.

Automated code review Run Claude Code against a pull request diff and get a structured review covering logic errors, security vulnerabilities, performance issues, and style inconsistencies.

Claude Code and enterprise security

Claude Code operates with the file system permissions of the user running it. It does not exfiltrate your code to train future models — Anthropic has explicit commitments against this for API and Claude Code users. Review your organisation's AI tool policy before using it with sensitive codebases.


Model Context Protocol (MCP): The USB Standard for AI

Model Context Protocol — MCP — is arguably Anthropic's most strategically important contribution to the AI ecosystem in 2025/2026. It is an open standard that defines how AI models connect to external tools, data sources, and services.

The Problem MCP Solves

Before MCP, every AI tool integration was a custom, one-off implementation. If you wanted Claude to query your ServiceNow instance, you wrote custom API code. If you wanted it to read your SharePoint documents, more custom code. Each integration was brittle, non-portable, and required ongoing maintenance.

MCP defines a universal protocol — like USB for hardware — so that any MCP-compatible AI client can connect to any MCP-compatible server without custom integration work.

How MCP Works

text
┌─────────────────┐         MCP Protocol         ┌─────────────────────┐
│   AI Client     │ ◄──────────────────────────► │   MCP Server        │
│  (Claude Code,  │                               │  (GitHub, Postgres, │
│   Claude.ai,    │   Resources / Tools / Prompts │   ServiceNow, etc.) │
│   your app)     │                               │                     │
└─────────────────┘                               └─────────────────────┘

MCP servers expose three types of capabilities:

  • Resources — data the AI can read (files, database records, API responses)
  • Tools — actions the AI can take (create ticket, run query, send email)
  • Prompts — reusable prompt templates for specific workflows

MCP Servers Available Today

The ecosystem is growing rapidly. Key MCP servers with enterprise relevance:

MCP ServerWhat it exposes
GitHubRead repos, issues, PRs; create branches; commit code
PostgreSQL / SQLiteQuery databases in natural language
Google DriveRead and search documents, spreadsheets
SlackRead channels, send messages, search history
Brave Search / Web fetchReal-time web search and page reading
AzureQuery Azure resources, read logs, manage configurations
ServiceNowRead and create incidents, change requests
FilesystemRead, write, search local files

Building a Custom MCP Server

For enterprise IT teams, the most powerful application is building custom MCP servers that expose your internal systems to AI models. Here is a minimal Python example:

python
# Minimal MCP server exposing an Intune device query tool
# pip install mcp httpx
 
from mcp.server import Server
from mcp.server.stdio import stdio_server
from mcp.types import Tool, TextContent
import httpx
import json
 
server = Server("intune-mcp")
 
@server.list_tools()
async def list_tools() -> list[Tool]:
    return [
        Tool(
            name="get_noncompliant_devices",
            description="Returns a list of non-compliant Intune managed devices",
            inputSchema={
                "type": "object",
                "properties": {
                    "limit": {
                        "type": "integer",
                        "description": "Max number of devices to return",
                        "default": 50
                    }
                }
            }
        )
    ]
 
@server.call_tool()
async def call_tool(name: str, arguments: dict) -> list[TextContent]:
    if name == "get_noncompliant_devices":
        # Call Microsoft Graph API — replace with your access token logic
        headers = {"Authorization": f"Bearer {get_access_token()}"}
        response = httpx.get(
            "https://graph.microsoft.com/v1.0/deviceManagement/managedDevices"
            "?$filter=complianceState eq 'noncompliant'"
            f"&$top={arguments.get('limit', 50)}"
            "&$select=deviceName,userPrincipalName,complianceState,lastSyncDateTime",
            headers=headers
        )
        devices = response.json().get("value", [])
        return [TextContent(type="text", text=json.dumps(devices, indent=2))]
 
async def main():
    async with stdio_server() as (read_stream, write_stream):
        await server.run(read_stream, write_stream, server.create_initialization_options())
 
if __name__ == "__main__":
    import asyncio
    asyncio.run(main())

Once your MCP server is running, any MCP-compatible client (Claude Code, Claude Desktop, your own Claude-powered app) can query your Intune environment in natural language — no custom integration code required on the client side.

MCP is the right architecture for enterprise AI integrations

If your organisation is building multiple AI tools that need to access the same internal systems — ITSM, identity, endpoint management, monitoring — build MCP servers for those systems once, and all your AI tools benefit. This is far more scalable than point-to-point integrations.


Extended Thinking: Claude Reasons Before It Responds

Extended Thinking is a capability available on Claude Sonnet 4 and Opus 4 that allows the model to work through a problem step by step in a private reasoning process before producing its final response.

Why Extended Thinking Matters

Standard language models produce output token by token in a single pass. Extended Thinking gives Claude a "scratchpad" — a chain of reasoning that happens before the answer — similar to how a human expert thinks through a hard problem before speaking.

The result: dramatically better performance on tasks that require:

  • Multi-step logical reasoning
  • Complex code architecture decisions
  • Mathematical or statistical analysis
  • Policy interpretation and compliance assessment
  • Troubleshooting complex infrastructure issues

Using Extended Thinking via the API

python
import anthropic
 
client = anthropic.Anthropic()
 
# Enable extended thinking with a budget of 10,000 thinking tokens
response = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=16000,
    thinking={
        "type": "enabled",
        "budget_tokens": 10000  # How much thinking to allow — more = better for hard problems
    },
    messages=[{
        "role": "user",
        "content": """
        Our Azure tenant has 3,500 devices across 12 countries. We need to design
        a Conditional Access policy architecture that:
        1. Enforces MFA for all users accessing M365 apps
        2. Requires compliant devices for accessing sensitive SharePoint sites
        3. Blocks legacy authentication protocols entirely
        4. Allows break-glass accounts to bypass MFA with strong alerting
        5. Accommodates a manufacturing floor with shared kiosk devices
 
        Design the complete CA policy set, including policy names, conditions,
        grant controls, and the order of precedence. Flag any conflicts.
        """
    }]
)
 
# The thinking blocks show the reasoning process
for block in response.content:
    if block.type == "thinking":
        print("=== REASONING PROCESS ===")
        print(block.thinking[:500] + "...")  # Claude's internal reasoning
    elif block.type == "text":
        print("=== FINAL ANSWER ===")
        print(block.text)

For enterprise use cases like Conditional Access architecture, compliance gap analysis, or complex PowerShell script design, Extended Thinking consistently produces more reliable, better-reasoned outputs than standard mode.


Computer Use: Claude as an Autonomous Operator

Computer Use is Anthropic's capability that allows Claude to interact with a computer interface — seeing a screenshot, then taking actions like clicking, typing, scrolling, and navigating — to complete tasks autonomously.

What Computer Use Enables

With Computer Use, you can ask Claude to:

  • Navigate a web portal and extract data that has no API
  • Fill in forms across legacy enterprise applications
  • Perform repetitive GUI-based tasks that cannot be scripted via CLI
  • Test web applications by interacting with them as a user would
  • Operate any software with a graphical interface

A Practical Enterprise Use Case

python
import anthropic
import base64
from PIL import ImageGrab  # pip install Pillow
 
client = anthropic.Anthropic()
 
def take_screenshot() -> str:
    """Capture screen and return as base64 PNG"""
    screenshot = ImageGrab.grab()
    import io
    buffer = io.BytesIO()
    screenshot.save(buffer, format="PNG")
    return base64.b64encode(buffer.getvalue()).decode()
 
# Example: Ask Claude to navigate a legacy ITSM portal
response = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=4096,
    tools=[{
        "type": "computer_20241022",
        "name": "computer",
        "display_width_px": 1920,
        "display_height_px": 1080,
    }],
    messages=[{
        "role": "user",
        "content": [
            {
                "type": "image",
                "source": {
                    "type": "base64",
                    "media_type": "image/png",
                    "data": take_screenshot()
                }
            },
            {
                "type": "text",
                "text": "Navigate to the ServiceDesk portal, find all open P1 incidents created today, and export a summary as CSV."
            }
        ]
    }]
)

Computer Use requires careful scoping

Computer Use gives Claude the ability to click and type on your desktop. Always run it in an isolated virtual machine or container with minimal permissions. Never run Computer Use agents on a machine with access to production credentials or sensitive data without strict session boundaries.


The Claude API: Enterprise Features That Matter

For teams building on the Anthropic API directly, several capabilities significantly impact how you architect AI applications.

Prompt Caching — Slash Your API Costs

Prompt Caching allows you to cache frequently used portions of your prompts — system prompts, large knowledge bases, long documents — and pay a fraction of the cost on repeat calls.

python
import anthropic
 
client = anthropic.Anthropic()
 
# Cache a large system prompt or knowledge base
response = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=1024,
    system=[
        {
            "type": "text",
            "text": "You are an expert Microsoft Intune consultant...",
        },
        {
            "type": "text",
            "text": open("intune_knowledge_base.txt").read(),  # Large document
            "cache_control": {"type": "ephemeral"}  # Cache this block
        }
    ],
    messages=[{
        "role": "user",
        "content": "What compliance policies should I apply to BYOD Android devices?"
    }]
)
 
# Subsequent calls with the same cached content cost ~90% less for the cached portion
print(f"Input tokens: {response.usage.input_tokens}")
print(f"Cache read tokens: {response.usage.cache_read_input_tokens}")

Pricing impact: cache writes cost 25% more than base input tokens, but cache reads cost 90% less. For applications with a large, stable system prompt — internal knowledge bases, policy documentation, code context — the savings are substantial.

Tool Use (Function Calling)

Tool Use allows Claude to call functions you define, enabling structured integrations with external systems:

python
tools = [
    {
        "name": "get_device_compliance",
        "description": "Check compliance status of a device in Microsoft Intune",
        "input_schema": {
            "type": "object",
            "properties": {
                "device_name": {
                    "type": "string",
                    "description": "The hostname of the device to check"
                },
                "include_policy_detail": {
                    "type": "boolean",
                    "description": "Whether to include detailed policy failure reasons",
                    "default": False
                }
            },
            "required": ["device_name"]
        }
    },
    {
        "name": "create_support_ticket",
        "description": "Create a ServiceNow incident for a device issue",
        "input_schema": {
            "type": "object",
            "properties": {
                "title": {"type": "string"},
                "description": {"type": "string"},
                "priority": {"type": "string", "enum": ["P1", "P2", "P3", "P4"]},
                "assigned_group": {"type": "string"}
            },
            "required": ["title", "description", "priority"]
        }
    }
]
 
response = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=4096,
    tools=tools,
    messages=[{
        "role": "user",
        "content": "Check if LAPTOP-UK-042 is compliant and if not, raise a P3 ticket to the endpoint team."
    }]
)

Vision — Document and Image Understanding

All Claude 4 models natively understand images. For enterprise IT teams, this unlocks:

  • Screenshot-based troubleshooting — paste an error screenshot, get a diagnosis
  • Network diagram analysis — upload a Visio or draw.io export, ask architecture questions
  • Dashboard and report interpretation — feed in a Power BI screenshot, extract the key numbers
  • Handwritten form processing — extract structured data from scanned documents

Claude.ai Enterprise Features

For organisations using Claude.ai directly (not the API), several enterprise-tier features are relevant.

Projects

Projects allow teams to give Claude persistent context across multiple conversations:

  • Upload company documentation, runbooks, and wikis once — Claude retains them across all conversations in the project
  • Shared projects let teams collaborate with the same AI context
  • Use cases: IT knowledge base assistant, onboarding helper, policy advisor

Artifacts

Artifacts are self-contained outputs Claude can generate that are interactive or exportable:

  • React components rendered live in the browser
  • Interactive data visualisations
  • Runnable HTML/JavaScript tools
  • Exportable documents and reports

For IT teams: use Artifacts to generate interactive PowerShell reference guides, troubleshooting decision trees, or visualisations of your infrastructure state.


Anthropic's Safety Approach: Why It Matters for Enterprise

Anthropic's differentiation is not just capability — it is the combination of capability and safety research. For enterprise buyers, this translates to:

Constitutional AI in Practice

Every Claude model is trained with Constitutional AI — a process where the model evaluates and refines its own outputs against a set of principles covering helpfulness, harmlessness, and honesty. The practical result:

  • Lower hallucination rates on factual tasks compared to models without this training
  • More reliable refusal of genuinely harmful requests without over-refusing legitimate enterprise tasks
  • Consistent behaviour across languages and contexts

The Responsible Scaling Policy

Anthropic's RSP is a public commitment to pause or limit deployment if models reach capability thresholds that require additional safety work before wider release. For enterprise CISOs evaluating AI vendors:

  • Anthropic is the only major AI lab with a published, specific RSP
  • It signals a vendor that will prioritise safety over speed-to-market when the stakes are high
  • Relevant for regulated industries evaluating AI risk under NIS2, DORA, and GDPR

What Enterprises Should Do Right Now

Immediate actions (this month)

  1. Start a Claude API trial — Anthropic offers free credits; get your developers building with Sonnet 4 against your most common automation task
  2. Install Claude Code in your engineering team — let them use it for a sprint and measure productivity impact
  3. Identify your top 3 MCP server candidates — which internal systems would create the most value if AI could query them directly?

Build over the next quarter

  1. Design a tiered model strategy — map your use cases to Haiku, Sonnet, and Opus with cost and quality thresholds documented
  2. Build your first MCP server — start with read-only access to one internal system; expand from there
  3. Implement Prompt Caching for any production API integration with a large system prompt
  4. Evaluate Extended Thinking for your highest-value, highest-complexity automation tasks

Strategic planning (6–12 months)

  1. Develop an AI governance policy covering which data can flow through external AI APIs (Anthropic, OpenAI, etc.) and what classification level requires on-premises or sovereign deployment
  2. Map Claude capabilities to your IT automation backlog — prioritise the 10 highest-ROI automations
  3. Evaluate Claude for Enterprise if you have 50+ users — private deployment options, SSO, audit logging, and data privacy commitments

Closing Thoughts

Anthropic has moved from being a safety-focused research organisation that also builds products to being a production AI platform that enterprises genuinely depend on. The pace of releases — Claude 4 models, Claude Code, MCP, Extended Thinking, Computer Use — is accelerating, not slowing.

The organisations that will extract the most value are not those waiting for the tools to mature further. They are the ones building now: experimenting with the API, deploying Claude Code to engineering teams, and designing the MCP server architecture that will connect their enterprise systems to AI in a governed, scalable way.

Anthropic's bet is that safety and capability are complementary. Their track record so far suggests they are right — and for enterprise teams who need both, that matters enormously.

Follow this blog for hands-on implementation guides for Claude API integration in enterprise IT workflows, building MCP servers for Microsoft Graph and Intune, and using Claude Code for PowerShell automation at scale.

CChetan Yamger

Written by

Chetan Yamger

Cloud Engineer · AI Automation Architect · Modern Workplace Consultant

Cloud Engineer, AI Automation Architect, and Modern Workplace Consultant based in Amsterdam, Netherlands. Specializing in scalable, secure enterprise solutions with Microsoft Azure, Intune, PowerShell, and AI-driven automation using ChatGPT, Gemini, and modern LLM technologies.

Cloud & Modern WorkplaceMicrosoft Intune & MDMAzure & Microsoft 365AI AutomationPrompt EngineeringPowerShell & Graph APIWindows AutopilotConditional Access & Zero TrustSCCM / MECM & MSIXVDI / WVDPower BINode.js & Next.js
Newsletter

Stay in the loop.
New articles, straight to you.

Deep-dive technical articles on Intune, PowerShell, and AI — no noise, no spam.

New article notifications
No spam, ever
Free forever

Discussion

Share your thoughts — your email stays private

Leave a comment

0/2000

Your email is used to prevent spam and will never be displayed.