I Built an AI Agent in 20 Minutes with MaxClaw – Step-by-Step Guide
Spinning up a serious AI agent used to mean wrestling with APIs, servers, configs, and a weekend of debugging. Now you can get a fully tooled-up assistant running in about 20 minutes – and even non-technical users can use it from their phone.
In this guide, we’ll walk through MaxClaw, MiniMax’s hosted version of OpenClaw, how it works, why their M2.7 model is getting attention, and how to launch practical agents for research, trend tracking, content, and coding.
What Is MaxClaw and Why Does It Matter?
MaxClaw is MiniMax’s hosted implementation of OpenClaw – an AI agent orchestration framework. Instead of manually wiring models, tools, memory, and schedules yourself, MaxClaw gives you a ready-to-run environment on MiniMax’s infrastructure.
Under the hood, OpenClaw is designed to let an AI agent:
• Use tools (web search, code execution, image/video generation, web scraping, etc.)
• Maintain memory and files over time
• Run tasks autonomously on a schedule (not just respond to prompts)
• Coordinate multiple skills and plugins
Normally, setting this up yourself means dealing with:
• API keys and model deployments
• MCP server configuration
• Tool integrations and rate limiting
• Memory storage and security
• VPS hosting and maintenance
MaxClaw skips all of that. You get:
• A hosted OpenClaw instance powered by MiniMax’s M2.7 model
• A full toolset out of the box (search, image, video, code, scraping, and more)
• Integrations for web, desktop, mobile, Telegram, Discord, Slack, and others
• Access to over 10,000 pre-built “experts” (specialized agent configurations)
The result: you can deploy a capable AI assistant that runs on its own, without touching infrastructure.
The M2.7 Model: Why This Agent Setup Works So Well
MaxClaw is built around MiniMax’s M2.7 model, which is getting a lot of attention for two reasons: how it was trained and how well it handles tools.
A Model That Helped Train Itself
Training large AI models requires a big “harness” around them: automated tests, validation suites, deployment pipelines, and more. For M2.7, MiniMax had the model itself iteratively improve that training harness.
The process looked like this:
• Start with a base model
• Let the model refine the training inputs and harness configuration
• Repeat this loop 100 times with no human intervention
Over these iterations, performance improved by more than 30%. The end result is a model that sits in the same performance tier as frontier models like Claude Opus or GPT – but at a significantly lower cost and with stronger tool-usage behavior.
Why Tool Calling Matters for Agents
Most general-purpose models start to struggle when you give them dozens of tools or skills to choose from. They misfire, ignore tools, or pick the wrong ones.
MiniMax tested M2.7 in heavy tool environments (20–40+ tools) and reported:
• 97% tool compliance for M2.7
• Around 74% for comparable models in the same setup
That reliability is a big deal for agent frameworks like OpenClaw, where a single agent might need to juggle web search, scraping, code execution, image generation, and more in one workflow.
MiniMax also plans to open-source the M2.7 weights, which makes it interesting even if you later decide to host your own stack.
How MaxClaw Works in Practice
MaxClaw is designed so that even non-technical users can run a personal AI assistant. One example: a non-technical user using it entirely from Telegram to handle day-to-day tasks like research, planning, content ideas, and motivation – with no setup on their side beyond adding the bot.
Behind the scenes, the MaxClaw instance stores:
• Files (generated images, documents, logs)
• Skills (what the agent can do)
• Identity and tone (how it should speak and behave)
• To-do lists and heartbeat files (what needs to run and when)
The owner can view and edit all of this from the MiniMax web platform, while the end user just chats with the bot on Telegram, Slack, or another channel.
Pricing: Credits vs Token Plan
MiniMax offers two main ways to pay for MaxClaw and related features.
Option 1: Agent Credits
You can buy a subscription that gives you a pool of credits to use on:
• MiniMax agents (including MaxClaw)
• Direct interactions with models and experts on the platform
Plans start around $19/month, with discounts for yearly billing. An extra bonus: you get 200 additional credits every day just for signing in, which can stretch the value quite a bit if your usage is moderate.
Option 2: The Token Plan (All-in-One)
The more flexible option is MiniMax’s token plan. This is an all-modality subscription where you pay a flat monthly fee and get a bundle of requests across:
• Base language model (like M2.7)
• Speech/voice models
• Image generation
• Halo video generation models
• High-speed tiers (depending on plan)
Key points:
• Plans start around $10/month for a starter tier
• You can use the same pool for MaxClaw, coding, image gen, video gen, and more
• You don’t need separate API keys and subscriptions for each modality
If you’re doing multi-modal work (text, images, video, voice) or running multiple agents, the token plan is usually the simpler and cheaper option compared to piecing together multiple vendors.
Setting Up Your Own MaxClaw Agent
You can deploy MaxClaw from the web app or via MiniMax’s desktop and mobile apps (Mac, Windows, iOS, Android). The basic flow on the web looks like this:
1. Choose How You’ll Pay
First, decide whether you’ll use:
• Agent credits, or
• A token plan (recommended if you want image/video/speech too)
If you go with the token plan, you’ll generate an API key from your plan page and paste it into the MaxClaw setup so your agent runs against that subscription.
2. Pick an Expert to Base Your Agent On
MiniMax has two levels of building blocks:
• Experts – full agent configurations with specific purposes (e.g., trend tracking, landing page building, research, trading).
• Skills – modular capabilities you can attach to any agent (e.g., icon maker, landing page builder, custom tools).
When creating a MaxClaw instance, you start by choosing an expert. Some examples include:
• Default expert – a general-purpose assistant with a broad toolset
• Trend Intel – focused on tracking and summarizing trends
• Image generation expert – optimized for AI image workflows
• Multi-agent trading, industry research, and more
Pick the expert that matches your main use case. You can always customize skills later.
3. Connect Your Plan and Deploy
From the MaxClaw page:
• Click the plus button to create a new MaxClaw instance
• Select your expert
• If you’re using the token plan, paste in your API key
• Click to deploy
Within a short time, your agent is live and ready to chat. You can talk to it directly in the web interface or connect it to other platforms.
4. Connect to Telegram, Slack, and More
MaxClaw supports multiple channels, including Telegram and Slack. To connect, you typically:
• Create a bot on the target platform (e.g., via BotFather on Telegram)
• Grab the bot token
• Ask your MaxClaw agent to help you connect – it will guide you through the steps
Once connected, you can use the agent entirely from your phone or chat app, while still managing its configuration from the MiniMax dashboard.
Real Use Cases: From Images to Trend Reports
To get a feel for what MaxClaw can actually do, here are a few concrete examples based on different experts.
Image Generation Expert
With the image generation expert, you can ask for complex visual tasks like:
“Generate an AI image of a specific creator. Look up what they look like, then create a cartoon-style, animated version of them with a coding-themed background.”
The agent will:
• Use web search to find reference images
• Analyze those images with an image analysis tool
• Generate a new AI image that matches the description
• Save it into your MaxClaw files for download
All of this happens without you manually wiring search, image analysis, and image generation together – the expert is pre-configured to do it.
Trend Intel Expert
The Trend Intel expert is built to track what’s happening in fast-moving spaces like AI. You can ask:
“Research the latest trends around AI agents, OpenClaw, MaxClaw, MiniMax, and new AI models as of today. Summarize the key stories and numbers.”
The agent will:
• Run multi-source web research
• Aggregate and summarize key developments (e.g., adoption numbers, new models, funding, releases)
• Present a structured report
Where this becomes powerful is when you add automation. For example:
“Every second day, send me a shorter version of this report with only the key bullet points I need to know.”
MaxClaw can then:
• Set up a recurring task (cron-like behavior)
• Generate a fresh report every two days
• Deliver it via your chosen channel (e.g., Telegram)
This kind of automated, scheduled research is very similar to what you can build with other agent frameworks. If you’re interested in building similar workflows on different stacks, you might also like this guide to building Claude-based agents without code.
Creating Custom Skills
You’re not limited to the skills that ship with an expert. From the MaxClaw interface, you can:
• Open Manage skills
• Ask the agent to create a new skill based on your instructions
• Use chat, files, or Claw Hub plugins as building blocks
For example, after getting a useful AI trends report, you could say:
“Turn the report you just generated into a reusable skill called ‘AI Trends Report’ that runs every second day and outputs a concise summary.”
MaxClaw will then:
• Create a new skill folder and configuration file
• Wire it into your agent’s skills list
• Make it available for future use and scheduling
Default Expert + Coding & Landing Pages
Using the default expert, you can add extra skills from the skills marketplace, such as:
• Landing page builder
• Icon maker
• Other user-contributed or MiniMax-built skills
Once added to your account’s skills, all your MaxClaw instances can access them. You can then prompt your agent to:
• Generate a complete landing page for an AI generative media product
• Create supporting images and icons
• Generate a short AI video (e.g., a 6-second 768p clip of a dragon flying) to embed on the page
Because image and video generation are included in the token plan, you don’t need separate integrations or billing for those capabilities.
Who Should Use MaxClaw (and Who Shouldn’t)
MaxClaw is not trying to replace fully self-hosted, deeply customized OpenClaw deployments – it’s a faster, easier alternative for many use cases.
MaxClaw Is a Great Fit If You:
• Want a personal or team AI assistant quickly (research, planning, content, trend tracking)
• Don’t want to manage servers, security, or API juggling
• Need multi-modal features (text, images, video, speech) under one subscription
• Are okay with “simple to medium” complexity automations rather than deeply bespoke systems
It’s especially compelling if you’re already spending a lot on separate tools for language models, image gen, and video gen. Consolidating into one token plan can be significantly cheaper.
You May Prefer Self-Hosting If You:
• Are a developer who wants full control over infrastructure and configs
• Need extreme customization, scaling, or proprietary integrations
• Are willing to invest 20–100+ hours into building a highly specialized agent
• Want to run everything on your own hardware or private cloud
In that case, you might still use MiniMax’s M2.7 model inside your own OpenClaw instance for its strong tool-calling behavior, while keeping full control over deployment. For another example of quickly building AI-powered web experiences, you can also check out this guide to vibe-coding a Shopify store with Claude.
Cost and Value Compared to DIY OpenClaw
Running your own OpenClaw stack typically involves:
• VPS hosting (or your own machine) – often $10+/month
• API usage for frontier models (easily multiple dollars per day under heavy use)
• Separate subscriptions for image, video, and speech generation
• Your time to set up, secure, and maintain everything
For many users, that quickly adds up to $50–$200+ per month, plus the time cost.
With MiniMax’s token plan, you pay a flat monthly fee (starting around $10–$20) and get:
• Multiple MaxClaw instances
• Generous language model usage
• Image, video, and speech generation included
• No separate hosting or infra costs
For anyone who wants capable agents without running their own stack, that’s a strong value proposition.
In short: MaxClaw won’t replace hardcore, self-hosted OpenClaw builds, but it’s an excellent way to get powerful AI agents running in minutes instead of days – and to give non-technical users a reliable assistant they can use from their phone.
Comments
No comments yet. Be the first to share your thoughts!