Any URL to Markdown, instantly

Convert any web page to clean Markdown for AI agents, LLMs, and developers. Open source, edge-deployed, 21 platform adapters.

web2md.cutexiaoguigui.eu.org/

format · selector · force_browser · raw · engine

mp.weixin.qq.com/s/abc123def
📱 WeChat Official Follow
Deep Dive: LLMs in Enterprise Production
John Zhang | 2026-03-25
⚠️
This content requires the WeChat app
Scan QR code
Open in WeChat
Recommended ▼
Ad
Ad
Ad
web2md.cutexiaoguigui.eu.org/mp.weixin.qq.com/s/abc123def
# Deep Dive: LLMs in Enterprise Production
> John Zhang | 2026-03-25
## Key Takeaways
1. LLMs are moving from lab experiments to production environments
2. RAG architecture is the preferred approach for enterprise apps
3. Agent workflows will redefine software development
## Background
With the release of GPT-4, Claude, and other models, enterprises are seriously considering integrating LLMs into their core business workflows...
```python
from langchain import ChatOpenAI
llm = ChatOpenAI(model="gpt-4")
```
X-Method: browser+readability · 2.1s · cached
Works where others fail
JS-heavy SPAs, paywalled content, anti-bot sites. 21 adapters for Chinese & international platforms.
zhihu.com/p/123456789
BLOCKED PAGE
Z Zhihu - Q&A Platform
How do you evaluate LLM adoption in enterprises?
In recent years, with breakthroughs in large language model technology, more and more enterprises are exploring integrating AI into core business...
Login to view full content
Login Sign up
CLEAN MARKDOWN
# How do you evaluate LLM adoption in enterprises?
In recent years, with breakthroughs in large language model technology, more and more enterprises are exploring integrating AI into core business workflows.
## Three Key Trends
- Multimodal capabilities becoming standard
- Growing demand for private deployment
- Agent frameworks flourishing
Extracted via browser rendering
Claude
User
Read this article and summarize the key points
https://mp.weixin.qq.com/s/abc123
Claude
⚙ convert_url
url: "https://mp.weixin.qq.com/s/..."
3,421 chars · 2.1s
Here are the key takeaways from the article:
1. LLMs are moving from experiments to production
2. RAG is the most practical architecture currently
3. Agents will transform software development
The article also mentions an interesting case study...
AI-native from day one
MCP Server, Agent Skills, llms.txt built-in. Your AI agent just works -- no glue code needed.
Production ready
568 tests, 5-layer fallback pipeline, KV cache, edge-deployed on Cloudflare Workers.
5-Layer Fallback Pipeline
Request: https://example.com/article
Layer 1: Native Markdown · Cloudflare edge · ⚡ 0.1s
│ fail?
Layer 2: Readability + Turndown · HTML parsing · ⚡ 0.5s
│ fail?
Layer 3: Browser Rendering · Headless Chrome · ⚡ 2-5s
│ fail?
Layer 4: CF REST API · Browser Rendering · ⚡ 1-3s
│ fail?
Layer 5: Jina Reader · External fallback · ⚡ 2-4s
Result: Clean Markdown · 99.2% success rate
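
The fallback idea above can be sketched in a few lines of Python. This is only an illustration of "try each layer in order, stop at the first one that yields content"; the function `convert_with_fallback` and the three stand-in layers are hypothetical and are not taken from the project's source.

```python
from typing import Callable, Optional, Sequence, Tuple

# A layer takes a URL and returns Markdown, or None when it cannot extract anything.
Layer = Tuple[str, Callable[[str], Optional[str]]]


def convert_with_fallback(url: str, layers: Sequence[Layer]) -> Tuple[str, str]:
    """Try each layer in order and return (layer_name, markdown) from the first success."""
    errors = []
    for name, extract in layers:
        try:
            markdown = extract(url)
        except Exception as exc:  # a crashing layer is treated as a failed layer
            errors.append(f"{name}: {exc}")
            continue
        if markdown and markdown.strip():
            return name, markdown
        errors.append(f"{name}: empty result")
    raise RuntimeError("all layers failed: " + "; ".join(errors))


# Hypothetical stand-ins for real extraction layers.
def native(url: str) -> Optional[str]:
    return None  # the site does not serve Markdown directly


def readability(url: str) -> Optional[str]:
    raise ValueError("page needs JavaScript")


def browser(url: str) -> Optional[str]:
    return "# Example\n\nRendered content"


layers = [("native", native), ("readability+turndown", readability), ("browser", browser)]
name, markdown = convert_with_fallback("https://example.com/article", layers)
print(name)
```

Cheap layers run first, so the slower browser-based layers are only paid for when the fast ones return nothing.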

Built for every workflow

AI Agents
Feed web content to LLMs in clean Markdown
Knowledge Base
Crawl docs, wikis, blogs with deep crawl
Content Migration
Batch convert up to 10 URLs
Research
Read any article, no login walls
Chinese Web
WeChat, Zhihu, Feishu, Yuque, CSDN...
Data Extraction
CSS selectors, XPath, or Regex
21 Platform Adapters
WeChat, Zhihu, Feishu, Yuque, Juejin, CSDN, 36Kr, Toutiao, Weibo, NetEase, Twitter/X, Reddit, Notion, GitHub, Substack, Medium

How it works

i. Prepend URL
Add md.genedai.me/ before any web address
ii. Edge Pipeline
5-layer fallback: Native → Readability → Browser → CF REST → Jina
iii. Clean Output
Markdown, JSON, HTML, or plain text

Frequently asked questions

What is Website2Markdown?
A free, open-source API that converts any web page URL to clean Markdown. Built on Cloudflare Workers with 5-layer fallback: native edge Markdown → Readability → headless browser → CF REST API → Jina Reader.
Is it free?
Yes, completely free and open source under Apache-2.0. Self-host or use the managed service at md.genedai.me.
Which platforms are supported?
21 built-in adapters: WeChat, Zhihu, Feishu/Lark, Yuque, Juejin, CSDN, 36Kr, Toutiao, Weibo, NetEase, Twitter/X, Reddit, Notion, and more. Any public URL works via generic fallback.
How does it handle JS-heavy pages?
Automatic 5-layer fallback. If native extraction fails, it escalates to Readability, then headless Chrome via Cloudflare Browser Rendering, then the CF REST API, and finally Jina Reader as a last resort. Use ?force_browser=true to skip straight to browser rendering.
How do I integrate it with my AI agent?
Three ways: (1) Agent Skills for Claude Code/OpenClaw -- one command install. (2) MCP Server for Claude Desktop/Cursor. (3) llms.txt for auto-discovery by any AI system.
How do I use the API?
Prepend md.genedai.me/ before any URL. For raw Markdown, add ?raw=true. Example: curl "https://md.genedai.me/https://example.com?raw=true". See the Docs tab for full API reference.

Quick Start

# Get raw Markdown
curl -H "Accept: text/markdown" https://web2md.cutexiaoguigui.eu.org/https://example.com

# Get JSON output
curl "https://web2md.cutexiaoguigui.eu.org/https://example.com?raw=true&format=json"

# Batch conversion
curl -X POST https://web2md.cutexiaoguigui.eu.org/api/batch \
  -H "Authorization: Bearer API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls":["https://example.com"]}'

API Reference

Routes
GET   /{url}           Convert a single URL to Markdown
GET   /api/stream      SSE progress for single-URL conversion (step / done / fail)
POST  /api/batch       Convert up to 10 URLs (requires API_TOKEN)
POST  /api/extract     Structured extraction (css / xpath / regex)
POST  /api/jobs        Create, query, stream, and run jobs
POST  /api/deepcrawl   BFS / BestFirst deep crawl with filters and scoring
GET   /api/health      Health check and operational metrics
GET   /api/og          Share-image generator
GET   /llms.txt        AI-readable API description
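
For /api/stream, a client has to split the response body into Server-Sent Events. The sketch below parses the standard `event:` / `data:` wire format. The event names step, done, and fail come from the route table above; the payload text in the sample is invented for illustration, since the actual payload shape is not documented here.

```python
from typing import Dict, Iterator, List


def parse_sse(text: str) -> Iterator[Dict[str, str]]:
    """Yield one dict per Server-Sent Event, with 'event' and 'data' keys."""
    event = "message"  # default event name in the SSE format
    data: List[str] = []
    for line in text.splitlines():
        if not line:  # a blank line ends the current event
            if data:
                yield {"event": event, "data": "\n".join(data)}
            event, data = "message", []
        elif line.startswith(":"):  # comment or keep-alive line
            continue
        elif line.startswith("event:"):
            event = line[len("event:"):].strip()
        elif line.startswith("data:"):
            value = line[len("data:"):]
            # The format strips a single leading space after the colon.
            data.append(value[1:] if value.startswith(" ") else value)
    if data:  # be lenient if the stream ends without a trailing blank line
        yield {"event": event, "data": "\n".join(data)}


# Invented sample stream; real payloads may look different.
sample = (
    "event: step\ndata: fetching\n\n"
    "event: step\ndata: rendering\n\n"
    "event: done\ndata: # Title\n\n"
)
events = list(parse_sse(sample))
for item in events:
    print(item["event"], "->", item["data"])
```

A caller would typically stop reading once it sees a done or fail event.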

Query Parameters

?raw=true            Return raw Markdown (no HTML wrapper)
?format=             Output format (markdown | html | text | json)
?selector=.css       Extract only matching CSS selector
?force_browser=true  Force headless browser rendering
?engine=jina         Convert via specific engine (jina / cf)
?no_cache=true       Bypass cache, fetch fresh content
?token=              Public API token
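
A small helper can assemble the request URL from these parameters. This is a sketch, not part of any official client: `build_convert_url` is a hypothetical name, and joining with `&` when the target URL already carries its own query string is an assumption, because the page does not say how the service tells its parameters apart from the target's.

```python
from typing import Optional
from urllib.parse import urlencode

ALLOWED_FORMATS = {"markdown", "html", "text", "json"}
ALLOWED_ENGINES = {"jina", "cf"}


def build_convert_url(
    service: str,
    target: str,
    *,
    raw: bool = False,
    fmt: Optional[str] = None,
    selector: Optional[str] = None,
    force_browser: bool = False,
    engine: Optional[str] = None,
    no_cache: bool = False,
    token: Optional[str] = None,
) -> str:
    """Prepend the service host to the target URL and append the chosen parameters."""
    params = {}
    if raw:
        params["raw"] = "true"
    if fmt is not None:
        if fmt not in ALLOWED_FORMATS:
            raise ValueError(f"unsupported format: {fmt}")
        params["format"] = fmt
    if selector is not None:
        params["selector"] = selector
    if force_browser:
        params["force_browser"] = "true"
    if engine is not None:
        if engine not in ALLOWED_ENGINES:
            raise ValueError(f"unsupported engine: {engine}")
        params["engine"] = engine
    if no_cache:
        params["no_cache"] = "true"
    if token is not None:
        params["token"] = token

    url = service.rstrip("/") + "/" + target
    if not params:
        return url
    # Assumption: reuse the target's query string if it already has one.
    separator = "&" if "?" in target else "?"
    return url + separator + urlencode(params)


url = build_convert_url(
    "https://web2md.cutexiaoguigui.eu.org", "https://example.com", raw=True, fmt="json"
)
print(url)
```

With `raw=True` and `fmt="json"` the helper reproduces the JSON example from the Quick Start section.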

Authentication

PUBLIC_API_TOKEN — Single-URL convert and /api/stream accept Bearer or ?token=...

API_TOKEN — /api/batch, /api/extract, /api/jobs*, and /api/deepcrawl require API_TOKEN
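
One way to keep the two tokens apart in client code is to choose the header from the route. The helper below is a hypothetical sketch based only on the two rules above; whether a given deployment actually requires PUBLIC_API_TOKEN for single-URL conversion is not stated here, so the helper sends it only when one is supplied.

```python
from typing import Dict, Optional

# Routes that the Authentication section lists as requiring API_TOKEN.
PROTECTED_PREFIXES = ("/api/batch", "/api/extract", "/api/jobs", "/api/deepcrawl")


def auth_headers(
    path: str,
    api_token: Optional[str] = None,
    public_token: Optional[str] = None,
) -> Dict[str, str]:
    """Return the Authorization header appropriate for the given route."""
    if path.startswith(PROTECTED_PREFIXES):
        if not api_token:
            raise ValueError(f"{path} requires API_TOKEN")
        return {"Authorization": f"Bearer {api_token}"}
    if public_token:
        # Single-URL convert and /api/stream accept the public token as a Bearer header.
        return {"Authorization": f"Bearer {public_token}"}
    return {}


print(auth_headers("/api/batch", api_token="API_TOKEN"))
print(auth_headers("/https://example.com", public_token="PUBLIC_API_TOKEN"))
```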

curl Examples

# Get raw Markdown
curl -H "Accept: text/markdown" https://web2md.cutexiaoguigui.eu.org/https://example.com

# Get JSON output
curl "https://web2md.cutexiaoguigui.eu.org/https://example.com?raw=true&format=json"

# Batch conversion
curl -X POST https://web2md.cutexiaoguigui.eu.org/api/batch \
  -H "Authorization: Bearer API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls":["https://example.com"]}'

# Structured extraction
curl -X POST https://web2md.cutexiaoguigui.eu.org/api/extract \
  -H "Authorization: Bearer API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"strategy":"css","url":"https://example.com","schema":{"fields":[{"name":"title","selector":"h1","type":"text","required":true}]}}'

# Deep crawl
curl -X POST https://web2md.cutexiaoguigui.eu.org/api/deepcrawl \
  -H "Authorization: Bearer API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"seed":"https://example.com/docs","stream":true}'
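
Because /api/batch accepts up to 10 URLs, a longer list has to be split on the client side. The following sketch prepares one POST request per chunk with Python's standard library, mirroring the batch curl example; `batch_requests` is a hypothetical helper, and nothing is sent until a request is passed to `urllib.request.urlopen`.

```python
import json
from typing import List
from urllib.request import Request

MAX_BATCH = 10  # limit stated in the route table


def batch_requests(service: str, urls: List[str], api_token: str) -> List[Request]:
    """Build one POST request to /api/batch per chunk of at most MAX_BATCH URLs."""
    endpoint = service.rstrip("/") + "/api/batch"
    requests = []
    for start in range(0, len(urls), MAX_BATCH):
        body = json.dumps({"urls": urls[start:start + MAX_BATCH]}).encode("utf-8")
        requests.append(
            Request(
                endpoint,
                data=body,
                method="POST",
                headers={
                    "Authorization": f"Bearer {api_token}",
                    "Content-Type": "application/json",
                },
            )
        )
    return requests


pages = [f"https://example.com/page/{i}" for i in range(23)]
reqs = batch_requests("https://web2md.cutexiaoguigui.eu.org", pages, "API_TOKEN")
print(len(reqs), "requests prepared")
```

The shape of the batch response is not described on this page, so parsing it is left out.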

Response Headers

X-Markdown-Method  native | readability+turndown | browser+readability+turndown | jina
X-Cache-Status     HIT | MISS
X-Source-URL       The original target URL
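
These headers are convenient for logging which layer produced a result and whether it came from cache. A minimal sketch, assuming the headers arrive as a plain mapping; `ConversionInfo` and `read_conversion_headers` are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class ConversionInfo:
    method: Optional[str]      # value of X-Markdown-Method, if present
    cache_hit: bool            # True when X-Cache-Status is HIT
    source_url: Optional[str]  # value of X-Source-URL, if present
    used_browser: bool         # True when the method starts with "browser"


def read_conversion_headers(headers: Mapping[str, str]) -> ConversionInfo:
    """Summarize the conversion-related response headers (names are case-insensitive)."""
    lowered = {key.lower(): value for key, value in headers.items()}
    method = lowered.get("x-markdown-method")
    return ConversionInfo(
        method=method,
        cache_hit=lowered.get("x-cache-status", "").upper() == "HIT",
        source_url=lowered.get("x-source-url"),
        used_browser=bool(method) and method.startswith("browser"),
    )


info = read_conversion_headers(
    {
        "X-Markdown-Method": "browser+readability+turndown",
        "X-Cache-Status": "HIT",
        "X-Source-URL": "https://example.com",
    }
)
print(info)
```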

Choose Your Integration

Does your agent have a terminal?
YES → Agent Skills (fastest, richest context)
NO → MCP Server
All AI → llms.txt auto-discovery

Agent Skills

For: Claude Code, Codex CLI, Gemini CLI, OpenClaw

One command to install, auto-discovered by your agent. Includes full usage patterns, error handling, and guides for all 21 platform adapters.

Claude Code
git clone https://github.com/Digidai/website2markdown-skills ~/.claude/skills/website2markdown
Codex CLI
git clone https://github.com/Digidai/website2markdown-skills ~/.codex/skills/website2markdown
Gemini CLI
git clone https://github.com/Digidai/website2markdown-skills ~/.gemini/skills/website2markdown
OpenClaw
npx clawhub@latest install website2markdown
One command install, auto-discovered in new sessions

MCP Server

For: Claude Desktop, Cursor IDE, Windsurf

Standard MCP protocol with convert_url tool.

npm install -g @digidai/mcp-website2markdown
Claude Desktop config (~/.claude/claude_desktop_config.json)
{
  "mcpServers": {
    "website2markdown": {
      "command": "mcp-website2markdown",
      "env": {
        "WEBSITE2MARKDOWN_API_URL": "https://web2md.cutexiaoguigui.eu.org"
      }
    }
  }
}

llms.txt

For: any AI system with web access

Machine-readable API description following the llms.txt standard. Any AI system can discover all capabilities from this endpoint.

https://web2md.cutexiaoguigui.eu.org/llms.txt

Comparison

Skills | MCP | llms.txt
Latency★★★★★★★★
Context★★★★★
Install: 1 command | 1 command | None
Best for: CLI AI | IDE AI | All