Any URL to Markdown, instantly

Convert any web page to clean Markdown for AI agents, LLMs, and developers. Open source, edge-deployed, 21 platform adapters.

format · selector · force_browser · raw · engine

mp.weixin.qq.com/s/abc123def

📱 WeChat Official Follow

Deep Dive: LLMs in Enterprise Production

John Zhang | 2026-03-25

⚠️

This content requires the WeChat app

Scan QR code

Open in WeChat

Recommended ▼

web2md.cutexiaoguigui.eu.org/mp.weixin.qq.com/s/abc123def

# Deep Dive: LLMs in Enterprise Production

> John Zhang | 2026-03-25

## Key Takeaways

1. LLMs are moving from lab experiments to production environments
2. RAG architecture is the preferred approach for enterprise apps
3. Agent workflows will redefine software development

## Background

With the release of GPT-4, Claude, and other models, enterprises are seriously considering integrating LLMs into their core business workflows...

```python
from langchain import ChatOpenAI
llm = ChatOpenAI(model="gpt-4")
```

✓ X-Method: browser+readability · 2.1s · cached

Works where others fail

JS-heavy SPAs, paywalled content, anti-bot sites. 21 adapters for Chinese & international platforms.

zhihu.com/p/123456789

→

BLOCKED PAGE

Zhihu - Q&A Platform

How do you evaluate LLM adoption in enterprises?

In recent years, with breakthroughs in large language model technology, more and more enterprises are exploring integrating AI into core business...

CLEAN MARKDOWN

# How do you evaluate LLM adoption in enterprises?

In recent years, with breakthroughs in large language model technology, more and more enterprises are exploring integrating AI into core business workflows.

## Three Key Trends

- Multimodal capabilities becoming standard
- Growing demand for private deployment
- Agent frameworks flourishing

✓ Extracted via browser rendering

Claude

User

Read this article and summarize the key points
https://mp.weixin.qq.com/s/abc123

Claude

⚙ convert_url

url: "https://mp.weixin.qq.com/s/..."

✓ 3,421 chars · 2.1s

Here are the key takeaways from the article:

1. LLMs are moving from experiments to production
2. RAG is the most practical architecture currently
3. Agents will transform software development

The article also mentions an interesting case study...

AI-native from day one

MCP Server, Agent Skills, llms.txt built-in. Your AI agent just works -- no glue code needed.

Production ready

568 tests, 5-layer fallback pipeline, KV cache, edge-deployed on Cloudflare Workers.

5-Layer Fallback Pipeline

Request: https://example.com/article

Layer 1 — Native Markdown Cloudflare edge ⚡ 0.1s ✓

│ fail?

Layer 2 — Readability + Turndown HTML parsing ⚡ 0.5s

│ fail?

Layer 3 — Browser Rendering Headless Chrome ⚡ 2-5s

│ fail?

Layer 4 — CF REST API Browser Rendering ⚡ 1-3s

│ fail?

Layer 5 — Jina Reader External fallback ⚡ 2-4s

Result: Clean Markdown · 99.2% success rate

Built for every workflow

AI Agents

Feed web content to LLMs in clean Markdown

Knowledge Base

Crawl docs, wikis, blogs with deep crawl

Content Migration

Batch convert up to 10 URLs

Research

Read any article, no login walls

Chinese Web

WeChat, Zhihu, Feishu, Yuque, CSDN...

Data Extraction

CSS selectors, XPath, or Regex

21 Platform Adapters

WeChatZhihuFeishuYuqueJuejinCSDN36KrToutiaoWeiboNetEaseTwitter/XRedditNotionGitHubSubstackMedium

How it works

Prepend URL

Add md.genedai.me/ before any web address

Edge Pipeline

5-layer fallback: Native → Readability → Browser → CF REST → Jina

iii

Clean Output

Markdown, JSON, HTML, or plain text

Frequently asked questions

What is Website2Markdown?

A free, open-source API that converts any web page URL to clean Markdown. Built on Cloudflare Workers with 5-layer fallback: native edge Markdown → Readability → headless browser → CF REST API → Jina Reader.

Is it free?

Yes, completely free and open source under Apache-2.0. Self-host or use the managed service at md.genedai.me.

Which platforms are supported?

21 built-in adapters: WeChat, Zhihu, Feishu/Lark, Yuque, Juejin, CSDN, 36Kr, Toutiao, Weibo, NetEase, Twitter/X, Reddit, Notion, and more. Any public URL works via generic fallback.

How does it handle JS-heavy pages?

Automatic 5-layer fallback. If native extraction fails, it escalates to Readability, then headless Chrome via Cloudflare Browser Rendering, then Jina Reader as last resort. Use ?force_browser=true to skip straight to browser rendering.

How to integrate with my AI agent?

Three ways: (1) Agent Skills for Claude Code/OpenClaw -- one command install. (2) MCP Server for Claude Desktop/Cursor. (3) llms.txt for auto-discovery by any AI system.

How to use the API?

Prepend md.genedai.me/ before any URL. For raw Markdown, add ?raw=true. Example: curl "https://md.genedai.me/https://example.com?raw=true". See the Docs tab for full API reference.

Try it now.

Try converting a Cloudflare docs page →

Quick Start

# Get raw Markdown
curl -H "Accept: text/markdown" https://web2md.cutexiaoguigui.eu.org/https://example.com

# Get JSON output
curl "https://web2md.cutexiaoguigui.eu.org/https://example.com?raw=true&format=json"

# Batch conversion
curl -X POST https://web2md.cutexiaoguigui.eu.org/api/batch \
  -H "Authorization: Bearer API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls":["https://example.com"]}'

API Reference

Routes
`GET /{url}`	Convert a single URL to Markdown
`GET /api/stream`	SSE progress for single-URL conversion (step / done / fail)
`POST /api/batch`	Convert up to 10 URLs (requires API_TOKEN)
`POST /api/extract`	Structured extraction (css / xpath / regex)
`POST /api/jobs`	Create, query, stream, and run jobs
`POST /api/deepcrawl`	BFS / BestFirst deep crawl with filters and scoring
`GET /api/health`	Health check and operational metrics
`GET /api/og`	Share-image generator
`GET /llms.txt`	AI-readable API description

Query Parameters

`?raw=true`	Return raw Markdown (no HTML wrapper)
`?format=`	Output format (`markdown` \| `html` \| `text` \| `json`)
`?selector=.css`	Extract only matching CSS selector
`?force_browser=true`	Force headless browser rendering
`?engine=jina`	Convert via specific engine (jina / cf)
`?no_cache=true`	Bypass cache, fetch fresh content
`?token=`	Public API token

Authentication

PUBLIC_API_TOKEN — Single-URL convert and /api/stream accept Bearer or ?token=...

API_TOKEN — /api/batch, /api/extract, /api/jobs*, and /api/deepcrawl require API_TOKEN

curl Examples

# Get raw markdown
curl -H "Accept: text/markdown" https://web2md.cutexiaoguigui.eu.org/https://example.com

# Get JSON output
curl "https://web2md.cutexiaoguigui.eu.org/https://example.com?raw=true&format=json"

# Batch conversion
curl -X POST https://web2md.cutexiaoguigui.eu.org/api/batch \
  -H "Authorization: Bearer API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls":["https://example.com"]}'

# Structured extraction
curl -X POST https://web2md.cutexiaoguigui.eu.org/api/extract \
  -H "Authorization: Bearer API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"strategy":"css","url":"https://example.com","schema":{"fields":[{"name":"title","selector":"h1","type":"text","required":true}]}}'

# Deep crawl
curl -X POST https://web2md.cutexiaoguigui.eu.org/api/deepcrawl \
  -H "Authorization: Bearer API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"seed":"https://example.com/docs","stream":true}'

Response Headers

`X-Markdown-Method`	native \| readability+turndown \| browser+readability+turndown \| jina
`X-Cache-Status`	HIT \| MISS
`X-Source-URL`	The original target URL

Choose Your Integration

Does your agent have a terminal?

YES → Agent Skills (fastest, richest context)

NO → MCP Server

All AI → llms.txt auto-discovery

Agent Skills

For: Claude Code, Codex CLI, Gemini CLI, OpenClaw

One command to install, auto-discovered by your agent. Includes full usage patterns, error handling, and guides for all 21 platform adapters.

Claude Code

git clone https://github.com/Digidai/website2markdown-skills ~/.claude/skills/website2markdown

Codex CLI

git clone https://github.com/Digidai/website2markdown-skills ~/.codex/skills/website2markdown

Gemini CLI

git clone https://github.com/Digidai/website2markdown-skills ~/.gemini/skills/website2markdown

OpenClaw

npx clawhub@latest install website2markdown

One command install, auto-discovered in new sessions

MCP Server

For: Claude Desktop, Cursor IDE, Windsurf

Standard MCP protocol with convert_url tool.

npm install -g @digidai/mcp-website2markdown

Claude Desktop config (~/.claude/claude_desktop_config.json)

{
  "mcpServers": {
    "website2markdown": {
      "command": "mcp-website2markdown",
      "env": {
        "WEBSITE2MARKDOWN_API_URL": "https://web2md.cutexiaoguigui.eu.org"
      }
    }
  }
}

llms.txt

For: any AI system with web access

Machine-readable API description following the llms.txt standard. Any AI system can discover all capabilities from this endpoint.

https://web2md.cutexiaoguigui.eu.org/llms.txt →

Comparison

	Skills	MCP	llms.txt
Latency	★★★	★★	★★★
Context	★★★	★	★★
Install	1 command	1 command	None
Best for	CLI AI	IDE AI	All