Skip to content

leosssvip-dot/website-replication-skill

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

website-replication-skill

English · 中文

An agent-harness-agnostic skill that audits any reference website or web app and produces a differentiated parity plan across UI, interactions, API contracts, data model, and architecture — without copying protected expression. Tested on Claude Code and OpenAI Codex; any harness that loads markdown skills and exposes file + browser MCP tools can run it.


English

Overview

Typical uses: benchmarking a competitor · replicating a legacy version of your own product · learning behavior from a partner integration · auditing an inspiration source.

Status: stable templates · domain-agnostic · works for SaaS, e-commerce, content, collaboration, AI tools, marketplaces, and internal tools.

What This Skill Replicates — and What It Does NOT

Read this before installing. The word replication in the name routinely gets misread as "1:1 visual clone". It is not.

Replicates (product behavior parity):

  • Workflow logic and step sequences
  • Field structure, form composition, validation rules
  • State transitions (loading, success, error, gated, paid)
  • Interaction patterns (what each button does, what each menu reveals)
  • API capability mapping (endpoints, payload shape, async patterns)
  • Data model and entity relationships

Does NOT pixel-clone (intentionally, by design):

  • Logos, brand marks, distinctive iconography
  • Exact verbatim copy / wording
  • Distinctive page composition and visual rhythm
  • The reference site's specific color palette, typography, spacing values, radius scale
  • Proprietary imagery, illustrations, photography
  • Class names, design-system tokens (extracted only as reference, not as values to paste)

If you expect the output to look like the reference site — same colors, same fonts, near-identical layout — the skill will not give you that, and that is the correct behavior. Every workflow step pushes the result toward your target product's own brand: HTML/CSS examples use your tokens, design-tokens.js outputs are reference material (not values to paste), the Differentiation preference knob's three options all preserve some level of intentional visual difference.

Why: trademark, trade dress, and the "look and feel" doctrine make pixel-cloning a real legal exposure. The skill treats that risk as the user's, not its own, and refuses to amplify it. See Ethics & Legal below.

If you genuinely need higher visual fidelity (internal clone of your own legacy product, partner integration where contracts permit it, you have explicit license), open an issue. A future visual-fidelity-priority mode may be added under explicit IP-risk acknowledgement, but the safe default will remain behavior-only.

Ethics & Legal

This skill replicates product behavior (workflows, field structure, state handling, API capability mapping). It does not authorize you to:

  • Copy logos, exact copy, proprietary assets, or distinctive page composition.
  • Bypass authentication, paywalls, rate limits, robots restrictions, or technical protections.
  • Substitute for legal advice on trademark, patent, copyright, or Terms-of-Service compliance.

Users are responsible for their own IP, ToS, jurisdiction, and ethics review. Treat the output as engineering planning material, not legal opinion.

The skill enforces evidence redaction: cookies, auth headers, tokens, customer data, private message contents, account identifiers, and one-time URLs must never be saved or shared in reports.

Dependencies

The skill replicates not just UI but interactions — buttons, tab switches, mode toggles, async results, error states. That requires a browser automation layer. The agent auto-detects whatever is available and degrades gracefully.

Required (at least one browser automation tool)

Without one of these, the skill falls back to static HTML fetch and every interaction is marked inferred rather than observed.

Tool Purpose Install
Chrome MCP (Claude_in_Chrome) Click, screenshot, DOM read, network capture in a real Chrome session Configure per the Claude-in-Chrome project's published setup steps
Playwright MCP Headless / headed browser automation, multi-browser Microsoft Playwright MCP — install per microsoft/playwright-mcp
Claude Preview (Claude_Preview) Sandboxed preview environment with click / eval / screenshot / network MCP server provided with Claude Code; enable in MCP settings

Recommended (optional, improves coverage)

Tool Purpose
Mobile MCP (mobile-mcp) Real device or simulator screenshots for mobile parity; otherwise DevTools responsive mode is used and findings are marked inferred
WebFetch / web-reader MCP Reading official product docs, API references, changelogs
WebSearch Locating docs, integration guides, status pages when URLs are unknown
curl Unauthenticated endpoint probing when an MCP is not configured

Runtime

  • Claude Code ≥ current stable, or OpenAI Codex CLI / IDE with skills enabled.
  • Filesystem write access to the evidence directory (default ./audit/).

Repository layout

website-replication-skill/
├── SKILL.md                          # Entry point — agent loads this first
├── SKILL.zh.md                       # Chinese mirror (humans read; agent loads the English)
├── CHANGELOG.md                      # Versioned change history (SemVer)
├── agents/
│   └── openai.yaml                   # OpenAI Codex skill metadata
├── references/
│   ├── output-template.md            # Implementation-ready deliverable
│   ├── quick-audit-template.md       # ≤1h single-page / workflow audit
│   ├── prd-template.md               # Replication PRD handoff
│   ├── region-model-template.md      # Page region relationship model
│   ├── inventory-template.md         # Interactive-inventory format
│   ├── manifest-template.md          # Cross-audit snapshot index
│   ├── parity-checklist.md           # Coverage self-check
│   ├── dom-enumeration.js            # Interactive-element enumerator (browser)
│   ├── dom-distill.js                # Compact DOM outliner (browser)
│   ├── design-tokens.js              # getComputedStyle token histogram (browser)
│   ├── state-diff.js                 # Diff two distill snapshots (Node)
│   ├── network-cluster.js            # Cluster captured requests (Node)
│   └── coverage.js                   # Coverage gate — non-zero exit (Node)
├── scripts/
│   └── validate-skill.mjs            # Self-check: script syntax + template contracts
├── examples/
│   ├── sample-audit.md               # Fictional end-to-end walkthrough (no real data)
│   └── sample-audit-gemini.md        # Real anonymous-tier audit (redacted)
├── LICENSE                           # MIT
├── .gitignore
└── README.md

Install

Claude Code

mkdir -p ~/.claude/skills
git clone https://github.com/leosssvip-dot/website-replication-skill.git \
  ~/.claude/skills/website-replication-skill

Prefer to customize? Fork the repo and substitute your fork's URL.

Trigger by asking Claude something like "audit this site and produce a parity plan", or invoke explicitly via the skill picker.

OpenAI Codex

mkdir -p ~/.codex/skills
git clone https://github.com/leosssvip-dot/website-replication-skill.git \
  ~/.codex/skills/website-replication-skill

Prefer to customize? Fork the repo and substitute your fork's URL.

In Codex CLI / IDE, type $website-replication-skill to invoke explicitly. Implicit invocation is enabled via agents/openai.yaml.

Other agent harnesses

The skill is plain markdown + a JS enumeration script + a YAML manifest — no harness-specific runtime. Any agent that can:

  1. Load markdown files as system / skill context (so SKILL.md reaches the model)
  2. Read / write files (so MANIFEST.md, inventories, and reports persist)
  3. Call a browser automation MCP (Chrome MCP / Playwright MCP / equivalent)

…can run it. Copy or symlink this directory into wherever your harness expects skills:

git clone https://github.com/leosssvip-dot/website-replication-skill.git \
  <your-harness-skill-dir>/website-replication-skill

Tested concretely on Claude Code and OpenAI Codex. Other harnesses are expected to work but have not been verified — please open an issue with results if you try one.

Project-scoped install

Drop the directory under .claude/skills/ or .codex/skills/ (or your harness's equivalent) inside a specific repo to make the skill available only for that project.

Usage

Minimum inputs the skill will ask for

  • Reference site URL(s) and scope (which pages, tabs, modes).
  • Existing codebase path (if rebuilding into a repo) — optional.
  • Existing API or integration docs — optional.
  • Differentiation preference: workflow parity with original style / same features with target design system / research only.

If any are missing, the skill proceeds with explicit assumptions and marks unknowns rather than blocking.

Picking a template

Situation Template
≤ 1h, single page or single workflow references/quick-audit-template.md
Implementation-ready audit with API / data / architecture plan references/output-template.md
Coverage self-check during / after either of the above references/parity-checklist.md

Default evidence directory

Screenshots, DOM dumps, network logs, and the final report go under ./audit/<site-slug>/ unless you specify another path. Keep high-risk evidence ignored: commit only reviewed screenshots and MANIFEST.md when safe; ignore DOM, network, report, inventory, and distill markdown files.

Claim taxonomy

Every claim in a report is tagged:

  • observed — directly seen via interaction or network capture.
  • documented — found in official API / product docs.
  • inferred — reasonable deduction from partial evidence.
  • blocked — would require paid / authenticated access that is unavailable.
  • not applicable — does not apply to this competitor's product category.

This is the single most important contract of the skill: do not let the agent silently flatten these into "facts".

Contributing

Pull requests welcome, especially:

  • New product-category data-model examples for output-template.md §7 Data Model.
  • Real (fully redacted) sample audits under examples/.
  • Translations of SKILL.md / templates.

Please do not submit:

  • Sample audits containing real customer data, auth tokens, or non-redacted screenshots.
  • Examples that copy competitor logos, distinctive composition, or exact copy.
  • Changes that remove the evidence-safety or out-of-scope sections.

License

MIT © 2026 Fred


中文

简介

与 agent harness 无关的 skill,可在任意支持加载 markdown skill + 浏览器 MCP 工具的 agent 框架里运行。已在 Claude CodeOpenAI Codex 上实测;其它符合条件的框架(参考下方 "Other agent harnesses" 段三条要求)按 symlink 或 copy 引入即可——是否真的兼容请自行验证后反馈。

用于审计任意参考网站或 Web 应用,产出一份覆盖 UI、交互、API 契约、数据模型、架构的差异化对等实施方案 —— 而不复制受保护的表达。

典型场景:竞品对标 · 复刻自家旧版产品 · 学习合作方集成行为 · 审计灵感来源。

状态:模板已稳定 · 不绑定具体行业 · 适用 SaaS、电商、内容、协作、AI 工具、Marketplace、内部工具。

这个 skill 复刻什么、不复刻什么

安装前请先读这段。 名字里 "replication" 经常被误读成"像素级 1:1 克隆"。不是。

会复刻(产品行为对等):

  • 工作流逻辑与步骤顺序
  • 字段结构、表单组成、validation 规则
  • 状态切换(loading / success / error / gated / paid)
  • 交互模式(每个按钮做什么、每个菜单展开什么)
  • API 能力映射(endpoint、payload 形状、异步模式)
  • 数据模型与实体关系

不会做视觉 1:1 克隆故意的设计选择):

  • logo、品牌标识、独特图标
  • 一字不差的原文文案
  • 独创性页面构图、视觉节奏
  • 参考站独特的配色、字体、间距值、圆角 scale
  • 专有插画、摄影、图像
  • class 名、设计系统 tokens(只作为参考抽取,不是直接套用的值)

如果你期待产出看上去像参考站(同色、同字体、版式接近),skill 不会给你这个,而这正是正确行为。所有工作流步骤都把结果推向你自己产品的品牌:HTML/CSS 示例用你的 tokens,design-tokens.js 输出是参考材料(不是直接套用的值),Differentiation preference 三个选项都保留某种程度的有意视觉差异。

为什么:商标、商业外观(trade dress)、"look and feel"原则让像素级克隆面临真实诉讼风险。skill 把这个风险视为用户自担,并拒绝放大它。详见下方法律与伦理段。

如果你确实需要更高视觉保真度(内部复刻自家旧版、合同明确允许的合作方集成、有显式授权),开 issue。未来可能加 visual-fidelity-priority 模式,但需要使用者书面承担 IP 风险责任;默认值仍是行为级复刻

法律与伦理

本 skill 复刻的是产品行为(工作流、字段结构、状态处理、API 能力映射),不授权你做以下事情:

  • 复制 logo、原文文案、专有资产或具有独创性的页面构图。
  • 绕过登录、付费墙、限流、robots 限制或任何技术保护措施。
  • 替代关于商标、专利、版权或 ToS 合规的法律意见。

知识产权、ToS、司法管辖区与伦理审查由用户自行负责。skill 输出物是工程规划材料,不是法律意见。

skill 强制执行证据脱敏:cookies、auth headers、tokens、客户数据、私信内容、账号标识、一次性 URL 一律不得保存或写入报告。

依赖

本 skill 不只复刻 UI,还要复刻交互 —— 按钮、tab 切换、mode toggle、异步结果、错误状态。这需要浏览器自动化能力。Agent 会自动检测当前可用工具并优雅降级。

必需依赖(浏览器自动化工具,至少装一个)

如果一个都没有,skill 会回退到纯静态 HTML 抓取,所有交互一律标 inferred,不再是 observed

工具 用途 安装
Chrome MCPClaude_in_Chrome 在真实 Chrome 会话里点击、截图、读取 DOM、抓网络请求 按 Claude-in-Chrome 项目官方说明配置
Playwright MCP Headless / headed 浏览器自动化,多浏览器 微软官方 Playwright MCP,按 microsoft/playwright-mcp 说明安装
Claude PreviewClaude_Preview 沙箱预览环境,支持 click / eval / screenshot / network Claude Code 附带的 MCP server,在 MCP 设置里启用

推荐依赖(可选,覆盖更全)

工具 用途
Mobile MCPmobile-mcp 真机或模拟器移动端截图;没有就退到 DevTools 响应式模式,相关结论标 inferred
WebFetch / web-reader MCP 读官方产品文档、API reference、changelog
WebSearch URL 未知时查文档、集成指南、状态页
curl 未配置 MCP 时探测无需鉴权的接口

运行环境

  • Claude Code ≥ 当前 stable,或启用 skill 的 OpenAI Codex CLI / IDE
  • 对证据目录(默认 ./audit/)的可写权限。

目录结构

website-replication-skill/
├── SKILL.md                          # Agent 入口,最先加载
├── SKILL.zh.md                       # 中文镜像(给人读;agent 仍加载英文版)
├── CHANGELOG.md                      # 版本变更记录(SemVer)
├── agents/
│   └── openai.yaml                   # OpenAI Codex skill 元数据
├── references/
│   ├── output-template.md            # 可落地的完整交付模板
│   ├── quick-audit-template.md       # ≤1h 单页 / 单流程审计
│   ├── prd-template.md               # 复刻 PRD 交接模板
│   ├── region-model-template.md      # 页面区域关系模型
│   ├── inventory-template.md         # 交互清单格式
│   ├── manifest-template.md          # 跨次审计截图索引
│   ├── parity-checklist.md           # 覆盖度自检清单
│   ├── dom-enumeration.js            # 交互元素枚举器(浏览器)
│   ├── dom-distill.js                # DOM 结构精简器(浏览器)
│   ├── design-tokens.js              # getComputedStyle 设计 token 直方图(浏览器)
│   ├── state-diff.js                 # 两份 distill 快照 diff(Node)
│   ├── network-cluster.js            # 网络请求聚类(Node)
│   └── coverage.js                   # 覆盖度 gate — 非零退出码(Node)
├── scripts/
│   └── validate-skill.mjs            # 自检:脚本语法 + 模板契约
├── examples/
│   ├── sample-audit.md               # 虚构端到端样例(无真实数据)
│   └── sample-audit-gemini.md        # 真实匿名层审计(已脱敏)
├── LICENSE                           # MIT
├── .gitignore
└── README.md

安装

Claude Code

mkdir -p ~/.claude/skills
git clone https://github.com/leosssvip-dot/website-replication-skill.git \
  ~/.claude/skills/website-replication-skill

想自定义?fork 后用你自己仓库的 URL 替换上面的地址即可。

之后向 Claude 说"audit 这个网站,产出 parity plan"之类的话即可触发;也可在 skill picker 中显式调用。

OpenAI Codex

mkdir -p ~/.codex/skills
git clone https://github.com/leosssvip-dot/website-replication-skill.git \
  ~/.codex/skills/website-replication-skill

想自定义?fork 后用你自己仓库的 URL 替换上面的地址即可。

在 Codex CLI / IDE 输入 $website-replication-skill 显式调用。隐式调用由 agents/openai.yaml 启用。

其它 agent 框架

Skill 本体是纯 markdown + JS 枚举脚本 + YAML manifest,没有 harness 专属 runtime。只要你的 agent 能:

  1. 把 markdown 文件作为 system / skill 上下文加载(让 SKILL.md 能被模型读到)
  2. 读写文件(让 MANIFEST.md、inventory、报告能持久化)
  3. 调用浏览器自动化 MCP(Chrome MCP / Playwright MCP / 同类)

就能跑。把目录 clone / symlink 到 harness 期望的 skill 目录即可:

git clone https://github.com/leosssvip-dot/website-replication-skill.git \
  <你的-harness-skill-目录>/website-replication-skill

Claude Code 与 OpenAI Codex 已实测可用;其它 harness 预期可用但未验证 —— 跑通了欢迎开 issue 反馈。

项目级安装

把目录放进具体仓库的 .claude/skills/.codex/skills/(或你 harness 的等价目录),skill 只在该项目下可用。

使用

Skill 会问的最少输入

  • 参考站 URL 与范围(页面 / tab / mode)。
  • 现有代码仓库路径(重建时)— 可选。
  • 现有 API 或集成文档 — 可选。
  • 差异化偏好:workflow parity with original style / same features with target design system / research only

缺项时不阻塞,skill 会显式标注假设并标记未知项。

模板选择

场景 模板
≤ 1h,单页或单流程 references/quick-audit-template.md
含 API / 数据 / 架构的可落地审计 references/output-template.md
上述过程中 / 之后做覆盖度自检 references/parity-checklist.md

默认证据目录

截图、DOM、网络日志、最终报告写入 ./audit/<site-slug>/,除非你指定其它路径。高风险证据必须忽略:只有确认脱敏后的截图与 MANIFEST.md 才适合提交;DOM、网络、报告、inventory、distill markdown 都应留在本地。

Claim 分级体系

报告里每条结论都打标签:

  • observed —— 通过交互或网络抓包直接看到的。
  • documented —— 官方 API / 产品文档里写明的。
  • inferred —— 从局部证据合理推断的。
  • blocked —— 需要付费 / 鉴权访问而当前不可达的。
  • not applicable —— 该竞品品类不适用。

这是 skill 最重要的契约:不允许 agent 把它们偷偷合并为"事实"

贡献

欢迎 PR,尤其是:

  • output-template.md §7 Data Model 补新品类的数据模型样例。
  • examples/ 下提交完全脱敏的真实审计样例。
  • SKILL.md / 模板的多语言翻译。

不要提交:

  • 含真实客户数据、auth token、未脱敏截图的样例。
  • 复制了竞品 logo、独创性构图或原文文案的样例。
  • 删除证据安全或 out-of-scope 章节的改动。

许可证

MIT © 2026 Fred

About

Skill for Claude Code / OpenAI Codex: audit a reference site and produce a differentiated parity plan (UI, interactions, API, data, architecture).

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors