Crawl a site into a WACZ

Requires a Pro license

Start at the Winnie-the-Pooh table of contents, follow its links one hop out, and archive every page into a single WACZ, saved to winnie-the-pooh.wacz. Drop the file on ReplayWeb.page to browse the archived site.

Run locally

zshot -t wacz -f winnie-the-pooh.wacz \
  --crawl-links --crawl-link-depth 1 \
  https://zshot-cli.com/example_assets/winnie-the-pooh/

The same options as a JSON request body:

{
  "url": "https://zshot-cli.com/example_assets/winnie-the-pooh/",
  "output_type": "wacz",
  "crawl_links": true,
  "crawl_link_depth": 1
}

POST that body to the local HTTP endpoint and save the response:

curl -X POST http://127.0.0.1:3000/ \
  -H "Content-Type: application/json" \
  -d '{"url":"https://zshot-cli.com/example_assets/winnie-the-pooh/","output_type":"wacz","crawl_links":true,"crawl_link_depth":1}' \
  -o winnie-the-pooh.wacz

From the table of contents, one hop reaches every chapter.

Requires zshot ≥ 0.23.0.
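A WACZ file is a ZIP container, so a quick way to confirm that the crawl captured the chapters is to list its entries before opening it in ReplayWeb.page. The sketch below assumes `unzip` is installed; `list_wacz` is a hypothetical helper name for this example, not part of zshot.

```shell
# List the entries of a WACZ file (a ZIP container).
# list_wacz is an illustrative helper, not a zshot command.
list_wacz() {
  if [ ! -f "$1" ]; then
    echo "missing: $1" >&2
    return 1
  fi
  unzip -l "$1"
}

# Filename taken from the example above.
list_wacz winnie-the-pooh.wacz || echo "run the crawl first" >&2
```

Under the WACZ layout, the listing would typically include a datapackage.json, the WARC data under archive/, and a page list under pages/, though the exact entries depend on the tool that wrote the file.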

--crawl-links follows the links discovered on each captured page; --crawl-link-depth bounds how many hops from the seed URL the crawl travels. The crawl follows only links on the same registrable domain and honors robots.txt. To write a bare WARC instead, use -t warc.
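As a sketch of the WARC variant, the same crawl could be run with only the output type and filename changed. This assumes the crawl flags combine with -t warc the same way they do with -t wacz; the output filename is chosen for this example, and the PATH check is only there so the script fails gently when zshot is not installed.

```shell
# Same one-hop crawl as above, written as a bare WARC instead of a WACZ.
# The output filename is an assumption chosen for this example.
seed="https://zshot-cli.com/example_assets/winnie-the-pooh/"
out="winnie-the-pooh.warc"

if command -v zshot >/dev/null 2>&1; then
  zshot -t warc -f "$out" --crawl-links --crawl-link-depth 1 "$seed"
else
  echo "zshot not found on PATH" >&2
fi
```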