Crawl a site into a WACZ
Requires a Pro license
Start at the Winnie-the-Pooh table of contents, follow its links one hop out, and archive every page into a single WACZ, saved to winnie-the-pooh.wacz. Drop the file on ReplayWeb.page to browse the archived site.
zshot -t wacz -f winnie-the-pooh.wacz \
--crawl-links --crawl-link-depth 1 \
https://zshot-cli.com/example_assets/winnie-the-pooh/{
"url": "https://zshot-cli.com/example_assets/winnie-the-pooh/",
"output_type": "wacz",
"crawl_links": true,
"crawl_link_depth": 1
}curl -X POST http://127.0.0.1:3000/ \
-H "Content-Type: application/json" \
-d '{"url":"https://zshot-cli.com/example_assets/winnie-the-pooh/","output_type":"wacz","crawl_links":true,"crawl_link_depth":1}' \
-o winnie-the-pooh.wacz--crawl-links follows the links discovered on each captured page; --crawl-link-depth bounds how many hops from the seed it travels. Discovered links stay on the same registrable domain and honor robots.txt. To write a bare WARC instead, use -t warc.