A high-performance web crawler written for broken-link detection and image downloading.
Overview: argiope is a fast, lightweight web crawler designed for broken-link detection, website archiving, and batch image downloading. It features intelligent HTML scanning, URL normalization, and sophisticated report generation. Built with zero external dependencies and compiled to a single static binary.
Key Features:
- library.html landing page, nested index.html navigation, and per-folder reader.html viewers

Commands:
- check <url>: Crawl a website and generate a detailed broken-link report with timing statistics
- images <url>: Download all images from a website, or manga chapters from MangaFox, to an organized structure
- library <dir>: Generate or regenerate the HTML browser for existing image directories

Usage Examples:
- argiope check https://example.com --depth 5 — Check a site for broken links up to 5 levels deep
- argiope images https://example.com/gallery -o ./images — Archive gallery images with an HTML browser
- argiope images https://fanfox.net/manga/title --chapters 1-50 — Download manga chapters 1-50
- argiope check https://example.com --report report.html --report-format html — Generate an HTML report for CI pipelines
- argiope images https://example.com -o ./archive --parallel --depth 3 — Fast parallel crawling and download

Report Features: Reports include detailed statistics: total URLs checked, OK count, broken count, error count, the internal vs. external split, and timing information (total time, average/min/max response times). HTML reports are self-contained, with inline CSS and status badges.
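For CI use, the exit status of the check command can drive the pipeline. The sketch below assumes argiope exits non-zero when broken links are found; that behavior is not stated in this listing, so confirm it with your installed version before relying on it. The URL is a placeholder.

```shell
# CI sketch: fail the job when broken links are detected.
# Assumption (unverified): argiope returns a non-zero exit code on broken links.
set -e
argiope check https://example.com --report report.html --report-format html
echo "No broken links detected; archive report.html as a build artifact."
```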
HTML Browser: The generated portable HTML browser features light/dark/system theme modes with localStorage-backed preferences, relative links for offline browsing, percent-encoded filenames for local file access, and thumbnail galleries with ordered prev/next navigation. Works with generic image collections and deep MangaFox chapter trees alike.
MangaFox Features:
- --chapters N-M flag for selecting chapter ranges
- Downloads organized as [manga-title]/[chapter]/[page].jpg

Performance & Reliability: Configurable request timeouts, delays between requests, and concurrent crawling options ensure efficient resource usage and reliable operation on rate-limited sites. Automatic redirect following, response size limiting, and detailed error reporting make it production-ready for CI pipelines and large-scale archiving. Perfect for website archiving, CI/CD integration, and offline browsing.
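Combining only the options documented above, a large archiving run followed by a browser rebuild might look like this. The URL and output path are placeholders; the exact flag names for the timeout and delay settings mentioned in the feature list are not shown in this listing, so check argiope's built-in help for them.

```shell
# Large-scale archive: parallel crawl, three levels deep, browsable offline.
argiope images https://example.com -o ./archive --parallel --depth 3

# Later, regenerate the HTML browser for the same directory
# without re-downloading any images.
argiope library ./archive
```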
Snaps are applications packaged with all their dependencies to run on all popular Linux distributions from a single build. They update automatically and roll back gracefully.
Snaps are discoverable and installable from the Snap Store, an app store with an audience of millions.
Snap is available for CentOS 7.6+ and Red Hat Enterprise Linux 7.6+ from the Extra Packages for Enterprise Linux (EPEL) repository. The EPEL repository can be added to your system with the following command:
sudo yum install epel-release
Snap can now be installed as follows:
sudo yum install snapd
Once installed, the systemd unit that manages the main snap communication socket needs to be enabled:
sudo systemctl enable --now snapd.socket
To enable classic snap support, enter the following to create a symbolic link between /var/lib/snapd/snap and /snap:
sudo ln -s /var/lib/snapd/snap /snap
Either log out and back in again, or restart your system, to ensure snap’s paths are updated correctly.
To install argiope, simply use the following command:
sudo snap install argiope
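After installation, you can confirm the snap is present before running your first crawl. This assumes argiope supports a standard --help flag, which is not stated in this listing.

```shell
# Verify the snap is installed and inspect the available commands.
snap list argiope
argiope --help
```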
Browse and find snaps from the convenience of your desktop using the Snap Store snap.
Interested to find out more about snaps? Want to publish your own application? Visit snapcraft.io now.