Broken Link Checker
Enterprise-grade SEO crawler and link validator. Detects 404s, redirect chains, and server errors with configurable recursion depth, concurrency control, and CORS proxy routing.
| Status | Target URL | Source (Parent) | Anchor Text |
|---|
About
The Broken Link Checker is a client-side recursive crawler designed to audit web architecture for navigational integrity. In the context of Search Engine Optimization (SEO), link rot - the gradual decay of external links - can significantly penalize a domain's authority score. This tool automates the traversal of the Document Object Model (DOM), extracting hyper references (href) and validating their HTTP status codes against standard protocols.
Unlike simple ping tools, this engine utilizes a Breadth-First Search (BFS) algorithm to map site topology up to a user-defined depth (d). It handles complex edge cases including infinite redirect loops, protocol-relative URLs, and varying User-Agent restrictions. Due to browser Same-Origin Policy (SOP), a configurable CORS proxy layer is integrated to enable cross-origin inspections directly from the client interface.
Formulas
The crawl efficiency is modeled by the request concurrency and latency. The total scan time T is approximated by:
We define the Link Health Score (H) as:
Where Lok is the count of valid endpoints (2xx/3xx) and Ltotal is the unique link set size.
Reference Data
| Status Code | Class | SEO Impact | Recommended Action |
|---|---|---|---|
| 200 OK | Success | Positive signal. Content is accessible. | No action required. |
| 301 Moved Permanently | Redirection | Transfers ≈90-99% of link equity (PageRank). | Update internal links to the new destination to reduce latency. |
| 302 Found (Temporary) | Redirection | Does not pass link equity effectively. | Change to 301 if the move is permanent. |
| 403 Forbidden | Client Error | Crawlers are blocked. Content is not indexed. | Check server permissions or firewall rules. |
| 404 Not Found | Client Error | Negative user experience; high bounce rate signal. | Restore content or 301 redirect to relevant page. |
| 410 Gone | Client Error | Explicitly tells search engines to de-index immediately. | Use intentionally for permanently deleted content. |
| 500 Server Error | Server Error | Indicates backend failure; prevents crawling. | Investigate server logs and application code. |
| 503 Service Unavailable | Server Error | Temporary maintenance; search engines may retry. | Ensure Retry-After header is set. |