FAQ

Frequently Asked Questions

Answers to the most common questions about dislike404.com.

Getting Started

What is dislike404.com and what is it for?

dislike404.com automatically monitors your website for broken links, 404 errors and connection issues.
Once a scan is complete, you'll get an email summary if anything went wrong.

How do I add my first website?

Sign up for a free account, add your website URL and you're done.
Our crawler will pick it up automatically and start scanning on the schedule you set.

What does the service cost?

Nothing. dislike404.com is completely free - no hidden costs, no free trial that expires, no paywalls.

How is dislike404.com funded if the service is free?

dislike404.com is a one-person hobby project and currently runs entirely out of my own pocket.
At some point I may add some simple ads to help cover the server costs - but that's it

How can I delete my account?

You can delete your account at any time via your settings page.
All your data will be deleted immediately and permanently.
To delete your account, please follow these steps:

  • Log in to your account

  • Go to the settings page

  • Click on the "Delete Account" button

  • Confirm the deletion

  • Your account will be deleted immediately

Please note that this action is irreversible, and all your data will be permanently deleted.

Scanning & Errors

Why isn't my website being scanned?

There are a few things worth checking:

  • Does your robots.txt file exclude our bot?

  • Are you using a CDN like Cloudflare that might be blocking the crawler?

  • Have you set up a blacklist entry that might be too broad?

  • And is your website actually reachable and the URL correct?

  • You can also check the "Not Scanned" tab in your scan details to see why specific URLs were skipped

If none of that helps, feel free to contact me.

Why are there many 'Connection Failed' errors when scanning my site?

The most common reason is that our bot is being blocked by a security measure or CDN like Cloudflare.
Since our bot identifies itself, you may be able to whitelist it by name or by our IP address 5.9.153.60

If you need help with that, just get in touch.

Why are some external websites reported with 'method not allowed' or 'connection failed'?

This can have several reasons:

  • It might be a temporary issue - in that case it will likely resolve itself on the next scan.

  • It could also be a real error on the external website

  • Some services like Amazon or Instagram simply block bots by default.
    Since the dislike404.com bot identifies itself as such, there's nothing I can do about that.
    You can add these URLs to your personal blacklist to keep them out of your scan results.

Why do some URLs appear as 'global blacklist' in the 'Not Scanned' results?

We maintain a global blacklist of sites that can't or don't want to be scanned.
We also respect each site's robots.txt.

If you think a URL has been blacklisted by mistake or you'd like to suggest an addition, feel free to contact me.

Does the bot follow HTTP 301, 302, 307 and 308 redirects?

Yes – the bot follows all redirects, both internal and external.

If the redirect target is an internal page, it will also be scanned for errors and its links will be checked as usual.
If the redirect ends up on an external page, we follow it to verify the target is reachable - but we don't crawl any further from there.

Both URLs will appear in your scan results – the original URL as HTTP 30x and the target URL with its actual status code.

How does the website blacklist work?

With the blacklist, you can exclude individual sections of your website or external websites from being scanned.

This can be useful if, for example, the external website blocks bots or if you do not want certain areas of your site to be checked.
You can access the blacklist via the blacklist button in your website control panel.
You can exclude exact URLs or use the wildcard character *, which matches zero or more characters.
A few examples:

  • Exact URL: https://example.com/dont_scan_me.html

  • Both http and https: http*://example.com/dont_scan_me.html

  • Entire path: http*://example.com/dont_scan_me/*

  • Subdomain: http*://dontscan.example.com*

  • Entire domain:*example.com*

Limits & Verification

What are the current limits for scanning?

Each scan has two limits: internal pages (any page on your website with actual content) and total requests (every asset we check along the way - images, scripts, stylesheets and more).

  • Unverified: up to 200 internal pages and 1,000 total requests

  • Verified: up to 50,000 internal pages and 500,000 total requests

Need higher limits? Just send me a message.

What does "verification" mean?

Verification proves that you actually own the website you're monitoring.
Once verified, you unlock higher scan limits and additional options.
Verification is free and takes just a few minutes.

How do I verify my website?

Verification works by uploading a small text file to your web server. The exact steps and your personal verification code are shown directly in your dashboard once you add a website.

The file needs to be accessible at the root of your domain - for example yourdomain.com/dislike404-verification.txt.
That means you need to be able to upload files to your web server.

WordPress users
Upload the verification file to the root directory of your WordPress installation. The same folder where wp-config.php is located.
You can do this via FTP, SFTP or your hosting provider's file manager.
Do not put the file inside wp-content or any other subfolder.

Still have questions?