Free Technical SEO Tool

Robots.txt & Sitemap Checker

Quickly verify crawlability — robots.txt, sitemap.xml, and indexing risk in one check

We fetch robots.txt and sitemap.xml from the domain root. Both files are public — no auth involved.

How it works

No black box. Here's exactly what the Robots.txt & Sitemap Checker checks.

  1. You enter your domain

    We use only the root (protocol and host). Anything after it is ignored.

  2. We fetch /robots.txt

    We verify it exists and returns 200, then check for accidental "Disallow: /" rules and missing Sitemap directives.

  3. We fetch /sitemap.xml

    Or whatever URL is declared in robots.txt. We confirm it parses, count its URLs, and detect sitemap index files.

  4. You get a verdict + fixes

    Severity-coded issues. Each fix is concrete enough to do in 5 minutes. The full sequence of checks is sketched in code below.
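
Those four steps fit in a short script. Here is a minimal sketch in Python using only the standard library; the function name, severity labels, and heuristics are illustrative, not the tool's actual implementation.

    import urllib.error
    import urllib.parse
    import urllib.request
    import xml.etree.ElementTree as ET

    def fetch(url):
        """Fetch a URL; return (status, body), or (None, "") on network error."""
        try:
            with urllib.request.urlopen(url, timeout=10) as resp:
                return resp.status, resp.read().decode("utf-8", errors="replace")
        except urllib.error.HTTPError as e:
            return e.code, ""
        except urllib.error.URLError:
            return None, ""

    def check_crawlability(domain):
        # Step 1: reduce the input to its root (protocol + host).
        parts = urllib.parse.urlsplit(domain if "://" in domain else "https://" + domain)
        root = f"{parts.scheme}://{parts.netloc}"
        issues = []

        # Step 2: fetch /robots.txt and look for the classic footguns.
        # (Naive on purpose: a real checker would respect User-agent grouping.)
        status, robots = fetch(root + "/robots.txt")
        sitemaps = []
        if status != 200:
            issues.append(("warning", f"robots.txt returned {status}, expected 200"))
        else:
            lines = [l.strip() for l in robots.splitlines()]
            if any(l.lower().replace(" ", "") == "disallow:/" for l in lines):
                issues.append(("critical", 'robots.txt contains "Disallow: /"'))
            sitemaps = [l.split(":", 1)[1].strip()
                        for l in lines if l.lower().startswith("sitemap:")]
            if not sitemaps:
                issues.append(("warning", "no Sitemap directive in robots.txt"))

        # Step 3: fetch the declared sitemap, falling back to /sitemap.xml.
        sitemap_url = sitemaps[0] if sitemaps else root + "/sitemap.xml"
        status, body = fetch(sitemap_url)
        if status != 200:
            issues.append(("critical", f"{sitemap_url} returned {status}"))
        else:
            try:
                tree = ET.fromstring(body)
                kind = "index" if tree.tag.endswith("sitemapindex") else "sitemap"
                # Count <loc> entries: URLs, or child sitemaps in an index.
                count = sum(1 for el in tree.iter() if el.tag.endswith("loc"))
                print(f"{kind} with {count} <loc> entries")
            except ET.ParseError:
                issues.append(("critical", f"{sitemap_url} is not valid XML"))

        # Step 4: the verdict.
        for severity, message in issues:
            print(f"[{severity}] {message}")

    check_crawlability("example.com")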

Why this matters

A misconfigured robots.txt or missing sitemap is the most expensive small bug in SEO, because it silently kills your visibility for weeks before anyone notices. The classic horror story: someone copies a staging robots.txt with "Disallow: /" into production. The site disappears from Google. Three weeks pass. By the time anyone catches it, rebuilding the lost rankings takes months.

  • "Disallow: /" or "noindex" left over from a staging environment is the #1 silent SEO killer.
  • A missing or empty sitemap forces crawlers to discover URLs through internal links only — orphan pages never get indexed.
  • Sitemap declarations in robots.txt are still the canonical way to point crawlers at your sitemap, even with Search Console submitted.
  • Sitemap index files matter for sites over ~50,000 URLs — Google rejects single sitemaps over that limit.
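
To make the first bullet concrete, here is the staging leftover next to a harmless production default. The difference is a single character (comments are valid robots.txt syntax):

    # Staging leftover: blocks every crawler from every URL
    User-agent: *
    Disallow: /

    # Empty Disallow: allows everything
    User-agent: *
    Disallow: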

Want the full story across every page?

The Robots.txt & Sitemap Checker checks one URL. CrawlTide audits your whole site, tracks issues over time, watches your AI Visibility weekly, and pushes meta-tag fixes straight to your CMS.

No credit card. Free tier covers a small site end-to-end.

Frequently asked questions

Do I really need a robots.txt?
You don't — without one, crawlers default to "allow everything." But you almost always want one for two reasons: declaring your sitemap location, and blocking specific paths (admin areas, search-result pages, faceted-navigation URLs that produce infinite duplicates).
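
For illustration, a minimal robots.txt that does both jobs; the paths and sitemap URL are placeholders for your own:

    User-agent: *
    Disallow: /admin/
    Disallow: /search
    Disallow: /*?sort=

    Sitemap: https://yourdomain.com/sitemap.xml

The * wildcard in the last Disallow rule is supported by Google and other major crawlers.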
Where should robots.txt and sitemap.xml live?
robots.txt must live at the domain root (yourdomain.com/robots.txt); that is the only place crawlers look for it. sitemap.xml conventionally lives at the root too (yourdomain.com/sitemap.xml), but you can host it elsewhere as long as you declare the location in robots.txt or submit it in Search Console.
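
For example, this single robots.txt line (the path is a placeholder) points crawlers at a sitemap outside the root:

    Sitemap: https://yourdomain.com/assets/sitemaps/main.xml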
How big can my sitemap be?
Single sitemap: max 50,000 URLs and 50 MB uncompressed. Larger sites use a sitemap index file that points to multiple sitemap files (each under the limits). Most CMSes do this automatically.
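
For illustration, a minimal sitemap index file in the standard sitemaps.org format; the file names are placeholders:

    <?xml version="1.0" encoding="UTF-8"?>
    <sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
      <sitemap>
        <loc>https://yourdomain.com/sitemaps/pages-1.xml</loc>
      </sitemap>
      <sitemap>
        <loc>https://yourdomain.com/sitemaps/pages-2.xml</loc>
      </sitemap>
    </sitemapindex>

Each child sitemap must itself stay under the 50,000-URL / 50 MB limits.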
Will this catch noindex tags too?
Not in this tool — those are per-page meta tags. The AI Visibility and SEO Score tools both flag noindex on the page they audit. CrawlTide's full audit flags noindex across your whole site after a crawl.
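
For reference, the per-page signals in question look like this: a robots meta tag in the page's <head>,

    <meta name="robots" content="noindex">

or, for non-HTML resources, the equivalent HTTP response header:

    X-Robots-Tag: noindex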
My sitemap is dynamically generated — will this still work?
Yes, as long as it returns 200 with valid XML. We don't care whether it's a static file or generated on demand by your framework (Next.js sitemap.ts, Rails routes, etc.).
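
To a crawler, a sitemap generated on request is indistinguishable from a static file. A toy sketch in Python with Flask; the route and hard-coded URLs stand in for a real lookup such as a database query:

    from flask import Flask, Response

    app = Flask(__name__)

    # Serve a sitemap built at request time; yourdomain.com is a placeholder.
    @app.route("/sitemap.xml")
    def sitemap():
        urls = ["https://yourdomain.com/", "https://yourdomain.com/pricing"]
        xml = (
            '<?xml version="1.0" encoding="UTF-8"?>'
            '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
            + "".join(f"<url><loc>{u}</loc></url>" for u in urls)
            + "</urlset>"
        )
        return Response(xml, mimetype="application/xml")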
Should I block AI crawlers like GPTBot?
Depends on your goals. If you want to be cited in ChatGPT and similar, allow GPTBot. If you don't want OpenAI training on your content, disallow it. Most SaaS marketing sites benefit from being visible. The full CrawlTide product can audit per-bot rules across all 30+ AI crawlers.
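
If you decide to opt out, the rule is two lines (GPTBot is OpenAI's documented crawler token; other AI crawlers use their own tokens):

    User-agent: GPTBot
    Disallow: /

No explicit rule is needed for the opposite case: any crawler you don't mention falls through to your User-agent: * group, and with no matching rules everything is allowed.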