Crawl Rules
/admin /private
Default access
Site Metadata
Live Preview
Pro Tip: Place the generated robots.txt in the root directory of your website (e.g., fluxtoolkit.com/robots.txt) to control how search engines crawl your site.
Frequently Asked Questions
A robots.txt file is a text file placed in your website root directory that tells search engine crawlers which pages they can and cannot access. It uses the Robots Exclusion Protocol to communicate crawling rules. When a search engine bot visits your site, it reads robots.txt first to determine what content it's allowed to index. This file can block specific pages, directories, or even entire sections of your site from being crawled and indexed.
To block a specific page, add a Disallow directive with the page path. For example, "Disallow: /private-page.html" blocks that specific page. To block an entire directory, use "Disallow: /directory-name/". You can target specific bots using User-agent directives, like "User-agent: Googlebot" followed by your Disallow rules. Test with Google's robots.txt Tester before deploying.
Disallow in robots.txt prevents a bot from crawling a page, but doesn't prevent indexing if the page is linked from elsewhere. The noindex meta tag (or X-Robots-Tag HTTP header) prevents a crawled page from appearing in search results. For complete exclusion, use both: robots.txt to prevent crawling (saves server resources) and noindex to prevent indexing (if somehow crawled).
Robots.txt itself doesn't directly affect rankings, but it impacts how search engines crawl and index your site. A well-configured robots.txt helps crawlers find your important content efficiently, which can indirectly benefit SEO by ensuring key pages get indexed. Poor configuration that blocks important content will hurt your visibility. It doesn't pass "ranking signals" to search engines.
Target specific bots using User-agent directives. List the rules for one bot, then add another User-agent section for a different bot. For example, "User-agent: Googlebot" with its Disallow rules, then "User-agent: *" with different rules for all other bots. Common bot names include Googlebot, Googlebot-Image, Bingbot, and Slurp. List specific rules before wildcard rules.
Place robots.txt in your website root directory—it must be accessible at yourdomain.com/robots.txt. For most websites, upload it to the public_html or www directory. WordPress sites with proper configuration typically have it at the root. Apache servers recognize it automatically; nginx may need configuration. After uploading, test the URL directly in your browser.
If no robots.txt exists, bots will attempt to crawl all accessible content on your site. This isn't necessarily bad—small sites with clean architecture often don't need one. However, robots.txt helps manage crawl budget on larger sites, blocks unnecessary pages from indexing, and provides your sitemap location. Most production websites benefit from having one.
Yes, using "User-agent: *" followed by "Disallow: /" blocks all compliant crawlers from accessing any content. This completely removes your site from search results. Never do this accidentally—it's one of the most common SEO disasters. Always double-check your robots.txt before deploying. If you need to block access, test first with a specific User-agent.
Related Tools
You might also find these utilities helpful for your robots.txt generator workflow.