Linux 软件免费装

Block AI Crawlers

开发者 lastsplash
更新时间 2024年11月11日 01:37
PHP版本: 7.4 及以上
WordPress版本: 6.7
版权: GPLv2 or later
版权网址: 版权信息

标签

ai robots.txt crawlers chatgpt

下载

1.3.4 1.3.0 1.3.3 1.2.0 1.4.0 1.0.0 1.1.0 1.2.2 1.3.1 1.3.5 1.3.7 1.3.8 1.3.6 1.3.9 1.4.1

详情介绍:

Protect Your Content from AI Scraping This plugin helps you prevent AI crawlers from using your content as training data for their products. By updating your site's robots.txt, it blocks common AI crawlers and scrapers, aiming to protect your content from being used in the training of Large Language Models (LLMs). Features Blocks AI Crawlers Includes: Experimental Meta Tags The plugin adds the "noai, noimageai" directive to your site's meta tags, instructing AI bots not to use your content in their datasets. Please note that these tags are experimental and have not been standardized. Installation
  1. Download the plugin zip file.
  2. Go to your WordPress admin panel.
  3. Navigate to Plugins > Add New > Upload Plugin.
  4. Choose the zip file and click "Install Now."
  5. Activate the plugin.
Usage After activation, the plugin will automatically update your robots.txt and add the necessary meta tags. No further configuration is required, but you can check the settings page for a full list of blocked crawlers. Limitations While this plugin aims to block specified crawlers, it cannot guarantee complete protection against all forms of scraping, as some bots may disregard robots.txt directives. Support For questions or support, please post on the forums or on GitHub.

安装:

  1. Activate the plugin through the 'Plugins' menu in WordPress
  2. Once installed the plugin is automatically activated. There are no user configured settings
  3. You can view more about what crawlers are being blocked at "Settings > Block AI Crawlers"

屏幕截图:

  • Plugin page showing which crawlers are blocked

常见问题:

Will this remove my site from existing data sets?

Unfortunately, no. However, it does tell bots that your site shouldn't be used for future datasets.

How does this work?

The plugin adds directives to the robots.txt file to tell AI crawlers that they shouldn't index your site. It also adds the noai meta tag to your site's header to do the same.

How often is this updated?

I try to keep up with new crawlers and update the block list regularly.

Can I suggest crawlers for blocking?

Yes! please share suggestions on the forums or on GitHub.

What if I already have a robots.txt file on my web server?

If you have a physical robots.txt file on your web server, you won't be able to activate this plugin. The plugin only works when using WordPress' built-in virtual robots.txt.

Will this work with other plugins that modify the virtual robots.txt?

It should in theory. It just appends the directives to the robots.txt file.

Will this prevent my site from being indexed by search engines?

No. Search engines follow different robots.txt rules.

更新日志:

1.4.1 = New: Block Turnitinbot 1.4.0 1.3.9 1.3.8 1.3.7 1.3.6 1.3.5 1.3.3 1.3.1 1.3.0 1.2.2 1.2.1 1.2.0 1.1.0 1.0.0 Initial Release.