update ai.robots.txt
add robots.txt and apache-badbots.conf
parent bcb0414008
commit 6afb7dc2b0
5 changed files with 89 additions and 0 deletions
1
apache-badbots.conf
Normal file
@@ -0,0 +1 @@
I2Bot|Ai2Bot\-Dolma|aiHitBot|Amazonbot|anthropic\-ai|Applebot|Applebot\-Extended|Brightbot\ 1\.0|Bytespider|CCBot|ChatGPT\-User|Claude\-Web|ClaudeBot|cohere\-ai|cohere\-training\-data\-crawler|Cotoyogi|Crawlspace|Diffbot|DuckAssistBot|FacebookBot|Factset_spyderbot|FirecrawlAgent|FriendlyCrawler|Google\-Extended|GoogleOther|GoogleOther\-Image|GoogleOther\-Video|GPTBot|iaskspider/2\.0|ICC\-Crawler|ImagesiftBot|img2dataset|imgproxy|ISSCyberRiskCrawler|Kangaroo\ Bot|meta\-externalagent|Meta\-ExternalAgent|meta\-externalfetcher|Meta\-ExternalFetcher|NovaAct|OAI\-SearchBot|omgili|omgilibot|Operator|PanguBot|Perplexity\-User|PerplexityBot|PetalBot|Scrapy|SemrushBot\-OCOB|SemrushBot\-SWA|Sidetrade\ indexer\ bot|TikTokSpider|Timpibot|VelenPublicWebCrawler|Webzio\-Extended|YouBot|AhrefsBot|Baiduspider|Barkrowler|Bingbot|BLEXBot|Bytedance|DotBot|EmailCollector|facebookcatalog|facebookexternalhit|fidget-spinner-bot|Franck the Fediverse Graph Crawler|Googlebot|Livelapbot|Mediapartners-Google|MJ12bot|SemrushBot|SeznamBot|VelenPublicWebCrawler|WebEMailExtrac|YandexBot|YisouSpider|
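The commit does not show how this line is consumed, so the following is only a sketch under the assumption that it is used as a regex alternation searched anywhere in the User-Agent header. The helper names (`build_matcher`, `is_listed_bot`) and the sample User-Agent strings are hypothetical, and only a handful of alternatives are copied from the line above. It also illustrates one thing worth checking: if the trailing `|` is really in the file and not an extraction artifact, it adds an empty branch, and an empty branch matches every string.

```python
import re

# A few alternatives copied from the pattern line above; the real list is longer.
SAMPLE_ALTERNATIVES = r"GPTBot|ClaudeBot|Applebot\-Extended|Brightbot\ 1\.0|iaskspider/2\.0"


def build_matcher(alternation: str) -> re.Pattern:
    """Compile an alternation string, dropping empty branches.

    Assumption: the list contains no escaped pipe (\\|), so a plain split
    on "|" is safe. That holds for the names shown in this commit.
    """
    branches = [branch for branch in alternation.split("|") if branch]
    return re.compile("|".join(branches))


def is_listed_bot(user_agent: str, matcher: re.Pattern) -> bool:
    """Return True if any listed name occurs anywhere in the User-Agent."""
    return matcher.search(user_agent) is not None


# Hypothetical User-Agent strings, for illustration only.
BOT_UA = "Mozilla/5.0 (compatible; GPTBot/1.1; +https://openai.com/gptbot)"
BROWSER_UA = "Mozilla/5.0 (X11; Linux x86_64; rv:128.0) Gecko/20100101 Firefox/128.0"

# Compiled as-is with a trailing "|", the empty branch matches everything.
naive = re.compile(SAMPLE_ALTERNATIVES + "|")
print(naive.search(BROWSER_UA) is not None)

# With empty branches dropped, only listed names match.
matcher = build_matcher(SAMPLE_ALTERNATIVES + "|")
print(is_listed_bot(BOT_UA, matcher))
print(is_listed_bot(BROWSER_UA, matcher))
```

Matching here is case-sensitive; whether the real consumer ignores case depends on configuration that this commit does not show.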