Quantcast
Channel: phpBB.com
Viewing all articles
Browse latest Browse all 1199

phpBB Discussion • Re: Claudebot attack

$
0
0
Hello,
...(...)...
"helpful, honest, and harmless AI system" as they say it is.
We can skew their training by blocking access to the data.
Just a few notes from my point of view and experiences. The individual case decides, as you know.
So I guess we are swimming against the tide when they test, catch, check the AI ... They come in the middle of the night, some are caught (too many requests) and others may not and so on.
Yesterday there were 89 pages, this morning 66 pages (and unfortunately mostly from China). So I will probably have to "lock out" this country (?) ...

.htaccess: my own examples, tests

Code:

RewriteCond %{HTTP_USER_AGENT} 11A465|Ahrefs|ArchiveBot|Baiduspider|BLEXBot|Bytedance|Bytespider|CCBot|ChatGPT-User|ClaudeBot|Curebot|Daum|Detectify|DotBot|Elisabot|Grapeshot|GPTBot|heritrix|Kinza|LieBaoFast|Linguee|LMY47V|MauiBot|Mb2345Browser|MegaIndex|MetaJobBot|MicroMessenger|MJ12bot|MQQBrowser|PageFreezer|PiplBot|Riddler|Screaming.Frog|Search365bot|SearchBlox|Seekport|SemanticScholarBot|SEOkicks|serpstatbot|Siteimprove.com|Sogou.web.spider|trendictionbot|TurnitinBot|UCBrowser|UptimeRobot|weborama-fetcher|Vagabondo|VelenPublicWebCrawler|YandexBot|YisouSpider [NC]RewriteRule ^.* - [F,L]
I have since expanded the suggestion from P_I
# 28-Mar-24, P_I, another one to banish ->

I have since expanded it: my own examples, tests

Code:

BrowserMatchNoCase "BLEXBot/1.0" bad_botBrowserMatchNoCase "claudebot" bad_botBrowserMatchNoCase "ClaudeBot" bad_botBrowserMatchNoCase "ClaudeBot/1.0" bad_botBrowserMatchNoCase "Elisabot" bad_botBrowserMatchNoCase "UptimeRobot" bad_botBrowserMatchNoCase "UptimeRobot/1.0" bad_botBrowserMatchNoCase "UptimeRobot/2.0" bad_botOrder Deny,AllowDeny from env=bad_bot
And also added the robots.txt: my own examples, tests

Code:

User-agent: BLEXBot/1.0User-agent: CCBotUser-agent: ChatGPT-UserUser-agent: ClaudeBotUser-agent: ClaudeBot/1.0User-agent: ElisabotUser-agent: GPTBotUser-agent: UptimeRobotUser-agent: UptimeRobot/1.0User-agent: UptimeRobot/2.0Disallow: /
====
Well, I'm getting tired and thinking about giving it all up. Almost 20 years of phpBB are perhaps enough ....
Life is precious - best wishes

Statistics: Posted by Talk19Zehn — Fri Apr 26, 2024 1:33 pm



Viewing all articles
Browse latest Browse all 1199

Trending Articles