File: 1446334933252.png (347.64 KB, 566x800, c28d40feaff4d394c7d519bcbc….png)

How should we stop the spam? Seisatsu ## Owner 10/31/15 (Sat) 23:42:13 No.2484

So, upon further investigation, it turns out that the spam bots are using countermeasures to thwart the less intensive anti-spam measures we could use. Unfortunately this means that there is no easy solution. Currently I can see three possible remedies.

1) Use captchas. (Once-a-day captchas are a feature that is specific to 8chan and I don't know if we can implement them.)

2) Use an automated spam-detection service like Akismet. The con here is that we'd have to share every post's content with Akismet's servers in order to check it for spam, including the user's IP address.

3) Hire a ridiculous number of Janitors and try to manage them all.

Word/phrase filters are impossible because the bots are too smart. Any comments or alternative suggestions are welcome.

Anonymous 10/31/15 (Sat) 23:54:27 No.2485

>>2484
Janitors or captcha, lots of people can get spooked by sharing their IP.

Seisatsu ## Owner 11/01/15 (Sun) 00:06:12 No.2486

I'm going to try one more thing real quick and see if it helps.

Anonymous 11/01/15 (Sun) 00:14:13 No.2487

>>2486
what are you testing?

Seisatsu ## Owner 11/01/15 (Sun) 00:17:39 No.2488

>>2487
I'm going to try using another DNS blocklist service. It checks only the poster's IP (without sharing post contents) against a list of known bot IPs. We were already using two blocklists, but neither of them was catching these particular bots. I don't have a good feeling about it working, though.
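
For readers unfamiliar with DNS blocklists, the lookup described above works roughly like this: the poster's IPv4 octets are reversed, the blocklist's zone is appended, and the result is resolved as a hostname. If it resolves, the IP is listed. This is a minimal Python sketch, not the site's actual code; the zone `dnsbl.example.org` and both function names are placeholders made up for illustration.

```python
import ipaddress
import socket


def dnsbl_query_name(ip: str, zone: str) -> str:
    """Build the DNS name to look up: reversed IPv4 octets plus the blocklist zone."""
    addr = ipaddress.IPv4Address(ip)  # raises ValueError on malformed input
    reversed_octets = ".".join(reversed(str(addr).split(".")))
    return f"{reversed_octets}.{zone}"


def is_listed(ip: str, zone: str) -> bool:
    """Return True if the (hypothetical) blocklist zone has an A record for this IP."""
    try:
        socket.gethostbyname(dnsbl_query_name(ip, zone))
        return True
    except socket.gaierror:
        # Name did not resolve, so the IP is not on this list.
        return False


# Only the poster's IP leaves the server, never the post contents.
print(dnsbl_query_name("192.0.2.1", "dnsbl.example.org"))
```

Only the address is sent to the blocklist, which is why this approach avoids the content-sharing concern raised about Akismet.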

Anonymous 11/01/15 (Sun) 00:21:23 No.2489

>>2488
yeah, favouring toward captcha now, but still need a janitor all the time

Seisatsu ## Owner 11/01/15 (Sun) 00:22:48 No.2490

>>2489
I'm worried that enabling a captcha will immediately and permanently kill the entire site.

Anonymous 11/01/15 (Sun) 00:24:19 No.2491

>>2490
How would that happen?

Seisatsu ## Owner 11/01/15 (Sun) 00:28:05 No.2492

>>2491
People would be too lazy to fill out the captcha, throw a bitch fit / complain, or possibly lose their entire post if they accidentally fill it out wrong. Maybe the place would be nicer (albeit slower) if all those people left though.

Anonymous 11/01/15 (Sun) 00:30:41 No.2493

>>2492
Sounds like very minor problems to me, and they'd complain a lot more about Akismet. But what are the problems with hiring a bunch of janitors? I can guess a few, but I don't know much about this stuff.

Seisatsu ## Owner 11/01/15 (Sun) 00:32:34 No.2494

>>2493
The biggest problem is there's no way to record the contents of posts that have been deleted. If a corrupt Janitor is deleting posts they dislike and saying they were spam posts, there is literally no way to prove them wrong. That's why I try to be careful about who I hire.

Anonymous 11/01/15 (Sun) 00:33:02 No.2495

File: 1446337982804.gif (354.6 KB, 480x359, 1368255580169.gif)

Me for janitor.

Anonymous 11/01/15 (Sun) 00:34:41 No.2496

>>2494
Yeah, that was one of my guesses, but how many people do you think you could trust with being a janitor? Gonna be problematic if you don't know a lot of people

Seisatsu ## Owner 11/01/15 (Sun) 00:37:21 No.2497

>>2496
I could try just picking users I trust instead of holding applications.

Anonymous 11/01/15 (Sun) 00:42:44 No.2498

File: 1446338563911.jpg (31.12 KB, 400x410, image.jpg)

>>2497
I-I have a lot of free time and I'd be glad to help, guessing you wouldn't want to hire me this quick t-though

Seisatsu ## Owner 11/01/15 (Sun) 00:45:38 No.2500

(Sorry if my posts keep changing, I update them as I'm doing quick fact checks to make sure I'm not talking out of my ass or getting something wrong.)

Here's another thing, I prefer to only hire IRC users, because it allows me to communicate with the staff in real time. Sometimes people want to be staff but refuse to participate in the IRC community, which is an instant disqualification.

In fact, during the last Janitor applications, anyone who wasn't already an IRC user or who I hadn't seen on IRC frequently was disqualified, since I found it unlikely that I would be able to keep in contact with them on a regular basis.

Anonymous 11/01/15 (Sun) 00:45:38 No.2501

>>2498
You're damn right you stuttering loser weeb.

Anonymous 11/01/15 (Sun) 00:50:16 No.2502

File: 1446339016302.jpg (95.63 KB, 1280x720, image.jpg)

>>2500
I'm in the IRC sometimes and going on more frequently now
>>2501
hey no bully

Seisatsu ## Owner 11/01/15 (Sun) 01:01:06 No.2503

This is not a Janitor application thread, let's please get back on topic.

Anonymous 11/01/15 (Sun) 01:06:22 No.2504

>>2503
alright, sorry. I'd say go for the captcha, but try to find a way so they don't have to re-type their post if they get the captcha wrong. Pretty sure there aren't a lot of people who'd give up on posting just because they have to type a blurry word.

Seisatsu ## Owner 11/01/15 (Sun) 01:20:08 No.2505

>>2504
Yeah, you're probably right. I'll try it if the new blocklist doesn't work.

Anonymous 11/01/15 (Sun) 01:24:32 No.2507

File: 1446341071978.jpg (23.58 KB, 480x360, scioli2.jpg)

>Word/phrase filters are impossible because the bots are too smart.
I'm pretty sure they always post almost the same links, albeit with a different prefix/suffix each time. Do we have some kind of register? It'd be worth giving it a shot, although I'm gonna assume the links are really different and this isn't gonna work. In that case, I'd say that captchas are the best option, while also trying to hire new janitors. So far nobody has really complained about this proposal, plus we have already tried this once and it was well received by the community.
Now, it didn't work last time, are we sure it's gonna do something?

About the blocklist, well, we already had two, one more won't harm anyone.

Anonymous 11/01/15 (Sun) 18:04:26 No.2508

File: 1446401066606.jpg (84.51 KB, 650x780, 1403307186832.jpg)

>1) Use captchas. (Once-a-day captchas are a feature that is specific to 8chan and I don't know if we can implement them.)
As long as it's not google captcha I'm fine w/ this

Anonymous 11/01/15 (Sun) 19:34:28 No.2509

If all else fails, then yeah i'd be okay with captchas too. just not something annoying. The ones that are just numbers on a house or something are simple and easy.

Seisatsu ## Owner 11/01/15 (Sun) 20:25:57 No.2510

>>2508
If we go the captcha route it might have to be Google captcha. Some bots are capable of solving simpler captchas these days. Of course we could test with another captcha to start with and see if it keeps them out.

Seisatsu ## Owner 11/01/15 (Sun) 22:03:23 No.2511

>>2510
Oops, ReCaptcha is the only captcha that Vichan supports apparently :/

!BkkDbCR6P6 11/01/15 (Sun) 23:07:31 No.2512

Why are Google captchas bad?

Seisatsu ## Owner 11/01/15 (Sun) 23:26:45 No.2513

>>2512
Good question. Can anyone give me a reason why I shouldn't enable ReCaptcha?

Anonymous 11/01/15 (Sun) 23:50:36 No.2514

File: 1446421836771.jpg (43.86 KB, 600x336, image.jpg)

>>2513
Nope.

Anonymous 11/02/15 (Mon) 16:15:23 No.2515

>>2510
>>2511
>>2512
>>2513
Because google is a monster, I don't want to be datamined when I post on imageboards. Google captcha was one of the reasons why I left 4chan

Seisatsu ## Owner 11/02/15 (Mon) 16:48:09 No.2516

>>2515
The alternative is literally infinite CP spam.

Anonymous 11/02/15 (Mon) 17:11:53 No.2517

>>2515
>Google captcha was one of the reasons why I left 4chan
Go home Stallman, you're drunk.

Anonymous 11/02/15 (Mon) 19:31:52 No.2518

>>2494
>The biggest problem is there's no way to record the contents of posts that have been deleted. If a corrupt Janitor is deleting posts they dislike and saying they were spam posts, there is literally no way to prove them wrong.
>there's no way to record the contents of posts that have been deleted.
This sounds like a desirable feature, some sort of logging system that keeps a temporary hidden store of deleted posts that management can check to keep jannies accountable for what they delete.
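
A temporary hidden store of deleted posts, as suggested above, could look something like the following. This is a minimal Python sketch under assumptions: nothing here is an existing vichan feature, and the class names, fields, and the one-week retention period are all invented for illustration.

```python
import time
from dataclasses import dataclass, field


@dataclass
class DeletedPost:
    post_no: int
    body: str
    deleted_by: str   # janitor who removed the post
    reason: str       # what the janitor claimed, e.g. "spam"
    deleted_at: float


@dataclass
class DeletionLog:
    # Hypothetical retention period: entries older than this are purged.
    retention_seconds: float = 7 * 24 * 3600
    entries: dict = field(default_factory=dict)

    def record(self, post_no, body, deleted_by, reason, now=None):
        """Keep a hidden copy of a post at the moment it is deleted."""
        now = time.time() if now is None else now
        self.entries[post_no] = DeletedPost(post_no, body, deleted_by, reason, now)

    def by_janitor(self, name):
        """Let management review everything one janitor has deleted."""
        return [e for e in self.entries.values() if e.deleted_by == name]

    def purge_expired(self, now=None):
        """Drop entries past the retention period; return how many were dropped."""
        now = time.time() if now is None else now
        expired = [n for n, e in self.entries.items()
                   if now - e.deleted_at > self.retention_seconds]
        for n in expired:
            del self.entries[n]
        return len(expired)
```

Keeping the store temporary limits how long removed material sits on the server while still giving the owner a window to audit deletions.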

Anonymous 11/02/15 (Mon) 19:55:36 No.2519

>>2515
I have to agree with this anon, as someone who's quite conscious about privacy, Google is a surveillance data-gathering behemoth to be avoided at all costs.

>>2511
Have you tried contacting czaks to see if he can add 8chan's captcha system to vichan or something? I think he was involved in its development ( related: https://github.com/vichan-devel/vichan/issues/140#issuecomment-94216050 )
You can find him on irc #vichan @irc.6irc.net ( https://webchat.6irc.net/?channels=vichan )

It seems to work at stopping spambots on 8chan, although as with all anti-spam measures it's a constant arms race, so the spammers may one day break it with OCR or something. Other new imageboards like Infinity Next are developing their own captcha systems ( https://github.com/infinity-next/infinity-next https://infinitydev.org/ ). iirc czaks and a bunch of other imageboard owners and developers got together half a year or so ago to discuss the development of future imageboards. I'm not sure what's been going on with it now, but the channel is #metachan @irc.rizon.net and you can read the logs on http://carrier.6irc.net/metachan/

Anonymous 11/02/15 (Mon) 20:01:07 No.2520

File: 1446494464776.jpg (14.58 KB, 336x365, nano captcha.jpg)

>>2516
I'm sorry to say this but I'm definitely not going to post here if you enable google captcha. I'm not trying to be a douche or say something along the lines of "hurr I'm leaving if u don't listen to me". Also it's not like I contributed much to this community, I just saw that you were affiliated with lainchan and I decided to lurk here.

Seisatsu ## Owner 11/02/15 (Mon) 20:29:31 No.2521

>>2519
I'm in contact with czaks, but he's not so involved with vichan anymore and is trying to find a new maintainer. It's worth asking I guess.

>>2520
I understand your concern, so I'll try to find another solution first. At least the ghost thread bug is fixed so the CP threads won't stick around after we delete them. It makes it less obnoxious in the meantime while I come up with something.

Anonymous 11/02/15 (Mon) 20:55:20 No.2522

>>2521
I'll stay tuned and lurk around in the meantime

Seisatsu ## Owner 11/02/15 (Mon) 20:56:55 No.2523

For now I'm going to study the CP spam and try to filter some common phrases which appear more often. At least it should reduce the volume of spam.

Anonymous 11/02/15 (Mon) 22:25:55 No.2524

ITT: freetards complaining about privacy over anonymous posts about laziness and shitposting.

Anonymous 11/02/15 (Mon) 23:52:00 No.2525

Plz2 explain how captchas enable datamining.

Beyond that, I toss in a vote for more janitors with a tempzone outside of the public eye that only a core-tier (or just site admin) can view to keep an eye on the janitors and reactivate posts that didn't actually need to be removed.

(Not saying I favor captcha, I just dislike CP spam as much as most people. I actually HATE captchas because I just wanna post dammit >_<)

Anonymous 11/03/15 (Tue) 05:13:45 No.2526

File: 1446527625151.png (1.24 MB, 958x916, 1446188947815.png)

>>2523
Make sure you 'study' those pictures long and hard!

Anonymous 11/03/15 (Tue) 08:46:15 No.2527

>>2526
>long and hard
heh

Lumanare!G34Os4lEpE 11/03/15 (Tue) 13:43:12 No.2528

>>2523
So, what would this mean for Janitors?

Anonymous 11/03/15 (Tue) 17:15:22 No.2530

File: 1446570922093.png (635.47 KB, 1000x750, 1441132396772.png)

>>2528
It means
>u nigs do ur fucking job I have to literally design a word filter because janitors are ineffective as fuck

Anonymous 11/03/15 (Tue) 23:07:13 No.2531

File: 1446592033493.jpg (48.54 KB, 800x535, 3792799-robber-with-laptop.jpg)

I have been lurking this image board for a small amount of time mainly on /n/. Can someone fill me in on what's going on? Reading through this thread it seems we're being spammed with CP and such. Does anyone know who is doing it and what motives they have?

Anonymous 11/03/15 (Tue) 23:08:45 No.2532

File: 1446592125909.jpg (125.56 KB, 644x582, 1370650976120.jpg)

The blocklist

It's not working

Anonymous 11/03/15 (Tue) 23:14:12 No.2533

>>2531
>Does anyone know who is doing it
Nobody in particular, if that's what you're asking. At least that's what I believe. It could be one of the Spanish forum guys still mad or something, for all we know.

>and what motives they have?
Spam for the sake of spam, to catch people interested in the material and try to get dem shekels.

>>2532
I personally believe they're not bots, since I saw this same shit in 8chan. That's also why I think captchas ain't gonna do, since they didn't work last time.
Sei, do you think it'd be possible to create a global "minimal time" between each post users make? Since, from what I recall, the "flood" detector only works if you're trying to post with a similar body in the same board, but not in different ones.

Seisatsu ## Owner 11/03/15 (Tue) 23:25:03 No.2534

>>2533
We already have a minimum time between posts. I think they're just waiting it out.
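
For context, a site-wide minimum time between posts (keyed on IP rather than on board or post body) can be sketched as below. This is an illustrative Python sketch only, not vichan's flood detector; the class name and the 30-second interval in the usage lines are assumptions.

```python
class PostThrottle:
    """Reject a post if the same IP posted anywhere on the site too recently."""

    def __init__(self, min_interval: float):
        self.min_interval = min_interval
        self.last_post = {}  # ip -> timestamp of last accepted post

    def allow(self, ip: str, now: float) -> bool:
        last = self.last_post.get(ip)
        if last is not None and now - last < self.min_interval:
            return False  # too soon; rejected attempts do not reset the timer
        self.last_post[ip] = now
        return True


throttle = PostThrottle(min_interval=30)
print(throttle.allow("198.51.100.7", now=0))
print(throttle.allow("198.51.100.7", now=10))
```

A limit like this only caps the posting rate per address. A poster who waits out the interval, or who switches addresses, is not stopped by it.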

Anonymous 11/03/15 (Tue) 23:25:07 No.2535

File: 1446593107375.gif (266.11 KB, 500x281, LOL.gif)

mfw everything fails

Anonymous 11/03/15 (Tue) 23:29:04 No.2536

>>2534
I have an idea. It's stupid, but it could work. Set up bans so that the person doesn't know they are banned: allow them to post, but hide the banned user's posts from everyone else, so that they think they are spamming when in reality the only ones able to see the posts are the admins and the poster.
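
The hidden-ban idea above boils down to a visibility filter applied when a thread is rendered. A minimal Python sketch, assuming a made-up `Post` record and a made-up `visible_posts` helper (neither comes from vichan):

```python
from dataclasses import dataclass


@dataclass
class Post:
    no: int
    author_ip: str
    body: str
    shadowbanned: bool = False


def visible_posts(posts, viewer_ip, viewer_is_staff=False):
    """Hide shadowbanned posts from everyone except staff and the original poster."""
    return [
        p for p in posts
        if not p.shadowbanned or viewer_is_staff or p.author_ip == viewer_ip
    ]


thread = [
    Post(1, "203.0.113.5", "normal post"),
    Post(2, "203.0.113.9", "spam link", shadowbanned=True),
]
print([p.no for p in visible_posts(thread, viewer_ip="203.0.113.9")])
```

Because the filter keys on the poster's IP, it stops fooling the poster as soon as they view the thread from a different address.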

maidnaut !!JC9J1fjx4. 11/04/15 (Wed) 00:32:37 No.2538

File: 1446597157530.jpg (106.57 KB, 932x651, happy-anime-reaction-gif-9.jpg)

We've come up with a solution, but it'll take some time to implement because we have to edit vichan's core. Banning image hashes is something that was only partially built into tinyboard in a usable capacity and unfortunately vichan hasn't expanded it any.

It's not a permanent solution, but since they (mostly) post the same image it'll slow them down until they cycle in a new one. Word filtering and ip banning don't seem to work either so this feels like the best option right now.

We'll keep you posted, but in the meantime just keep reporting posts like usual!
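
As a rough illustration of the image hash banning described above: each upload is hashed and compared against a set of banned digests. This Python sketch is an assumption-laden stand-in, not the planned vichan modification (which would be PHP); the class name and the choice of SHA-256 are mine.

```python
import hashlib


class ImageHashBanlist:
    """Reject uploads whose exact bytes match a previously banned file."""

    def __init__(self):
        self.banned = set()

    @staticmethod
    def digest(data: bytes) -> str:
        return hashlib.sha256(data).hexdigest()

    def ban(self, data: bytes) -> str:
        """Add a file's digest to the banlist and return it."""
        h = self.digest(data)
        self.banned.add(h)
        return h

    def is_banned(self, data: bytes) -> bool:
        return self.digest(data) in self.banned


banlist = ImageHashBanlist()
banlist.ban(b"bytes of a reported image")
print(banlist.is_banned(b"bytes of a reported image"))
```

An exact cryptographic hash only matches byte-identical files, so it stops working once the spammer changes the file at all, which fits the "not a permanent solution" caveat above.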

maidnaut ## Mod 11/04/15 (Wed) 00:38:49 No.2539

oops i forgot how to use my capcode

Seisatsu !!4HMnNQSfnQ 11/04/15 (Wed) 00:53:33 No.2540

>>2536
I had that same idea a little while ago because it would be funny to punk the spammer like that, but it wouldn't work because they keep cycling between IP addresses whether we ban them or not.

Seisatsu ## Owner 11/04/15 (Wed) 00:54:04 No.2541

>>2540
I also forgot how to use my capcode.

Anonymous 11/04/15 (Wed) 01:56:19 No.2542

>>2541
Perhaps start banning VPNs, proxies and Tor nodes like 4chan does atm

Seisatsu ## Owner 11/04/15 (Wed) 02:56:54 No.2543

>>2542
We've done that since the beginning.

Anonymous 11/04/15 (Wed) 08:55:22 No.2544

>>2543
Any solution found yet?

Anonymous 11/04/15 (Wed) 17:48:02 No.2545

>>2538
If you guys manage to come up with a way to actually do this, please push it through to the vichan git and don't sit on it. A lot of vichan imageboards have a terrible problem with spam and it would help a lot. Or at least make it publicly available. I run a French chan and we get spammed to all hell with CP, and even though I've banned entire countries it generally does not help.

maidnaut ## Mod 11/05/15 (Thu) 23:14:05 No.2550

>>2538
I literally just >>2538

>>2544
It won't be that big of a modification. If you know PHP and can find where the post filters are in vichan, you can pretty easily see where the idea for this is going.

Anonymous 11/14/15 (Sat) 11:50:37 No.2557

File: 1447501837008.png (1000.13 KB, 628x788, 1403478160117.png)

>super fap

Seisatsu ## Owner 11/17/15 (Tue) 08:02:37 No.2566

File: 1447747357604.jpg (49.15 KB, 640x360, Hacker1.jpg)

We now have someone working on advanced countermeasures.

Anonymous 11/17/15 (Tue) 11:38:08 No.2567

File: 1447760288055.jpg (5.85 KB, 145x145, 1369429307271.jpg)

The advanced countermeasures,

it does nothing.

Anonymous 11/17/15 (Tue) 12:33:28 No.2568

File: 1447763608684.jpg (30.37 KB, 369x292, 1447501741369.jpg)

Just shove captchas here I guess.

Anonymous 11/17/15 (Tue) 15:08:22 No.2569

File: 1447772902906.jpg (104.97 KB, 608x430, 1441762739878.jpg)

>>2566
>advanced countermeasures.
Go Stallman, go!

Anonymous 11/17/15 (Tue) 19:10:52 No.2570

File: 1447787451936.gif (1018.02 KB, 317x218, STAFF.gif)

Don't worry, Ubuu staff is preparing advanced countermeasures

Seisatsu ## Owner 11/17/15 (Tue) 23:17:20 No.2571

>>2568
Captchas don't work against humans. The spammer is confirmed human.

I'm not going to describe here the countermeasures we're developing because we don't know if the spammer is watching our site. Last time we tried to stop the spam they adjusted their bots within the day to defeat our efforts.

Seisatsu ## Owner 12/17/15 (Thu) 03:10:52 No.2641

Hmm. As a test, I could try putting the site behind Cloudflare. Maybe one of their countermeasures will catch the bot.

y/n?

Anonymous 01/03/16 (Sun) 17:15:45 No.2649

File: 1451841345702.webm (2.72 MB, 845x480, Zetsubou Ramen.webm)

>>2641
y

The alternative is to get more janitors.
I'm tired of reporting CP threads that have been on the front page for hours.

Anonymous 01/03/16 (Sun) 17:59:32 No.2650

More janitors. I logged on at school once and there was a bunch of disgusting spam so I was like NOPE

Seisatsu ## Owner 01/05/16 (Tue) 02:30:54 No.2651

Cloudflare is activated. Let's hope this works.

Anonymous 01/11/16 (Mon) 21:01:56 No.2653

File: 1452546116271.jpg (23.35 KB, 478x350, 1449447484093.jpg)

Didn't work, get competent janitors ffs

Seisatsu ## Owner 01/14/16 (Thu) 23:59:02 No.2654

We will occasionally keep trying new methods of thwarting the CP spam. Haven't given up yet.

Anonymous 01/15/16 (Fri) 23:23:32 No.2657

>>2654
Thank you very much for the bugfixes.

Anonymous 02/18/16 (Thu) 07:04:33 No.2826

Have you considered using heuristics like the ones Mozilla Thunderbird's disturbingly effective junk email filter uses? It is "trained" by marking messages as junk or not junk, and after a sufficient number of messages it begins to gain a pretty high degree of accuracy. It won't stop 100% of the crap, but I would be surprised if it didn't cut down significantly on the spam.

You could also consider using shadowban-like tactics to make it harder for the spammers to know that their posts failed. If a post matches a junk heuristic check, let it "post successfully" for that particular IP address and show in the thread as usual, but place it in a moderator queue before allowing it to show site-wide (this also gives the mods the chance to catch false positives and further refine the heuristics).
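
To show what "training" such a filter means in practice, here is a small naive Bayes text classifier in Python. It is a simplified sketch of the general technique, not Thunderbird's implementation; the class name, the tokenizer, and the add-one smoothing are my own choices, and a real deployment would need far more training data than the toy usage below.

```python
import math
import re
from collections import Counter


class NaiveBayesFilter:
    def __init__(self):
        self.word_counts = {"spam": Counter(), "ham": Counter()}
        self.doc_counts = {"spam": 0, "ham": 0}

    @staticmethod
    def tokenize(text):
        return re.findall(r"[a-z0-9']+", text.lower())

    def train(self, text, label):
        """label is 'spam' or 'ham', as marked by a moderator."""
        self.doc_counts[label] += 1
        self.word_counts[label].update(self.tokenize(text))

    def _log_score(self, tokens, label, vocab_size):
        total_docs = sum(self.doc_counts.values())
        # Log prior with add-one smoothing over the two classes.
        score = math.log((self.doc_counts[label] + 1) / (total_docs + 2))
        total_words = sum(self.word_counts[label].values())
        for tok in tokens:
            # Add-one smoothing so unseen words do not zero out the score.
            count = self.word_counts[label][tok]
            score += math.log((count + 1) / (total_words + vocab_size))
        return score

    def is_spam(self, text):
        tokens = self.tokenize(text)
        vocab = set(self.word_counts["spam"]) | set(self.word_counts["ham"])
        size = len(vocab) or 1
        return (self._log_score(tokens, "spam", size)
                > self._log_score(tokens, "ham", size))


f = NaiveBayesFilter()
f.train("cheap pills click link", "spam")
f.train("what anime are you watching", "ham")
print(f.is_spam("click the link"))
```

Posts flagged by `is_spam` could then be routed to the moderator queue described above instead of being rejected outright, so false positives get corrected and fed back in as training data.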

Anonymous 02/18/16 (Thu) 07:16:50 No.2827

>>2826
To save you the time of finding it, the source (C++) is at http://hg.mozilla.org/comm-central/file/tip/mailnews/extensions/bayesian-spam-filter/src and you'll probably be interested in the info at https://en.wikipedia.org/wiki/Naive_Bayes_spam_filtering

I also just had the idea that you could make messages not post immediately even if not shadowbanned, but rather appear after an unspecified delay longer than just a few seconds. This would make it harder for the spammer to identify a shadowban condition by comparing the loaded page from a different IP address than the posting one.

I'd also suggest detecting open proxies and Tor exit nodes and possibly banning the use of them if that's where a good chunk of the spam is originating.

Booger-chan 02/27/16 (Sat) 14:37:07 No.2848

just a small comment, the mass reduction of the number of boards made it waaaaaay easier to clean up a wave of spam because they have fewer places to post it : D
but that solves nothing

Anonymous 02/27/16 (Sat) 15:20:51 No.2850

>>2848
We haven't had spam in a good while. Or at least I didn't notice, which would be weird since I'm 24/7 here.

Booger-chan 02/27/16 (Sat) 15:39:20 No.2851

>>2850
I just cleaned a wave and I haven't been here in a few months so I've got no idea how often it's been. Just noticed that it was a lot easier to clean than it used to be

Anonymous 02/27/16 (Sat) 16:05:44 No.2852

File: 1456589144000.jpg (54.38 KB, 480x360, smug37.jpg)

>>2850
>We haven't had spam in a good while.
See Sei?

Booger-chan 02/27/16 (Sat) 16:35:45 No.2853

>>2852
spam came back because i decided to go give gardening advice in /hikki at 12AM on a thursday
look what i've done

Seisatsu ## Owner 02/27/16 (Sat) 20:27:12 No.2854

Yeah the spam still happens several times a week, up to once a day. But, it hasn't been a problem since Jove made the IRC bot. Every time someone makes a post on the boards, the bot gives us a summary right away in the moderation channel. The spam posts are pretty obvious, and we have someone watching just about all the time, so we usually catch and stop new waves in their tracks in a couple minutes or less nowadays before anyone is likely to see them. For the most part it's put the issue to rest, though it would be great if the spambot would go away for good.
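
The summary step of a bot like the one described above could be as simple as collapsing each new post into one line for the moderation channel. This Python sketch covers only the formatting; it is not Jove's bot, the function name and message layout are invented, and the IRC connection itself is left out.

```python
def summarize_post(board, post_no, name, body, max_len=80):
    """Collapse a post into a single line short enough for an IRC message."""
    one_line = " ".join(body.split())  # flatten newlines and repeated spaces
    if len(one_line) > max_len:
        one_line = one_line[: max_len - 3] + "..."
    return f"[/{board}/ No.{post_no}] {name}: {one_line}"


print(summarize_post("n", 2484, "Anonymous", "first line\nsecond line"))
```

Truncating the body keeps the channel readable while still showing enough text for staff to spot an obvious spam post at a glance.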

Anonymous 03/23/16 (Wed) 00:20:28 No.2901

>>2854
As a simpler alternative to an IRC bot (I spoke to you on IRC about spamming like a week ago), I enabled the RSS theme on my board and installed an RSS client on my PC to show new posts.

Word filters are pretty useless since the bots adjust. Good thing is that they always post a generic message, so I can tell when a post is probably a spambot, click the rss popup and immediately D+B.