- LibraryThing
- All topics
- Hot topics
- Book discussions
- All discussions
- Your LibraryThing
- Join to start using.
 |
Spam catchingJoin LibraryThing to post. This topic is currently marked as "dormant"—the last message is more than 90 days old. You can revive it by posting a reply. 1 timspaldingEdited: Feb 27, 2010, 1:41am 
We've added a fun and rather unique spam-fighting technique to the site—a cover-based "captcha." Normally, you get it when you create a new account. But you can try it out here: http://www.librarything.com/signup_verify.php Basically, you get a book cover, and you have to name the author of the book. Most of the time it's obvious. When it's not, you can request another cover. It's tolerant of case and punctuation. We "rolled our own" after experiencing continuing irritating with the "reCaptcha" solution. That's the one that looks like this:  It didn't seem to be stopping spammers, and it required a separate registration for all our subdomains. It just never worked for us. The new one will, I hope, stop automatic spamming. (Maybe 1/4 of our spam isn't automated—it's people whacking away at it on keyboard in cheap-labor companies.) Unlike reCaptcha, nobody else uses it. Also, while book covers can be run though a text converter, it's not at all obvious to a computer what part of a book is the title. Humans can tell titles from authors. Not so a computer. Anyway, I think it's clever. And it took very little time to do. Let's cross our fingers... More spam-fighting coming.
Very clever, Tim.
Maybe you can stop them dead in their tracks by also asking for the OCLC number. {:>)
Cool - any non English-language covers there?
The "another cover" link isn't working for me. 5 timspaldingEdited: Feb 27, 2010, 2:01am 
The "another cover" link isn't working for me.
Really? Are you sure? What's your browser? Anyone else?
Ah. A change I just made. Sec.
Fixed 6AndrewBFeb 27, 2010, 2:04am 
That is so cool! :D 7loraxFeb 27, 2010, 2:11am 
I like this. Pure text captchas have reached the point where if they're hard enough to fool good text-detection algorithms, they're hard enough to stump a lot of people, too. (The "rippling flag" ones drive me bananas.) It's distinctive, non-annoying, and hopefully it should do some good.
8r.orrisonEdited: Feb 27, 2010, 3:17am 
I tried the "try it out" link and got blank space where the cover should be. I'm not sure whether to be insulted or honored. The relevant bit from the page is <img src="/bookcaptchaimage.php?isbn=c4b31ce7d95c75ca70d50c19aef08bf1" height="400">. Do you need to find a way to make sure the cover is valid? (Actually, you probably do that already, and maybe just need to add another test case.) (Asking for another cover worked fine.) I failed on /bookcaptchaimage.php?isbn=0d0871f0806eae32d30983b62252da50, presumably because I entered both authors. That might be tricky to filter. Look for another "Author" role listed in Other Authors? (Or the word "and" in the Author field?) Perhaps you could filter images whose author's is gender n/a, since often organizations won't put their name on the cover. E.g. /bookcaptchaimage.php?isbn=621c090c3a61c6809130675a63897a0f ( http://www.librarything.com/work/9337725 by Australian Women's Weekly) Of course, you can just punt these things and let the user click "another cover" -- the vast majority work great, and it appears the chances of having two impossible ones in a row is pretty slim. Very cool feature, and so perfect for LibraryThing. (Although the goal of reCaptcha is somewhat relevant to book lovers, too.) Edited to make the bookcaptchaimage links not links. If you clicked them, you got binary data displayed as text, i.e. garbage.
Very good. Excellent idea. Whose was it?
Ok, this is weird. I edited the above message, and forgot to fix the > and < in the first snippet of html. It is now showing random covers every time I look at the message. I'll fix the html again.
(Sure would be nice if it remembered that the user had literally entered > and < instead of putting in > and < when editing, so you don't have to remember to change them back.)
>8
How the code relates to the cover you get and whether it changes isn't something I'm getting into.
it appears the chances of having two impossible ones in a row is pretty slim
Right.
Very good. Excellent idea. Whose was it?
Mine. Thanks.
This is wonderful. Much easier on the eye than the normal ones, and really related to the site. I love it! I give it five stars.
I have a cool related idea. But not tonight. I'm working on spam tonight. 14justjimFeb 27, 2010, 5:10am 
That's a really clever idea, Tim. Just one suggestion, you said Also, while book covers can be run though a text converter, it's not at all obvious to a computer what part of a book is the title. Humans can tell titles from authors. Not so a computer. Perhaps you should ask for the title since in many cases the author would be any words after the word 'by' in the string the text converter returns. 15AndrewBFeb 27, 2010, 5:41am 
If spammers start catching on and manage to read the text - you could get it to ask random questions about the cover: author, title - but also things like "enter the first four characters of the author name". By introducing some random-ness, it will make their trickery more difficult.
If I was writing a spam bot, I'd take the URL of the image and feed it to Tineye. Next scrape the results for Amazon or LibraryThing, ;-), pages, from which you can probably get a reasonablygood automated guess at the author. :-) Jesting aside, it's a nice idea! 17_Zoe_Feb 27, 2010, 9:20am 
I like it!
Clever AND fun. Good way to get serendipitous recommendations of books to read (or ... more devious ... publishers could pay YOU to use their books for captcha purposes). 20FicusFanFeb 27, 2010, 11:33am 
Love it. I can never figure out/see the usual captcha phrases and its just luck that I can get it right.
21 timspaldingEdited: Feb 27, 2010, 12:15pm 
The thing that I think is cool here is that Captchas are about making something that's easy for a human to do, but hard for a computer. Parsing text is relatively easy for a computer, but figuring out what's the author and what's the title is not. Anyone today can pick the author off a book—you know when the author name "James Paterson" is huge, because he's him, and when it's not.
Obviously, if you applied lots of bookish intelligence, you could write something to get by this. But spam is a numbers game. LibraryThing is very small potatoes. Writing something to add stuff automatically via a virtual browser was easy. This would take many hours of work, and involve lots of scraping of Amazon, or something. If they attempt it, I'll just move to user-provided covers, or titles, or whatever.
Okay, I just babbled to my forum-friends about this because I think it's SO COOL. That's how awesome this is.
I agree with others that the Captchas and similar are waaaay too hard for normal people to read... The wavy lines, the multiple colors, it doesn't stop spammers it stops me!
But I loooove this cover thing. It's a great idea!
Thanks. Stay tuned. I'm bursting to get this related thing out. It's my weekend fun project :)
#18....hahahahahhaah the book that came up when I gave it a try had an interesting title, so I checked it out and The Pig That Wants To Be Eaten is now on my wishlist! *G* 25tardisFeb 28, 2010, 11:46am 
24> Spam fighting for the win!! I LOVE a place where even anti-Spam measures can turn people on to new books. 26KingRatFeb 28, 2010, 12:18pm 
>16,21
Just hope that Google Goggles doesn't open access via API. Right now I can take a picture of any book with it, and it will send me to amazon or Google Books to the correct book 99.9% of the time.
One other question: accessibility. What's the option for low-sight or blind users?
>3 Only if you also allow the LC number as well. Some people have much easier access to an LC-using library than a Dewey-using one. (Not that I fall into that category at the moment. But I did when I was at college and living on campus!) 29SuncatMar 1, 2010, 11:29am 
I like how it works, and how harmonious it is with LT.
Any idea how well this is working to deter spammers? I know we've been getting some spammers lately, but I don't have a sense of whether it's more or less, or the same.
Much less. Daily spam accounts have virtually vanished. What's left are the non-automated people. They do more damage, though, per user. 33vaneskaEdited: Mar 6, 2010, 3:46am 
There's something peculiarly satisfying about flagging a spam commercial account and knowing that somewhere in the background it's being dealt with. It's a good way of handling the problem.
v
I tried it and the first two covers I got had no author, but it worked for the third one! 35justjimMar 6, 2010, 5:21am 
#23 ...I'm bursting to get this related thing out. ...
This had been in the back of my head for a while... it was the cover tagging game, wasn't it?
Or is there something else??? 37ojchaseMar 24, 2010, 9:33am 
This is just great! I just love the amount of creativity and cleverness you guys have!
38> WTH? Did we really just get drive-by spammed on the thread about spam catching?
I wonder, do you make money from home by spam-catching. That would be meta. |  31,362 messages This group does not accept members.  AboutThis topic is not marked as primarily about any work, author or other topic.  Touchstones
|