www.1001TopWords.com
How Spammers Fool Bayesian Filters - And How to Stop Them
Stopping spam effectively over the long term requires much more than blocking individual IP addresses and writing rules based on keywords that spammers typically use. Increasingly sophisticated spam tools, coupled with a growing number of spammers in the wild, have driven a hyper-evolution in the variety and volume of spam. The old ways of blocking the bad guys just don't work anymore. Examining spam and spam-blocking technology can illuminate how this evolution is taking place and what can be done to combat spam and reclaim e-mail as the efficient, effective communication tool it was intended to be.
One method used to combat spam is Bayesian filtering. Named after Thomas Bayes, an English mathematician, Bayesian logic is used in decision making and inferential statistics. Bayesian filters maintain a database of known spam and ham, or legitimate email. Once the database is large enough, the system ranks words according to the probability that they will appear in a spam message. Words more likely to appear in spam are given a high score (between 51 and 100), and words more likely to appear in legitimate email are given a low score (between 1 and 50). For example, the words "free" and "sex" generally have values between 95 and 98, whereas the words "emphasis" or "disadvantage" may have a score between 1 and 4. Commonly used words such as "the" and "that", along with words the filter has not seen before, are given a neutral score between 40 and 50 and are not used in the system's algorithm.
When the system receives an email, it breaks the message down into tokens, or words with values assigned to them. The system uses the tokens with scores at the high and low ends of the range to develop a score for the email as a whole. If the email has more spam tokens than ham tokens, it will have a high spam score. The email administrator sets a threshold score that the system uses to decide which email is allowed to pass through to users.
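The scoring process described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not how any particular product works: the function names, the tiny sample corpus, the neutral score of 45 for unseen words, and the threshold of 90 are all hypothetical. The article does not say how token scores are combined into a message score, so the sketch assumes the naive Bayes combination that is commonly used for this purpose.

```python
import math
import re

NEUTRAL_SCORE = 45.0         # assumed score for words the filter has never seen
NEUTRAL_BAND = (40.0, 50.0)  # tokens scoring in this band are ignored


def tokenize(text):
    """Split a message into lowercase word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def train(spam_messages, ham_messages):
    """Return {word: score}; a score near 100 means the word is spammy."""
    def doc_freq(messages):
        counts = {}
        for msg in messages:
            for word in set(tokenize(msg)):
                counts[word] = counts.get(word, 0) + 1
        return counts

    spam_counts = doc_freq(spam_messages)
    ham_counts = doc_freq(ham_messages)
    scores = {}
    for word in set(spam_counts) | set(ham_counts):
        spam_rate = spam_counts.get(word, 0) / len(spam_messages)
        ham_rate = ham_counts.get(word, 0) / len(ham_messages)
        probability = spam_rate / (spam_rate + ham_rate)
        # Clamp to 1..99 so no single word counts as absolute proof.
        scores[word] = min(99.0, max(1.0, 100.0 * probability))
    return scores


def message_score(text, scores):
    """Combine the non-neutral token scores into one 0-100 message score."""
    log_spam = log_ham = 0.0
    for word in set(tokenize(text)):
        score = scores.get(word, NEUTRAL_SCORE)
        if NEUTRAL_BAND[0] <= score <= NEUTRAL_BAND[1]:
            continue  # common or unknown word: carries no evidence
        p = score / 100.0
        log_spam += math.log(p)
        log_ham += math.log(1.0 - p)
    # Same value as prod(p) / (prod(p) + prod(1 - p)), written with tanh
    # so that long messages cannot overflow the exponential.
    return 50.0 * (1.0 - math.tanh((log_ham - log_spam) / 2.0))


def is_spam(text, scores, threshold=90.0):
    """Apply the administrator's threshold (90 is an arbitrary example)."""
    return message_score(text, scores) >= threshold


# Hypothetical training data, far too small for real use.
SPAM = ["free sex pills, free shipping",
        "win free money now",
        "click the link for a free offer"]
HAM = ["the agenda puts emphasis on the budget",
       "one disadvantage of this plan is cost",
       "lunch meeting moved to noon"]

word_scores = train(SPAM, HAM)
print(is_spam("win free money", word_scores))
print(is_spam("lunch meeting at noon", word_scores))
```

With a corpus this small, almost every word ends up at one extreme of the range. A production filter would train on many thousands of messages and keep updating its scores from user feedback.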
Bayesian filters are effective at filtering spam and minimizing false positives. Because they adapt and learn based on user feedback, Bayesian filters produce better results the longer they are used within an organization. They are not, however, foolproof. Spammers have learned which words Bayesian filters consider spammy and have developed ways to insert non-spammy words into emails to lower a message's overall spam score. By adding paragraphs of text from novels or news stories, spammers can dilute the effect of high-ranking words. Text insertion has also caused normally legitimate words found in novels or news stories to acquire an inflated spam score, which may render Bayesian filters less effective over time.
Another approach spammers use to fool Bayesian filters is to create less spammy emails. For example, a spammer may send an email containing only the phrase "Here's the link" and a URL. This approach can neutralize the spam score and entice users to click through to a Web site containing the spammer's message. To block this type of spam, the filter would have to be designed to follow the link and scan the content of the Web site users are asked to visit. Bayesian filters do not currently do this, because it would be prohibitively expensive in terms of server resources and could be exploited as a way to launch denial-of-service attacks against commercial servers.
As with all single-method approaches to spam filtering, Bayesian filters are effective against certain techniques spammers use, but they are not a magic bullet for the spam problem. Bayesian filters are most effective when combined with other methods of spam detection.
The Solution
When used individually, each anti-spam technique has been systematically overcome by spammers.
Grandiose plans to rid the world of spam, such as charging a penny for each e-mail received or forcing servers to solve mathematical problems before delivering e-mail, have been proposed with few results. These schemes are not realistic, because they would require a large percentage of the population to adopt the same anti-spam method in order to be effective. You can learn more about the fight against spam by visiting our website at www.ciphertrust.com and downloading our whitepapers.
Dr. Paul Judge is a noted scholar and entrepreneur. He is Chief Technology Officer at CipherTrust, the industry's largest provider of enterprise email security. The company's flagship product, IronMail, provides a best-of-breed enterprise anti-spam solution designed to stop spam, phishing attacks and other email-based threats. Learn more by visiting http://www.ciphertrust.com/products/spam_and_fraud_protection today.
© Athifea Distribution LLC - 2013