Top Banner
Evaluating Anti-Spam Filtering Solutions Michael Lamont Senior Software Engineer Process Software
32

Evaluating Anti-Spam Filtering Solutions

Jan 15, 2015

Download

Software

Michael Lamont

Methodology for a technical evaluation of software-based spam filters - a hot topic back in 2005. It was originally going to be given at the HP Tech Forum in New Orleans in Sept 2005 - Katrina forced the conference to cancel while I was literally on the way to the Boston airport. Ended up giving this presentation at the rescheduled conference in Orlando in Oct 2005.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Evaluating Anti-Spam Filtering Solutions

Evaluating Anti-Spam Filtering Solutions

Michael Lamont Senior Software Engineer

Process Software

Page 2: Evaluating Anti-Spam Filtering Solutions

Agenda

• Examine in detail several sets of criteria for evaluating spam filters.

• How to setup and run both lab and production tests.

• Best practices for testing spam filters.

• Some common mistakes made while testing spam filters.

Page 3: Evaluating Anti-Spam Filtering Solutions

Introduction

• There’s a lot of spam out there,

• it’s ticking off everybody with an email account,

• and there are a LOT of companies that would love to help you solve the problem.

• Solution quality varies.

• Every site has different needs.

Page 4: Evaluating Anti-Spam Filtering Solutions

Goals

• Have a stack of criteria you can use for constructing RFPs and formal evaluations.

• Learn how to construct good, objective filter tests.

Page 5: Evaluating Anti-Spam Filtering Solutions

Why Evaluate?

• Spam has gotten so bad, that life is tough without some kind of filter.

• You (or your users) may be unhappy with your current solution.

• Even if you’re happy, there might be something better out there.

Page 6: Evaluating Anti-Spam Filtering Solutions

Different Solution Classes

• Integrated

− Works with your existing MTA

− Usually installed on same system as MTA

• Proxy

− Separate system running the filter sits in front of MTA

− Intercepts and filters mail before it reaches the MTA

• Appliance

− Works like a proxy, but more plug’n’play

• Service

− All mail routes through vendor’s systems at a data center to be filtered

Page 7: Evaluating Anti-Spam Filtering Solutions

Basic Criteria - Supported Platforms

• Operating system and email server software.

• If you have an esoteric OS or email server, consider a proxy.

• Make sure software supports multiple filtering systems for a domain.

Page 8: Evaluating Anti-Spam Filtering Solutions

Basic Criteria - User Authentication

• User interfaces for personalized settings, quarantined messages, etc need to be able to authenticate users.

• Lots of solutions require LDAP for user functionality.

• Ability to authenticate multiple ways against multiple sources.

Page 9: Evaluating Anti-Spam Filtering Solutions

Basic Criteria - Site Specific Needs

• Make sure solution supports any special needs you have.

• Example: Change colors & images in user interface to match site’s email portal.

Page 10: Evaluating Anti-Spam Filtering Solutions

Basic Criteria - Cost

• If a solution costs too much, it’s not a solution.

• Lots of vendors are willing to negotiate deals that take budgetary constraints into account.

• Ask about applicable discounts - i.e. educational institution.

Page 11: Evaluating Anti-Spam Filtering Solutions

Basic Criteria - Technical Support

• Tech support costs should be included in price quote.

• Make sure telephone support is available.

• Look for 24/7 support, if your site needs it.

Page 12: Evaluating Anti-Spam Filtering Solutions

Basic Criteria - Geographical Location

• Can affect problem resolution times.

• Time zone differences

• Language differences

Page 13: Evaluating Anti-Spam Filtering Solutions

Basic Criteria - Vendor Stability

• How long has company been in business?

• How long have they been selling/supporting anti-spam?

Page 14: Evaluating Anti-Spam Filtering Solutions

Evaluation Criteria - Accuracy

• High spam detection rate

• Low false positive rate

• Good filters strike a balance between spam detection and false positives.

• Provide a way for users and admins to view/release filtered messages (quarantine).

• Be able to recognize a message as spam without having to write a rule for each message.

Page 15: Evaluating Anti-Spam Filtering Solutions

Evaluation Criteria - Configurability

• Filter should be tailorable to fit your site’s definition of spam.

• Filter should be effective out of the box.

• Admins should be able to modify each individual rule used to filter spam.

• Users should be able to determine how “spammy” a message is before it’s filtered.

• System-wide & per-user whitelists and blacklists.

• Admin interface should be user-friendly

• No software to be installed on desktops

Page 16: Evaluating Anti-Spam Filtering Solutions

Evaluation Criteria - Logging/Information

• Both users and admins should be able to tell why a message was filtered.

• Information about why the message was filtered should be available in the message headers.

• Master log file should contain one entry for each message, and reasons it was filtered.

• Product should provide statistics on message filtering activity.

Page 17: Evaluating Anti-Spam Filtering Solutions

Evaluation Criteria - Filtering Methods

• Filter should use proven methods that balance accuracy and system resource usage.

• Filters that use more than one filtering method are harder for spammers to circumvent.

• Using too many methods eats up lots of system resources for no significant gains in accuracy.

Page 18: Evaluating Anti-Spam Filtering Solutions

Evaluation Criteria - Performance

• Email is a “highly visible application”

• If messages are delayed by spam filter, users will let you know.

• Email volume is constantly increasing, so filter should be scaleable.

Page 19: Evaluating Anti-Spam Filtering Solutions

Evaluation Criteria - Security

• Spam filter should make sure confidential email stays that way.

• Filter shouldn’t send information about your site to anyone without your permission.

• Automated rule updates.

• Proxies and services should support TLS.

Page 20: Evaluating Anti-Spam Filtering Solutions

Evaluation Criteria - Time Cost

• Quick installation and integration.

• Minimal admin requirements.

• Push as many tasks as possible out to end users.

Page 21: Evaluating Anti-Spam Filtering Solutions

User Interface Criteria

• Simple instructions and labels

• Natural language support

• Minimize user memory load

• Consistency

• Feedback

• Clearly marked exits

• Good error messages

• Help and documentation easily available

Page 22: Evaluating Anti-Spam Filtering Solutions

Non-Production (Lab) Evaluation

• Good way to play with lots of products simultaneously.

• Doesn’t interfere with end users or production mail stream.

• Easiest way to conduct fair accuracy tests.

Page 23: Evaluating Anti-Spam Filtering Solutions

Corpus Testing

• Uses large bodies of collected messages (a corpus).

• Each corpus contains thousands of messages that are either spam or non-spam.

• Entire corpus is sent through filter; count is kept of filtered messages.

Page 24: Evaluating Anti-Spam Filtering Solutions

Corpus Messages - Spam

• Set up a public IMAP folder that users can place spam messages in.

• Online spam sources - www.spamarchive.org

• Watch out for missing/invalid headers/

Page 25: Evaluating Anti-Spam Filtering Solutions

Corpus Messages - Non-Spam

• Public IMAP folder; less participation

• UseNet newsgroups

• Watch the headers!

Page 26: Evaluating Anti-Spam Filtering Solutions

Forking User Mail

• Sends copies of user messages through the filter.

• Lets you see how the filter handles real mail send to your site.

• Still invisible to end users (if you want it to be)

Page 27: Evaluating Anti-Spam Filtering Solutions

Forking User Mail - Method

• Start by selecting appropriate group of users.

• Create alias that “forks” mail to test system:

jdoe: [email protected], [email protected]

• Users can log into interface on test system.

Page 28: Evaluating Anti-Spam Filtering Solutions

Production Evaluation

• Uses filtering system on your production mail stream.

• Opt-in/opt-out feature lets you select small test groups.

• Good idea to start with least intrusive features, and work up to full functionality.

• Some filters can be set to scan messages and log results w/o altering the message.

Page 29: Evaluating Anti-Spam Filtering Solutions

Best Practices For Production Testing

• Select sizeable group of users to participate in testing.

• Give test users plenty of warning before starting/stopping test.

• Customize user interface to be in language end users are most familiar with.

• Create mailing list for test users to post problems and suggestions to.

• Set up accounts for users to forward false positives and false negatives to.

• At the end of the testing period, solicit feedback.

Page 30: Evaluating Anti-Spam Filtering Solutions

Sample Feedback Questions

• Use a sliding scale (1 to 5 works well)

− Using xyz would improve my email workflow

− xyz keeps obscene messages out of my Inbox

− xyz reduces the amount of time I spend dealing with junk email

− Learning to use xyz was easy for me

− It was easy to get xyz to do what I want it to do

− xyz was easy to use on a day to day basis

• Be sure to solicit free-form comments and suggestions.

Page 31: Evaluating Anti-Spam Filtering Solutions

Common Testing Problems

• Using a small group of test users.

• Using only testers from one workgroup or department.

• Using mail client software to forward test spam.

• Using raw messages from public repositories.

• Using homogenous messages blocks on AI filters.

Page 32: Evaluating Anti-Spam Filtering Solutions

Quick Review

• We talked about:

− Lots of criteria for evaluating spam filters.

− How to setup and run both lab and production tests.

− Some best practices for testing spam filters.

− A few common mistakes made while testing spam filters.