TY - GEN
T1 - Enhancing collaborative spam detection with Bloom filters
AU - Yan, Jeff
AU - Cho, Pook Leong
PY - 2006/12/26
Y1 - 2006/12/26
N2 - Signature-based collaborative spam detection (SCSD) systems provide a promising solution addressing many problems facing statistical spam filters, the most widely adopted technology for detecting junk emails. In particular, some SCSD systems can identify previously unseen spam messages as such, although intuitively this would appear to be impossible. However, the SCSD approach usually relies on huge databases of email signatures, demanding lots of resource in signature lookup, storage, transmission and merging. In this paper, we report our enhancements to two representative SCSD systems. In our enhancements, signature lookups can be performed in constant time, independent of the number of signatures in the database. Space-efficient representation can significantly reduce signature database size. A simple but fast algorithm for merging different signature databases is also supported. We use the Bloom filter technique and a novel variant of this technique to achieve all this.
AB - Signature-based collaborative spam detection (SCSD) systems provide a promising solution addressing many problems facing statistical spam filters, the most widely adopted technology for detecting junk emails. In particular, some SCSD systems can identify previously unseen spam messages as such, although intuitively this would appear to be impossible. However, the SCSD approach usually relies on huge databases of email signatures, demanding lots of resource in signature lookup, storage, transmission and merging. In this paper, we report our enhancements to two representative SCSD systems. In our enhancements, signature lookups can be performed in constant time, independent of the number of signatures in the database. Space-efficient representation can significantly reduce signature database size. A simple but fast algorithm for merging different signature databases is also supported. We use the Bloom filter technique and a novel variant of this technique to achieve all this.
KW - collaboration
KW - unsolicited electronic mail
KW - databases
KW - merging
KW - information filtering
KW - information filters
KW - internet
KW - web server
KW - performance analysis
KW - code standards
UR - https://www.scopus.com/pages/publications/39049090542
U2 - 10.1109/ACSAC.2006.26
DO - 10.1109/ACSAC.2006.26
M3 - Conference contribution book
SN - 0769527167
T3 - Proceedings - Annual Computer Security Applications Conference, ACSAC
BT - 2006 22nd Annual Computer Security Applications Conference (ACSAC'06)
PB - IEEE
CY - Piscataway, NJ
T2 - 22nd Annual Computer Security Applications Conference, ACSAC 2006
Y2 - 11 December 2006 through 15 December 2006
ER -