August 23, 2004
An algorithm named Chung-Kwei, originally designed to analyze DNA sequences, was able to identify spam with 96.56 per cent accuracy by analyzing text patterns.
Instead of chains of characters representing DNA sequences, the research group fed the algorithm 65,000 examples of known spam. Each email was treated as a long, DNA-like chain of characters. Teiresias identified six million recurring patterns in this collection, such as “Viagra”.