Skip to main content

Enhancing Email Spam Detection with LLMs: Practical Experience with Rspamd and GPT

K.4.601 | Day 1 | 13:30 - 14:00 | Speakers: Vsevolod Stakhov

Enhancing Email Spam Detection with LLMs: Practical Experience with Rspamd and GPT
A picture of a devroom at FOSDEM 2024
Open in browser
Get involved in the conversation!Join the chat

Notes

Abstract

This talk explores the practical implementation of Large Language Models (LLMs) in email filtering, giving the example of the integration between Rspamd and various LLM services. We'll discuss how LLMs can complement traditional filtering methods, comparing supervised (Bayes) and unsupervised (LLM-based) approaches to spam detection.

We'll examine real-world results from different models (GPT-3.5, GPT-4, and alternatives via OpenRouter), analyzing their effectiveness, false positive rates, and cost implications. The presentation will cover advanced features such as content categorization, password extraction from archives, and message anonymization for privacy-preserving learning.

Special attention will be given to practical deployment considerations, including:

  • Cost-effective strategies for different scales of operation
  • Self-hosted models vs. cloud APIs
  • Privacy considerations and message anonymization techniques
  • Integration with existing email infrastructure
  • Extended message analysis capabilities

The talk will conclude with insights into future developments and best practices for implementing LLM-based email filtering in both personal and enterprise environments.

Target Audience: Email administrators, spam filtering specialists, and developers interested in modern email security solutions.

Speakers

Vsevolod Stakhov

Notice: The placeholder video image is licensed under CC BY-SA 4.0. The original image can be found hereChanges made to the image are: Cropped the image to a new ratio, part of the image was cut off.