Implementing Block-Max Pruning in Rust: Faster Learned Sparse Retrieval for Modern Search
UB4.136 | Day 2 | 13:55 - 14:25 | Speakers: Ferdinand Schlatt, Antonio Mallia
Abstract
Learned sparse retrieval models such as SPLADE, uniCOIL, and other transformer-based sparse encoders have become popular for delivering neural-level relevance while preserving the efficiency of inverted indexes. But these models also produce indexes with statistical properties radically different from classic BM25: longer queries, compressed vocabularies, and posting lists with unusual score distributions. As a result, traditional dynamic pruning algorithms like WAND and Block-Max WAND often fail to exploit their full potential.
This talk presents Block-Max Pruning (BMP) from a systems and Rust-engineering perspective. We will walk through how BMP restructures query processing by partitioning document space into small, contiguous blocks and maintaining lightweight, on-the-fly score upper bounds that guide safe or approximate early termination.
The talk is aimed at developers building retrieval engines, Rust-based data systems, or ML-powered search pipelines who want to push sparse retrieval performance further. Attendees will leave with a clear understanding of how BMP works, why learned sparse models require new pruning strategies, and how to integrate these ideas into modern, high-performance Rust codebases.
Code and resources: BMP GitHub repository: https://github.com/pisa-engine/BMP/ Paper (SIGIR 2024): https://www.antoniomallia.it/uploads/SIGIR24.pdf
Attachments
Speakers
Ferdinand Schlatt is a PhD student from the Friedrich-Schiller-University in Jena and a research engineer at Seltz. His research is focused on making transformer-based language retrieval models more efficient and/or more effective by aligning the attention mechanism with the task of information retrieval.
Antonio is currently building a new project in search and AI, with a focus on efficient and open technologies. Previously, he was a Staff Research Scientist at Pinecone and an Applied Scientist on Amazon’s AGI team. He received his Ph.D. from New York University, where his research centered on information retrieval and efficient search architectures.
Links
External Links
Notice: The placeholder video image is licensed under CC BY-SA 4.0. The original image can be found hereChanges made to the image are: Cropped the image to a new ratio, part of the image was cut off.
