Authors
Hannes Mühleisen, Thaer Samar, Jimmy Lin, Arjen De Vries
Publication date
2014/7/3
Book
Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval
Pages
863-866
Description
We make the suggestion that instead of implementing custom index structures and query evaluation algorithms, IR researchers should simply store document representations in a column-oriented relational database and implement ranking models using SQL. For rapid prototyping, this is particularly advantageous since researchers can explore new scoring functions and features by simply issuing SQL queries, without needing to write imperative code. We demonstrate the feasibility of this approach by an implementation of conjunctive BM25 using two modern column stores. Experiments on a web collection show that a retrieval engine built in this manner achieves effectiveness and efficiency on par with custom-built retrieval engines, but provides many additional advantages, including cleaner query semantics, a simpler architecture, built-in support for error analysis, and the ability to exploit advances in database …
Total citations
20152016201720182019202020212022202320242532452321
Scholar articles
H Mühleisen, T Samar, J Lin, A De Vries - Proceedings of the 37th international ACM SIGIR …, 2014