Publications
2024
- CIKM: Pairing Clustered Inverted Indexes with k-NN Graphs for Fast Approximate Retrieval over Learned Sparse Representations. In CIKM 2024: The 33rd International Conference on Information and Knowledge Management, 2024. Conference rating (GII-GRIN-SCIE): A+
- SIGIR: Efficient Inverted Indexes for Approximate Retrieval over Learned Sparse Representations. In SIGIR 2024: The 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024. Conference rating (GII-GRIN-SCIE): A++ (Best Paper Runner-up Award)
- DCC: Faster Wavelet Trees with Quad Vectors. In DCC 2024: Proceedings of the 34th Data Compression Conference, 2024. Conference rating (GII-GRIN-SCIE): B
- ECIR: Efficient Multi-vector Dense Retrieval with Bit Vectors. In ECIR 2024: Proceedings of the 46th European Conference on Information Retrieval, 2024. Conference rating (GII-GRIN-SCIE): A-
- ICDE: Distilled Neural Networks for Efficient Learning to Rank. In ICDE 2024: Proceedings of the 40th IEEE International Conference on Data Engineering, 2024. Conference rating (GII-GRIN-SCIE): A++
2023
- DC: Learning Bivariate Scoring Functions for Ranking. Springer Discover Computing, 2023
- TKDE: An Optimal Algorithm for Finding Champions in Tournament Graphs. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
- TKDE: Distilled Neural Networks for Efficient Learning to Rank. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
2022
- TOIS: Fast Filtering of Search Results Sorted by Attribute. ACM Transactions on Information Systems (TOIS), 2022
2021
- TCS
- TKDE: Compressed Indexes for Fast Search of Semantic Data. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2021
- SPE: Practical trade-offs for the prefix-sum problem. Journal of Software: Practice and Experience (SPE), 2021
- CSUR
- SPIRE: TSXor: A Simple Time Series Compression Algorithm. In SPIRE 2021: Proceedings of the 28th International Symposium on String Processing and Information Retrieval, 2021. Conference rating (GII-GRIN-SCIE): B
- CPM: Compressed Weighted de Bruijn Graphs. In CPM 2021: Proceedings of the 32nd Symposium on Combinatorial Pattern Matching, 2021. Conference rating (GII-GRIN-SCIE): B
- ICDE: Compressed Indexes for Fast Search of Semantic Data. In ICDE 2021: Proceedings of the 37th IEEE International Conference on Data Engineering, 2021. Conference rating (GII-GRIN-SCIE): A++
2020
- TKDE: On Optimally Partitioning Variable-Byte Codes. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2020
- SIGIR: Efficient and Effective Query Auto-Completion. In SIGIR 2020: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2020. Conference rating (GII-GRIN-SCIE): A++
2019
- SICOMP
- ITPDS: Parallel Traversal of Large Ensembles of Decision Trees. IEEE Transactions on Parallel and Distributed Systems (ITPDS), 2019
- TOIS: Handling Massive N-Gram Datasets Efficiently. ACM Transactions on Information Systems (TOIS), 2019
- SPIRE: A new Linear-time Algorithm for Centroid Decomposition. In SPIRE 2019: Proceedings of the 26th International Symposium on String Processing and Information Retrieval, 2019. Conference rating (GII-GRIN-SCIE): B
- SPIRE: An Optimal Algorithm to Find Champions of Tournament Graphs. In SPIRE 2019: Proceedings of the 26th International Symposium on String Processing and Information Retrieval, 2019. Conference rating (GII-GRIN-SCIE): B
- SIGIR: Fast Approximate Filtering of Search Results Sorted by Attribute. In SIGIR 2019: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019. Conference rating (GII-GRIN-SCIE): A++
- Chapter: Inverted Index Compression. In Encyclopedia of Big Data Technologies, 2019
2018
- CIKM: Efficient and Effective Query Expansion for Web Search. In CIKM 2018: Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018. Conference rating (GII-GRIN-SCIE): A+
- Chapter: Text Compression. In Encyclopedia of Database Systems, Second Edition, 2018
- Chapter: Indexing Compressed Text. In Encyclopedia of Database Systems, Second Edition, 2018
2017
- TOIS
- ESA: An Encoding for Order-Preserving Matching. In ESA 2017: Proceedings of the 25th Annual European Symposium on Algorithms, 2017. Conference rating (GII-GRIN-SCIE): A-
- SIGIR: Efficient Data Structures for Massive N-Gram Datasets. In SIGIR 2017: Proceedings of the 40th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017. Conference rating (GII-GRIN-SCIE): A++
- PKDD: QuickScorer: Efficient Traversal of Large Ensembles of Decision Trees. In ECML PKDD 2017: Proceedings of the European Conference on Machine Learning and Knowledge Discovery, 2017. Conference rating (GII-GRIN-SCIE): A
- SIGIR: Faster BlockMax WAND with Variable-sized Blocks. In SIGIR 2017: Proceedings of the 40th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017. Conference rating (GII-GRIN-SCIE): A++
- CPM: Dynamic Elias-Fano Representation. In CPM 2017: Proceedings of the 28th Symposium on Combinatorial Pattern Matching, 2017. Conference rating (GII-GRIN-SCIE): B
2016
- TOIS: Fast Ranking with Additive Ensembles of Oblivious and Non-Oblivious Regression Trees. ACM Transactions on Information Systems (TOIS), 2016
- TALG
- Algorithmica
- Algorithmica
- SIGIR: Fast and compact Hamming distance index. In SIGIR 2016: Proceedings of the 39th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2016. Conference rating (GII-GRIN-SCIE): A++
- SIGIR: Exploiting CPU SIMD Extensions to Speed-up Document Scoring with Tree Ensembles. In SIGIR 2016: Proceedings of the 39th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2016. Conference rating (GII-GRIN-SCIE): A++
- SIGIR: Succinct Data Structures in Information Retrieval: Theory and Practice. In SIGIR 2016: Proceedings of the 39th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2016. Conference rating (GII-GRIN-SCIE): A++
2015
- SIGIR: QuickScorer: a Fast Algorithm to Rank Documents with Additive Ensembles of Regression Trees. In SIGIR 2015: Proceedings of the 38th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015. Conference rating (GII-GRIN-SCIE): A++ (Best Paper Award)
- WWW: Compressed indexes for string-searching in labeled graphs. In WWW 2015: Proceedings of the 24th International Conference on World Wide Web, 2015. Conference rating (GII-GRIN-SCIE): A++
- WSDM: Optimal Space-time Tradeoffs for Inverted Indexes. In WSDM 2015: Proceedings of the 8th Annual International ACM Conference on Web Search and Data Mining, 2015. Conference rating (GII-GRIN-SCIE): A+
2014
- ESA: Bicriteria data compression: efficient and usable. In ESA 2014: Proceedings of the 22nd Annual European Symposium on Algorithms, 2014. Conference rating (GII-GRIN-SCIE): A-
- SIGIR: Partitioned Elias-Fano Indexes. In SIGIR 2014: Proceedings of the 37th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014. Conference rating (GII-GRIN-SCIE): A++ (Best Paper Award)
- DCC: Cache-Oblivious Peeling of Random Hypergraphs. In DCC 2014: Proceedings of the 24th IEEE Data Compression Conference, 2014. Conference rating (GII-GRIN-SCIE): B
- SODA: Bicriteria data compression. In SODA 2014: Proceedings of the 25th Annual ACM-SIAM Symposium on Discrete Algorithms, 2014. Conference rating (GII-GRIN-SCIE): A+
- Book: Compressed Data Structures for Strings, 2014
- Chapter: Recommender Systems. In Mining User Generated Content, 2014
2013
- SICOMP
- Algorithmica
- ESA: Compressed Cache-Oblivious String B-tree. In ESA 2013: Proceedings of the 21st Annual European Symposium on Algorithms, 2013. Conference rating (GII-GRIN-SCIE): A-
- ICALP: Dynamic Compressed Strings with Random Access. In ICALP 2013: Proceedings of the 40th International Colloquium on Automata, Languages and Programming, 2013. Conference rating (GII-GRIN-SCIE): A
- SODA: Compressed Static Functions with Applications. In SODA 2013: Proceedings of the 24th Annual ACM-SIAM Symposium on Discrete Algorithms, 2013. Conference rating (GII-GRIN-SCIE): A+
- Chapter: Web Search. In The Power of Algorithms, 2013
2012
- CIKM: Making your interests follow you on twitter. In CIKM 2012: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012. Conference rating (GII-GRIN-SCIE): A+
- SIGIR: Efficient query recommendations in the long tail via center-piece subgraphs. In SIGIR 2012: Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2012. Conference rating (GII-GRIN-SCIE): A++
- CPM: Compressed String Dictionary Look-up with Edit Distance One. In CPM 2012: Proceedings of the 23rd Annual Symposium on Combinatorial Pattern Matching, 2012. Conference rating (GII-GRIN-SCIE): B
- ECIR: How Random Walks Can Help Tourism. In ECIR 2012: Proceedings of the 34th European Conference on IR Research, 2012. Conference rating (GII-GRIN-SCIE): A-
2011
- Algorithmica
- WWW: Recommendations for the long tail by term-query graph. In WWW 2011 (Companion Volume): Proceedings of the 20th International Conference on World Wide Web, 2011. Conference rating (GII-GRIN-SCIE): A++
- PODS: Space-efficient substring occurrence estimation. In PODS 2011: Proceedings of the 30th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, 2011. Conference rating (GII-GRIN-SCIE): A+
- ESA: Distribution-Aware Compressed Full-Text Indexes. In ESA 2011: Proceedings of the 19th Annual European Symposium on Algorithms, 2011. Conference rating (GII-GRIN-SCIE): A-
2010
- TALG
- TCS: On compact representations of All-Pairs-Shortest-Path-Distance matrices. Theoretical Computer Science (TCS), 2010
- CIKM: VSEncoding: Efficient Coding and Fast Decoding of Integer Lists via Dynamic Programming. In CIKM 2010: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, 2010. Conference rating (GII-GRIN-SCIE): A+
- PhD Thesis
2009
- ESA: On Optimally Partitioning a Text to Improve Its Compression. In ESA 2009: Proceedings of the 17th Annual European Symposium on Algorithms, 2009. Conference rating (GII-GRIN-SCIE): A-
- SODA: On the bit-complexity of Lempel-Ziv compression. In SODA 2009: Proceedings of the 20th Annual ACM-SIAM Symposium on Discrete Algorithms, 2009. Conference rating (GII-GRIN-SCIE): A+
- Chapter: Indexing Compressed Text. In Encyclopedia of Database Systems, 2009
2008
- JEA: Compressed text indexes: From theory to practice. ACM Journal of Experimental Algorithmics (JEA), 2008
- CPM: On Compact Representations of All-Pairs-Shortest-Path-Distance Matrices. In CPM 2008: Proceedings of the 19th Annual Symposium on Combinatorial Pattern Matching, 2008. Conference rating (GII-GRIN-SCIE): B
2007
- TCS: A simple storage scheme for strings achieving entropy bounds. Theoretical Computer Science (TCS), 2007
- SIGIR: Compressed permuterm index. In SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007. Conference rating (GII-GRIN-SCIE): A++
- SODA: A simple storage scheme for strings achieving entropy bounds. In SODA 2007: Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms, 2007. Conference rating (GII-GRIN-SCIE): A+
Tutorials
- Succinct Data Structures in Information Retrieval: Theory and Practice. Full-day tutorial at ACM SIGIR 2016, with Simon Gog.