ML Research

Exploring depth in a 'retrieve-and-rerank' pipeline
ML Research

Exploring depth in a 'retrieve-and-rerank' pipeline

Select an optimal re-ranking depth for your model and dataset.

Thanos Papaoikonomou

Thomas Veasey

Quentin Herreros

All Articles
What is semantic reranking and how to use it?
ML Research

What is semantic reranking and how to use it?

Learn about the trade-offs using semantic reranking in search and RAG pipelines.

Thomas Veasey

Quentin Herreros

Thanos Papaoikonomou

Better Binary Quantization 101
LuceneML Research

Better Binary Quantization 101

Understand what binary quantization is, how it works and its benefits. This guide also covers the math behind the quantization and examples.

John Wagster

Introducing Elastic Rerank: Elastic's new semantic re-ranker model
ML Research

Introducing Elastic Rerank: Elastic's new semantic re-ranker model

Learn about how Elastic's new re-ranker model was trained and how it performs

Thomas Veasey

Quentin Herreros

Thanos Papaoikonomou

Evaluating search relevance part 2 - Phi-3 as relevance judge
ML Research

Evaluating search relevance part 2 - Phi-3 as relevance judge

Using the Phi-3 language model as a relevance judge, with tips & techniques to improve the agreement with human-generated annotation

Thanos Papaoikonomou

Thomas Veasey

Evaluating search relevance part 1 - The BEIR benchmark
ML Research

Evaluating search relevance part 1 - The BEIR benchmark

Learn to evaluate your search system in the context of better understanding the BEIR benchmark, with tips & techniques to improve your search evaluation processes.

Thanos Papaoikonomou

Thomas Veasey

Evaluating scalar quantization in Elasticsearch
ML Research

Evaluating scalar quantization in Elasticsearch

Learn how scalar quantization can be used to reduce the memory footprint of vector embeddings in Elasticsearch through an experiment.

Thanos Papaoikonomou

Thomas Veasey

Scalar quantization optimized for vector databases
ML Research

Scalar quantization optimized for vector databases

Optimizing scalar quantization for the vector database use case allows us to achieve significantly better performance for the same retrieval quality at high compression ratios.

Thomas Veasey

Benjamin Trent

Understanding Int4 scalar quantization in Lucene
LuceneML Research

Understanding Int4 scalar quantization in Lucene

This blog explains how int4 quantization works in Lucene, how it lines up, and the benefits of using int4 quantization.

Benjamin Trent

Thomas Veasey

RAG evaluation metrics: A journey through metrics
ML Research

RAG evaluation metrics: A journey through metrics

Explore RAG evaluation metrics like BLEU score, ROUGE score, PPL, BARTScore, and more. Discover how Elastic is evaluating RAG with UniEval.

Quentin Herreros

Thomas Veasey

Thanos Papaoikonomou