ML Research

Evaluating search relevance - Part 1
ML Research

Evaluating search relevance - Part 1

How to evaluate your own search systems in the context of better understanding the BEIR benchmark, with specific tips and techniques to improve your search evaluation processes. Part 1 of the series.

Thanos Papaoikonomou

Thomas Veasey

All Articles
Evaluating scalar quantization in Elasticsearch
ML Research

Evaluating scalar quantization in Elasticsearch

Learn how scalar quantization can be used to reduce the memory footprint of vector embeddings in Elasticsearch through an experiment.

Thanos Papaoikonomou

Thomas Veasey

Scalar Quantization Optimized for Vector Databases
ML Research

Scalar Quantization Optimized for Vector Databases

Optimizing scalar quantization for the vector database use case allows us to achieve significantly better performance for the same retrieval quality at high compression ratios.

Thomas Veasey

Benjamin Trent

Understanding Int4 scalar quantization in Lucene
LuceneML Research

Understanding Int4 scalar quantization in Lucene

This blog explains how int4 quantization works in Lucene, how it lines up, and the benefits of using int4 quantization.

Benjamin Trent

Thomas Veasey

RAG evaluation metrics: A journey through metrics
ML Research

RAG evaluation metrics: A journey through metrics

Explore RAG evaluation metrics like BLEU score, ROUGE score, PPL, BARTScore, and more. Discover how Elastic is evaluating RAG with UniEval.

Quentin Herreros

Thomas Veasey

Thanos Papaoikonomou

Understanding scalar quantization in Lucene
LuceneML Research

Understanding scalar quantization in Lucene

Explore how Elastic introduced scalar quantization into Lucene, including automatic byte quantization, quantization per segment & performance insights.

Benjamin Trent

Scalar quantization 101
LuceneML Research

Scalar quantization 101

Understand what scalar quantization is, how it works and its benefits. This guide also covers the math behind quantization and examples.

Benjamin Trent

Improving information retrieval in the Elastic Stack: Improved inference performance with ELSER v2
ML Research

Improving information retrieval in the Elastic Stack: Improved inference performance with ELSER v2

Learn about the improvements we've made to the inference performance of ELSER v2.

Thomas Veasey

Quentin Herreros

Valeriy Khakhutskyy

Improving information retrieval in the Elastic Stack: Optimizing retrieval with ELSER v2
ML Research

Improving information retrieval in the Elastic Stack: Optimizing retrieval with ELSER v2

Learn about how we're reducing retrieval costs for ELSER v2.

Thomas Veasey

Quentin Herreros

Valeriy Khakhutskyy

Generative AI architectures with transformers explained from the ground up
ML ResearchGenerative AI

Generative AI architectures with transformers explained from the ground up

Here's how generative AI works from the ground up, including embeddings, transformer-encoder architecture, training/fine-tuning models & more.

Aris Papadopoulos