๐Ÿ” Search Relevance Lab

Eval Runs

Each row is one evaluation run โ€” a search backend (lexical, vector, or a hybrid fusion config) scored against the same 323 benchmark queries. The columns are quality scores (higher = better) and response time (lower = faster). Pick two runs below to compare them query-by-query.

What do these scores mean?

Each backend is scored against NFCorpus โ€” a benchmark of 323 medical search queries, where humans have judged which documents are actually relevant to each query. Every quality score runs 0โ€“1 and higher is better; for latency, lower is better.

nDCG@10
Overall quality of the top-10 ranking โ€” it rewards putting more-relevant documents nearer the top. This is the headline metric.
precision@10 (P@k)
Of the 10 results shown, the fraction that are relevant.
recall@10
Of all the relevant documents that exist for a query, the fraction that made it into the top 10.
MRR
How high the first relevant result lands, on average (1.0 = always at rank 1).
p50 / p95 ms
Response time: the median (p50) and the slow-tail 95th percentile (p95), in milliseconds.
idbackendkmodelfusionnDCGP@krecallMRRp50 msp95 msn
20hybrid10BAAI/bge-small-en-v1.5rrf k=100.33840.25290.17110.52811219.71688.6323
19hybrid10BAAI/bge-small-en-v1.5weighted ฮฑ=0.30.35570.26130.17190.55011210.81704.9323
18hybrid10BAAI/bge-small-en-v1.5weighted ฮฑ=0.50.34120.25730.17490.51961189.51743.3323
17hybrid10BAAI/bge-small-en-v1.5rrf k=600.33890.25420.17060.52931170.71650.4323
16vector10BAAI/bge-small-en-v1.5โ€”0.34280.25540.16180.52721067.21599.3323
15lexical10โ€”โ€”0.22350.15170.09670.4112124.3232.0323
14hybrid10BAAI/bge-small-en-v1.5weighted ฮฑ=0.50.34120.25730.17490.51961140.41600.1323
13hybrid10BAAI/bge-small-en-v1.5rrf k=600.33890.25420.17060.52931147.41748.1323
12vector10BAAI/bge-small-en-v1.5โ€”0.34280.25540.16180.52721195.61633.9323
11lexical10โ€”โ€”0.22350.15170.09670.4112115.6301.0323
10hybrid10BAAI/bge-small-en-v1.5weighted ฮฑ=0.50.34120.25730.17490.51961257.91683.0323
9hybrid10BAAI/bge-small-en-v1.5rrf k=600.33890.25420.17060.52931309.91733.1323
8vector10BAAI/bge-small-en-v1.5โ€”0.34280.25540.16180.52721069.21608.0323
7lexical10โ€”โ€”0.22350.15170.09670.4112136.2243.2323
6vector10BAAI/bge-small-en-v1.5โ€”0.34330.25570.16190.52881107.31657.0323
5lexical10โ€”โ€”0.22350.15170.09670.4112136.9254.2323
4vector10BAAI/bge-small-en-v1.5โ€”0.34330.25570.16190.52881142.71662.4323
3lexical10โ€”โ€”0.21490.13960.09330.410091.8156.9323
2vector10BAAI/bge-small-en-v1.5โ€”0.34330.25570.16190.5288875.71280.5323
1lexical10โ€”โ€”0.21500.13900.09330.4121112.3225.3323