mixedbread-ai/mxbai-rerank-base-v2 (Score)

The crispy rerank family from Mixedbread.

Architecture: Qwen2
Parameters: 494M
Tasks: Score
Outputs: Score
Max Sequence Length: 8,192 tokens
License: apache-2.0
Languages: af, am, ar, as, az, be, bg, bn, br, bs, ca, cs, cy, da, de, el, en, eo, es, et, eu, fa, ff, fi, fr, fy, ga, gd, gl, gn, gu, ha, he, hi, hr, ht, hu, hy, id, ig, is, it, ja, jv, ka, kk, km, kn, ko, ku, ky, la, lg, li, ln, lo, lt, lv, mg, mk, ml, mn, mr, ms, my, ne, nl, no, ns, om, or, pa, pl, ps, pt, qu, rm, ro, ru, sa, sc, sd, si, sk, sl, so, sq, sr, ss, su, sv, sw, ta, te, th, tl, tn, tr, ug, uk, ur, uz, vi, wo, xh, yi, yo, zh, zu
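
The Score task and output listed above mean that a reranker assigns a relevance score to each query-document pair, and the candidates are then sorted by that score. The sketch below illustrates only that sorting step. It does not load the model: `toy_score` is a hypothetical word-overlap stand-in for the model's scoring call, and `rerank`, `score_fn`, and `top_k` are illustrative names, not part of any Mixedbread API.

```python
from typing import Callable, List, Optional, Tuple


def rerank(
    query: str,
    documents: List[str],
    score_fn: Callable[[str, str], float],
    top_k: Optional[int] = None,
) -> List[Tuple[str, float]]:
    """Score every (query, document) pair and return documents sorted by score, best first."""
    scored = [(doc, score_fn(query, doc)) for doc in documents]
    # Python's sort is stable, so documents with equal scores keep their input order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored if top_k is None else scored[:top_k]


def toy_score(query: str, document: str) -> float:
    """Hypothetical stand-in for a reranker model: fraction of query words found in the document."""
    query_words = set(query.lower().split())
    doc_words = set(document.lower().split())
    if not query_words:
        return 0.0
    return len(query_words & doc_words) / len(query_words)


if __name__ == "__main__":
    candidates = [
        "cooking pasta",
        "force of gravity on earth",
        "gravity waves",
    ]
    for doc, score in rerank("gravity force", candidates, toy_score, top_k=2):
        print(f"{score:.2f}  {doc}")
```

In a real deployment, `score_fn` would wrap a call to the reranker itself, and each query-document pair would need to fit within the 8,192-token maximum sequence length.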


Benchmarks

CQADupstackPhysicsRetrieval

scientific retrieval en

Duplicate question retrieval from StackExchange Physics

Corpus: 38,314 Queries: 1,039

Performance L4 b1 c16

Query throughput: 4.1K tok/s

Query latency (p50): 593.2 ms


CosQA

technology retrieval en

Code search with natural language queries

Corpus: 6,267 Queries: 500

Performance L4 b1 c16

Query throughput: 2.1K tok/s

Query latency (p50): 444.7 ms


LegalBenchConsumerContractsQA

legal retrieval en

Question answering on consumer contracts

Corpus: 153 Queries: 396

Performance L4 b1 c16

Query throughput: 14.6K tok/s

Query latency (p50): 450.9 ms


SCIDOCS

scientific retrieval en

Citation prediction, document classification, and recommendation for scientific papers

Corpus: 25,656 Queries: 1,000

Performance L4 b1 c16

Query throughput: 7.0K tok/s

Query latency (p50): 457.1 ms


StackOverflowQA

technology retrieval en

Programming question answering from Stack Overflow

Corpus: 19,931 Queries: 1,994

Performance L4 b1 c16

Query throughput: 11.4K tok/s

Query latency (p50): 534.8 ms


