https://store-images.s-microsoft.com/image/apps.19458.3ce9930d-ea73-4abf-901d-b286762ea624.5493af6f-6762-4960-ae18-f51252c29a2e.5f038d15-1c33-42c6-8f43-cd485e220cfd

rerank-2.5 Reranker
MongoDB, Inc.

Categorías

IA y Machine Learning Proceso

Soporte técnico

Legal

Contrato de licencia Directiva de privacidad

rerank-2.5 Reranker

MongoDB, Inc.

Información general Planes Ratings + reviews

Reranker model for refining retrieval/search accuracy with instruction-following. 32K context length

[This offering is not optimized for latency. A latency-optimized version is coming soon.]

High-accuracy instruction-following reranker that refines search results and improves retrieval quality across domains. Supports 32K context length. Throughput varies significantly by workload pattern based on factors like GPU type, model size, sequence length, batch size, and vector dimensionality. Typically we see ~75k~150k tokens/sec for this model on A100 GPUs. We recommend customers benchmark their own throughput and token volume during testing to inform token TCO estimates.

Rerank 2.5:

Outperforms Cohere Rerank v3.5 by 7.94% on 93 benchmark datasets across multiple domains
Introduces instruction-following capability to steer reranking using natural language
Improves accuracy by 12.70% on the Massive Instructed Retrieval Benchmark (MAIR)
Delivers double the context length of rerank-2 (32K vs. 16K) at the same cost
Optimized to enhance results from first-stage retrieval methods like BM25, OpenAI v3-large, voyage-3, and voyage-3.5
Provides a seamless upgrade path from rerank-2 with better quality, broader domain coverage, and no pricing changes

Más información

Voyage AI Documentation Voyage AI rerank-2.5

rerank-2.5 RerankerMongoDB, Inc.

rerank-2.5 Reranker

MongoDB, Inc.

rerank-2.5 Reranker

MongoDB, Inc.

Reranker model for refining retrieval/search accuracy with instruction-following. 32K context length

Más información

rerank-2.5 Reranker
MongoDB, Inc.