https://store-images.s-microsoft.com/image/apps.10812.0d3370c0-f762-4140-b81c-a488c2153b03.7b0e01b1-4db2-4345-baa8-b5fde216d878.98735569-d798-4c09-a829-6366eb60067f

Jina Embeddings v2 Base - es

Jina AI

Jina Embeddings v2 Base - es

Jina AI

Text embedding model (base) for English and Spanish input of size up to 8192 tokens.

  • jina-embeddings-v2-base-es is an open-source bilingual Spanish-English embedding model supporting 8192 sequence length.
  • This state-of-the-art AI embedding model enables many applications, such as document clustering, classification, content personalization, vector search, or retrieval augmented generation.

Highlights:
  • State-of-the-art: This model is designed for high performance in mono-lingual & cross-lingual applications and has been trained specifically to support mixed Spanish-English input without bias.

  • Extended Context: An 8192-token length enables jina-embeddings-v2-base-es to support longer texts and document fragments, far surpassing models that only support a few hundred tokens at a time.

  • Compact Size: jina-embeddings-v2-base-es is built for high performance on standard computer hardware. With only 161 million parameters, the entire model is only 322MB. The embeddings themselves are 768 dimensions, a relatively small vector size compared to many models, saving space and run-time for applications.