Model Name | e5-Base |
Description | A good general model for similarity search or downstream enrichments |
Use For | General text blobs |
Limitations | This model only works for English texts. Long texts will be truncated to at most 512 tokens |
Graft Default | Yes |
Reference information
Source | e5-base - hugging face |
Trained on | Common Crawl's web crawl corpus |
Paper | Text Embeddings by Weakly-Supervised Contrastive Pre-training |
Embedding Dimension | 768 |