Inference
Build end-to-end faster with models hosted by Pinecone.
llama-text-embed-v2
NVIDIA
| Task | Embedding |
| Modality | Text |
| Max Input Tokens | 2048 |
| Price | $0.16 / million tokens |
multilingual-e5-large
MICROSOFT
| Task | Embedding |
| Modality | Text |
| Max Input Tokens | 507 |
| Price | $0.08 / million tokens |
cohere-rerank-3.5
COHERE
| Task | Rerank |
| Modality | Text |
| Max Input Tokens | 4096 |
| Price | $2.00 / 1k requests |

pinecone-sparse-english-v0
PINECONE
| Task | Embedding |
| Modality | Text |
| Max Input Tokens | 512 |
| Price | $0.08 / million tokens |

bge-reranker-v2-m3
BAAI
| Task | Rerank |
| Modality | Text |
| Max Input Tokens | 1024 |
| Price | $2.00 / 1k requests |
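
The listed prices use two different billing units: embedding models are priced per million tokens, rerank models per 1k requests. As a rough illustration of how those figures translate into a bill, here is a minimal sketch. The names `PRICES` and `estimate_cost` are hypothetical helpers, not part of any Pinecone SDK, and the sketch assumes simple linear pricing with no free tier, minimums, or discounts.

```python
from decimal import Decimal

# Prices copied from the model cards above (USD).
# Each entry: (billing unit, price, number of units that price covers).
# Assumption: cost scales linearly with usage.
PRICES = {
    "llama-text-embed-v2": ("tokens", Decimal("0.16"), 1_000_000),
    "multilingual-e5-large": ("tokens", Decimal("0.08"), 1_000_000),
    "pinecone-sparse-english-v0": ("tokens", Decimal("0.08"), 1_000_000),
    "cohere-rerank-3.5": ("requests", Decimal("2.00"), 1_000),
    "bge-reranker-v2-m3": ("requests", Decimal("2.00"), 1_000),
}


def estimate_cost(model: str, units: int) -> Decimal:
    """Estimate the USD cost of `units` tokens (embedding) or requests (rerank)."""
    if units < 0:
        raise ValueError("units must be non-negative")
    _unit_name, price, per = PRICES[model]
    return price * units / per


if __name__ == "__main__":
    # Embedding 5 million tokens, then reranking 2,500 requests.
    print(estimate_cost("llama-text-embed-v2", 5_000_000))
    print(estimate_cost("cohere-rerank-3.5", 2_500))
```

`Decimal` is used instead of `float` so that small per-unit prices do not accumulate rounding error when summed across many calls.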