Qdrant
Qdrant/all_miniLM_L6_v2_with_attentions
ONNX port of sentence-transformers/all-MiniLM-L6-v2 adjusted to return attention weights....
Model Documentation
ONNX port of sentence-transformers/all-MiniLM-L6-v2 adjusted to return attention weights.
This model is intended to be used for BM42 searches.
Here's an example of performing inference using the model with FastEmbed.
> Note: This model is supposed to be used with Qdrant. Vectors have to be configured with Modifier.IDF.
This model is intended to be used for BM42 searches.
Usage
Here's an example of performing inference using the model with FastEmbed.
> Note: This model is supposed to be used with Qdrant. Vectors have to be configured with Modifier.IDF.
py
from fastembed import SparseTextEmbedding
documents = [
"You should stay, study and sprint.",
"History can only prepare us to be surprised yet again.",
]
model = SparseTextEmbedding(model_name="Qdrant/bm42-all-minilm-l6-v2-attentions")
embeddings = list(model.embed(documents))
[
SparseEmbedding(values=array([0.26399775, 0.24662513, 0.47077307]),
indices=array([1881538586, 150760872, 1932363795])),
SparseEmbedding(values=array(
[0.38320042, 0.25453135, 0.18017513, 0.30432631, 0.1373556]),
indices=array([
733618285, 1849833631, 1008800696, 2090661150,
1117393019
]))
]
Files & Weights
| Filename | Size | Action |
|---|---|---|
| model.onnx | 0.08 GB |