"recall_at_2": 0.162 for "sentence-transformers/all-MiniLM-L6-v2" and "LIMITSmallRetrieval" #14

@michaelwechner

Hi

I ran MTEB with "sentence-transformers/all-MiniLM-L6-v2" on "LIMITSmallRetrieval" and got

"recall_at_2": 0.162

I then ran my own code, which uses the Apache Lucene vector search implementation, with the same model ("sentence-transformers/all-MiniLM-L6-v2") and task ("LIMITSmallRetrieval") and got

"recall_at_2": 0.129

or, when I run it a second time,

"recall_at_2": 0.134

Apache Lucene uses an HNSW graph implementation (https://lucene.apache.org/core/9_1_0/core/org/apache/lucene/util/hnsw/HnswGraph.html), which performs approximate nearest-neighbor search. This quite probably affects the search results, so recall@2 will quite probably differ from the MTEB value. Still, it would be good to compare my numbers with the results of others.
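One way to separate the effect of the HNSW approximation from other differences (similarity function, normalization, embedding pipeline) would be to score the same embeddings with exact brute-force search and compare recall@k. Below is a minimal sketch of that baseline in plain Python. It assumes the embeddings are already available as lists of floats and that relevance judgments are a mapping from query id to a set of relevant document ids. All function names and the toy data are hypothetical, and recall@k is assumed here to mean "relevant documents in the top k divided by all relevant documents for the query, averaged over queries"; I have not verified that this matches MTEB's computation in every detail.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def exact_top_k(query_vec, doc_vecs, k):
    """Exhaustively score every document; ties are broken by document id."""
    ranked = sorted(
        doc_vecs.items(),
        key=lambda item: (-cosine(query_vec, item[1]), item[0]),
    )
    return [doc_id for doc_id, _ in ranked[:k]]


def recall_at_k(retrieved, relevant, k):
    """Fraction of the relevant documents that appear in the top k results."""
    hits = sum(1 for doc_id in retrieved[:k] if doc_id in relevant)
    return hits / len(relevant)


def mean_recall_at_k(query_vecs, doc_vecs, qrels, k):
    """Average recall@k over all queries that have at least one relevant document."""
    scores = []
    for query_id, query_vec in query_vecs.items():
        relevant = qrels.get(query_id, set())
        if not relevant:
            continue  # skip queries without relevance judgments
        retrieved = exact_top_k(query_vec, doc_vecs, k)
        scores.append(recall_at_k(retrieved, relevant, k))
    return sum(scores) / len(scores) if scores else 0.0


# Toy data (hypothetical, not taken from LIMITSmallRetrieval).
docs = {"d1": [1.0, 0.0], "d2": [0.0, 1.0], "d3": [1.0, 1.0]}
queries = {"q1": [1.0, 0.1], "q2": [0.1, 1.0]}
qrels = {"q1": {"d1", "d2"}, "q2": {"d2", "d3"}}

print(mean_recall_at_k(queries, docs, qrels, k=2))  # 0.75
```

Passing the top-k document ids returned by Lucene for each query to `recall_at_k` and comparing against the exact baseline would show how much of the gap comes from the approximate graph search and how much from the rest of the pipeline.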

I could not find any results for this combination on https://huggingface.co/spaces/mteb/leaderboard or elsewhere.

Where could I find other results to compare my own implementation(s) against?
