UAE-Large-V1/README.md

65 KiB

tags model-index license language
mteb
sentence_embedding
feature_extraction
sentence-transformers
transformers
transformers.js
name results
UAE-Large-V1
task dataset metrics
type
Classification
type name config split revision
mteb/amazon_counterfactual MTEB AmazonCounterfactualClassification (en) en test e8379541af4e31359cca9fbcf4b00f2671dba205
type value
accuracy 75.55223880597015
type value
ap 38.264070815317794
type value
f1 69.40977934769845
task dataset metrics
type
Classification
type name config split revision
mteb/amazon_polarity MTEB AmazonPolarityClassification default test e2d317d38cd51312af73b3d32a06d1a08b442046
type value
accuracy 92.84267499999999
type value
ap 89.57568507997713
type value
f1 92.82590734337774
task dataset metrics
type
Classification
type name config split revision
mteb/amazon_reviews_multi MTEB AmazonReviewsClassification (en) en test 1399c76144fd37290681b995c656ef9b2e06e26d
type value
accuracy 48.292
type value
f1 47.90257816032778
task dataset metrics
type
Retrieval
type name config split revision
arguana MTEB ArguAna default test None
type value
map_at_1 42.105
type value
map_at_10 58.181000000000004
type value
map_at_100 58.653999999999996
type value
map_at_1000 58.657000000000004
type value
map_at_3 54.386
type value
map_at_5 56.757999999999996
type value
mrr_at_1 42.745
type value
mrr_at_10 58.437
type value
mrr_at_100 58.894999999999996
type value
mrr_at_1000 58.897999999999996
type value
mrr_at_3 54.635
type value
mrr_at_5 56.99999999999999
type value
ndcg_at_1 42.105
type value
ndcg_at_10 66.14999999999999
type value
ndcg_at_100 68.048
type value
ndcg_at_1000 68.11399999999999
type value
ndcg_at_3 58.477000000000004
type value
ndcg_at_5 62.768
type value
precision_at_1 42.105
type value
precision_at_10 9.110999999999999
type value
precision_at_100 0.991
type value
precision_at_1000 0.1
type value
precision_at_3 23.447000000000003
type value
precision_at_5 16.159000000000002
type value
recall_at_1 42.105
type value
recall_at_10 91.11
type value
recall_at_100 99.14699999999999
type value
recall_at_1000 99.644
type value
recall_at_3 70.341
type value
recall_at_5 80.797
task dataset metrics
type
Clustering
type name config split revision
mteb/arxiv-clustering-p2p MTEB ArxivClusteringP2P default test a122ad7f3f0291bf49cc6f4d32aa80929df69d5d
type value
v_measure 49.02580759154173
task dataset metrics
type
Clustering
type name config split revision
mteb/arxiv-clustering-s2s MTEB ArxivClusteringS2S default test f910caf1a6075f7329cdf8c1a6135696f37dbd53
type value
v_measure 43.093601280163554
task dataset metrics
type
Reranking
type name config split revision
mteb/askubuntudupquestions-reranking MTEB AskUbuntuDupQuestions default test 2000358ca161889fa9c082cb41daa8dcfb161a54
type value
map 64.19590406875427
type value
mrr 77.09547992788991
task dataset metrics
type
STS
type name config split revision
mteb/biosses-sts MTEB BIOSSES default test d3fb88f8f02e40887cd149695127462bbcf29b4a
type value
cos_sim_pearson 87.86678362843676
type value
cos_sim_spearman 86.1423242570783
type value
euclidean_pearson 85.98994198511751
type value
euclidean_spearman 86.48209103503942
type value
manhattan_pearson 85.6446436316182
type value
manhattan_spearman 86.21039809734357
task dataset metrics
type
Classification
type name config split revision
mteb/banking77 MTEB Banking77Classification default test 0fd18e25b25c072e09e0d92ab615fda904d66300
type value
accuracy 87.69155844155844
type value
f1 87.68109381943547
task dataset metrics
type
Clustering
type name config split revision
mteb/biorxiv-clustering-p2p MTEB BiorxivClusteringP2P default test 65b79d1d13f80053f67aca9498d9402c2d9f1f40
type value
v_measure 39.37501687500394
task dataset metrics
type
Clustering
type name config split revision
mteb/biorxiv-clustering-s2s MTEB BiorxivClusteringS2S default test 258694dd0231531bc1fd9de6ceb52a0853c6d908
type value
v_measure 37.23401405155885
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackAndroidRetrieval default test None
type value
map_at_1 30.232
type value
map_at_10 41.404999999999994
type value
map_at_100 42.896
type value
map_at_1000 43.028
type value
map_at_3 37.925
type value
map_at_5 39.865
type value
mrr_at_1 36.338
type value
mrr_at_10 46.969
type value
mrr_at_100 47.684
type value
mrr_at_1000 47.731
type value
mrr_at_3 44.063
type value
mrr_at_5 45.908
type value
ndcg_at_1 36.338
type value
ndcg_at_10 47.887
type value
ndcg_at_100 53.357
type value
ndcg_at_1000 55.376999999999995
type value
ndcg_at_3 42.588
type value
ndcg_at_5 45.132
type value
precision_at_1 36.338
type value
precision_at_10 9.17
type value
precision_at_100 1.4909999999999999
type value
precision_at_1000 0.196
type value
precision_at_3 20.315
type value
precision_at_5 14.793000000000001
type value
recall_at_1 30.232
type value
recall_at_10 60.67399999999999
type value
recall_at_100 83.628
type value
recall_at_1000 96.209
type value
recall_at_3 45.48
type value
recall_at_5 52.354
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackEnglishRetrieval default test None
type value
map_at_1 32.237
type value
map_at_10 42.829
type value
map_at_100 44.065
type value
map_at_1000 44.199
type value
map_at_3 39.885999999999996
type value
map_at_5 41.55
type value
mrr_at_1 40.064
type value
mrr_at_10 48.611
type value
mrr_at_100 49.245
type value
mrr_at_1000 49.29
type value
mrr_at_3 46.561
type value
mrr_at_5 47.771
type value
ndcg_at_1 40.064
type value
ndcg_at_10 48.388
type value
ndcg_at_100 52.666999999999994
type value
ndcg_at_1000 54.67100000000001
type value
ndcg_at_3 44.504
type value
ndcg_at_5 46.303
type value
precision_at_1 40.064
type value
precision_at_10 9.051
type value
precision_at_100 1.4500000000000002
type value
precision_at_1000 0.193
type value
precision_at_3 21.444
type value
precision_at_5 15.045
type value
recall_at_1 32.237
type value
recall_at_10 57.943999999999996
type value
recall_at_100 75.98700000000001
type value
recall_at_1000 88.453
type value
recall_at_3 46.268
type value
recall_at_5 51.459999999999994
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackGamingRetrieval default test None
type value
map_at_1 38.797
type value
map_at_10 51.263000000000005
type value
map_at_100 52.333
type value
map_at_1000 52.393
type value
map_at_3 47.936
type value
map_at_5 49.844
type value
mrr_at_1 44.389
type value
mrr_at_10 54.601
type value
mrr_at_100 55.300000000000004
type value
mrr_at_1000 55.333
type value
mrr_at_3 52.068999999999996
type value
mrr_at_5 53.627
type value
ndcg_at_1 44.389
type value
ndcg_at_10 57.193000000000005
type value
ndcg_at_100 61.307
type value
ndcg_at_1000 62.529
type value
ndcg_at_3 51.607
type value
ndcg_at_5 54.409
type value
precision_at_1 44.389
type value
precision_at_10 9.26
type value
precision_at_100 1.222
type value
precision_at_1000 0.13699999999999998
type value
precision_at_3 23.03
type value
precision_at_5 15.887
type value
recall_at_1 38.797
type value
recall_at_10 71.449
type value
recall_at_100 88.881
type value
recall_at_1000 97.52
type value
recall_at_3 56.503
type value
recall_at_5 63.392
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackGisRetrieval default test None
type value
map_at_1 27.291999999999998
type value
map_at_10 35.65
type value
map_at_100 36.689
type value
map_at_1000 36.753
type value
map_at_3 32.995000000000005
type value
map_at_5 34.409
type value
mrr_at_1 29.04
type value
mrr_at_10 37.486000000000004
type value
mrr_at_100 38.394
type value
mrr_at_1000 38.445
type value
mrr_at_3 35.028
type value
mrr_at_5 36.305
type value
ndcg_at_1 29.04
type value
ndcg_at_10 40.613
type value
ndcg_at_100 45.733000000000004
type value
ndcg_at_1000 47.447
type value
ndcg_at_3 35.339999999999996
type value
ndcg_at_5 37.706
type value
precision_at_1 29.04
type value
precision_at_10 6.192
type value
precision_at_100 0.9249999999999999
type value
precision_at_1000 0.11
type value
precision_at_3 14.802000000000001
type value
precision_at_5 10.305
type value
recall_at_1 27.291999999999998
type value
recall_at_10 54.25299999999999
type value
recall_at_100 77.773
type value
recall_at_1000 90.795
type value
recall_at_3 39.731
type value
recall_at_5 45.403999999999996
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackMathematicaRetrieval default test None
type value
map_at_1 18.326
type value
map_at_10 26.290999999999997
type value
map_at_100 27.456999999999997
type value
map_at_1000 27.583000000000002
type value
map_at_3 23.578
type value
map_at_5 25.113000000000003
type value
mrr_at_1 22.637
type value
mrr_at_10 31.139
type value
mrr_at_100 32.074999999999996
type value
mrr_at_1000 32.147
type value
mrr_at_3 28.483000000000004
type value
mrr_at_5 29.963
type value
ndcg_at_1 22.637
type value
ndcg_at_10 31.717000000000002
type value
ndcg_at_100 37.201
type value
ndcg_at_1000 40.088
type value
ndcg_at_3 26.686
type value
ndcg_at_5 29.076999999999998
type value
precision_at_1 22.637
type value
precision_at_10 5.7090000000000005
type value
precision_at_100 0.979
type value
precision_at_1000 0.13799999999999998
type value
precision_at_3 12.894
type value
precision_at_5 9.328
type value
recall_at_1 18.326
type value
recall_at_10 43.824999999999996
type value
recall_at_100 67.316
type value
recall_at_1000 87.481
type value
recall_at_3 29.866999999999997
type value
recall_at_5 35.961999999999996
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackPhysicsRetrieval default test None
type value
map_at_1 29.875
type value
map_at_10 40.458
type value
map_at_100 41.772
type value
map_at_1000 41.882999999999996
type value
map_at_3 37.086999999999996
type value
map_at_5 39.153
type value
mrr_at_1 36.381
type value
mrr_at_10 46.190999999999995
type value
mrr_at_100 46.983999999999995
type value
mrr_at_1000 47.032000000000004
type value
mrr_at_3 43.486999999999995
type value
mrr_at_5 45.249
type value
ndcg_at_1 36.381
type value
ndcg_at_10 46.602
type value
ndcg_at_100 51.885999999999996
type value
ndcg_at_1000 53.895
type value
ndcg_at_3 41.155
type value
ndcg_at_5 44.182
type value
precision_at_1 36.381
type value
precision_at_10 8.402
type value
precision_at_100 1.278
type value
precision_at_1000 0.16199999999999998
type value
precision_at_3 19.346
type value
precision_at_5 14.09
type value
recall_at_1 29.875
type value
recall_at_10 59.065999999999995
type value
recall_at_100 80.923
type value
recall_at_1000 93.927
type value
recall_at_3 44.462
type value
recall_at_5 51.89
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackProgrammersRetrieval default test None
type value
map_at_1 24.94
type value
map_at_10 35.125
type value
map_at_100 36.476
type value
map_at_1000 36.579
type value
map_at_3 31.840000000000003
type value
map_at_5 33.647
type value
mrr_at_1 30.936000000000003
type value
mrr_at_10 40.637
type value
mrr_at_100 41.471000000000004
type value
mrr_at_1000 41.525
type value
mrr_at_3 38.013999999999996
type value
mrr_at_5 39.469
type value
ndcg_at_1 30.936000000000003
type value
ndcg_at_10 41.295
type value
ndcg_at_100 46.92
type value
ndcg_at_1000 49.183
type value
ndcg_at_3 35.811
type value
ndcg_at_5 38.306000000000004
type value
precision_at_1 30.936000000000003
type value
precision_at_10 7.728
type value
precision_at_100 1.226
type value
precision_at_1000 0.158
type value
precision_at_3 17.237
type value
precision_at_5 12.42
type value
recall_at_1 24.94
type value
recall_at_10 54.235
type value
recall_at_100 78.314
type value
recall_at_1000 93.973
type value
recall_at_3 38.925
type value
recall_at_5 45.505
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackRetrieval default test None
type value
map_at_1 26.250833333333333
type value
map_at_10 35.46875
type value
map_at_100 36.667
type value
map_at_1000 36.78025
type value
map_at_3 32.56733333333334
type value
map_at_5 34.20333333333333
type value
mrr_at_1 30.8945
type value
mrr_at_10 39.636833333333335
type value
mrr_at_100 40.46508333333333
type value
mrr_at_1000 40.521249999999995
type value
mrr_at_3 37.140166666666666
type value
mrr_at_5 38.60999999999999
type value
ndcg_at_1 30.8945
type value
ndcg_at_10 40.93441666666667
type value
ndcg_at_100 46.062416666666664
type value
ndcg_at_1000 48.28341666666667
type value
ndcg_at_3 35.97575
type value
ndcg_at_5 38.3785
type value
precision_at_1 30.8945
type value
precision_at_10 7.180250000000001
type value
precision_at_100 1.1468333333333334
type value
precision_at_1000 0.15283333333333332
type value
precision_at_3 16.525583333333334
type value
precision_at_5 11.798333333333332
type value
recall_at_1 26.250833333333333
type value
recall_at_10 52.96108333333333
type value
recall_at_100 75.45908333333334
type value
recall_at_1000 90.73924999999998
type value
recall_at_3 39.25483333333333
type value
recall_at_5 45.37950000000001
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackStatsRetrieval default test None
type value
map_at_1 24.595
type value
map_at_10 31.747999999999998
type value
map_at_100 32.62
type value
map_at_1000 32.713
type value
map_at_3 29.48
type value
map_at_5 30.635
type value
mrr_at_1 27.607
type value
mrr_at_10 34.449000000000005
type value
mrr_at_100 35.182
type value
mrr_at_1000 35.254000000000005
type value
mrr_at_3 32.413
type value
mrr_at_5 33.372
type value
ndcg_at_1 27.607
type value
ndcg_at_10 36.041000000000004
type value
ndcg_at_100 40.514
type value
ndcg_at_1000 42.851
type value
ndcg_at_3 31.689
type value
ndcg_at_5 33.479
type value
precision_at_1 27.607
type value
precision_at_10 5.66
type value
precision_at_100 0.868
type value
precision_at_1000 0.11299999999999999
type value
precision_at_3 13.446
type value
precision_at_5 9.264
type value
recall_at_1 24.595
type value
recall_at_10 46.79
type value
recall_at_100 67.413
type value
recall_at_1000 84.753
type value
recall_at_3 34.644999999999996
type value
recall_at_5 39.09
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackTexRetrieval default test None
type value
map_at_1 17.333000000000002
type value
map_at_10 24.427
type value
map_at_100 25.576
type value
map_at_1000 25.692999999999998
type value
map_at_3 22.002
type value
map_at_5 23.249
type value
mrr_at_1 20.716
type value
mrr_at_10 28.072000000000003
type value
mrr_at_100 29.067
type value
mrr_at_1000 29.137
type value
mrr_at_3 25.832
type value
mrr_at_5 27.045
type value
ndcg_at_1 20.716
type value
ndcg_at_10 29.109
type value
ndcg_at_100 34.797
type value
ndcg_at_1000 37.503
type value
ndcg_at_3 24.668
type value
ndcg_at_5 26.552999999999997
type value
precision_at_1 20.716
type value
precision_at_10 5.351
type value
precision_at_100 0.955
type value
precision_at_1000 0.136
type value
precision_at_3 11.584999999999999
type value
precision_at_5 8.362
type value
recall_at_1 17.333000000000002
type value
recall_at_10 39.604
type value
recall_at_100 65.525
type value
recall_at_1000 84.651
type value
recall_at_3 27.199
type value
recall_at_5 32.019
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackUnixRetrieval default test None
type value
map_at_1 26.342
type value
map_at_10 35.349000000000004
type value
map_at_100 36.443
type value
map_at_1000 36.548
type value
map_at_3 32.307
type value
map_at_5 34.164
type value
mrr_at_1 31.063000000000002
type value
mrr_at_10 39.703
type value
mrr_at_100 40.555
type value
mrr_at_1000 40.614
type value
mrr_at_3 37.141999999999996
type value
mrr_at_5 38.812000000000005
type value
ndcg_at_1 31.063000000000002
type value
ndcg_at_10 40.873
type value
ndcg_at_100 45.896
type value
ndcg_at_1000 48.205999999999996
type value
ndcg_at_3 35.522
type value
ndcg_at_5 38.419
type value
precision_at_1 31.063000000000002
type value
precision_at_10 6.866
type value
precision_at_100 1.053
type value
precision_at_1000 0.13699999999999998
type value
precision_at_3 16.014
type value
precision_at_5 11.604000000000001
type value
recall_at_1 26.342
type value
recall_at_10 53.40200000000001
type value
recall_at_100 75.251
type value
recall_at_1000 91.13799999999999
type value
recall_at_3 39.103
type value
recall_at_5 46.357
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackWebmastersRetrieval default test None
type value
map_at_1 23.71
type value
map_at_10 32.153999999999996
type value
map_at_100 33.821
type value
map_at_1000 34.034
type value
map_at_3 29.376
type value
map_at_5 30.878
type value
mrr_at_1 28.458
type value
mrr_at_10 36.775999999999996
type value
mrr_at_100 37.804
type value
mrr_at_1000 37.858999999999995
type value
mrr_at_3 34.123999999999995
type value
mrr_at_5 35.596
type value
ndcg_at_1 28.458
type value
ndcg_at_10 37.858999999999995
type value
ndcg_at_100 44.194
type value
ndcg_at_1000 46.744
type value
ndcg_at_3 33.348
type value
ndcg_at_5 35.448
type value
precision_at_1 28.458
type value
precision_at_10 7.4510000000000005
type value
precision_at_100 1.5
type value
precision_at_1000 0.23700000000000002
type value
precision_at_3 15.809999999999999
type value
precision_at_5 11.462
type value
recall_at_1 23.71
type value
recall_at_10 48.272999999999996
type value
recall_at_100 77.134
type value
recall_at_1000 93.001
type value
recall_at_3 35.480000000000004
type value
recall_at_5 41.19
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackWordpressRetrieval default test None
type value
map_at_1 21.331
type value
map_at_10 28.926000000000002
type value
map_at_100 29.855999999999998
type value
map_at_1000 29.957
type value
map_at_3 26.395999999999997
type value
map_at_5 27.933000000000003
type value
mrr_at_1 23.105
type value
mrr_at_10 31.008000000000003
type value
mrr_at_100 31.819999999999997
type value
mrr_at_1000 31.887999999999998
type value
mrr_at_3 28.466
type value
mrr_at_5 30.203000000000003
type value
ndcg_at_1 23.105
type value
ndcg_at_10 33.635999999999996
type value
ndcg_at_100 38.277
type value
ndcg_at_1000 40.907
type value
ndcg_at_3 28.791
type value
ndcg_at_5 31.528
type value
precision_at_1 23.105
type value
precision_at_10 5.323
type value
precision_at_100 0.815
type value
precision_at_1000 0.117
type value
precision_at_3 12.384
type value
precision_at_5 9.02
type value
recall_at_1 21.331
type value
recall_at_10 46.018
type value
recall_at_100 67.364
type value
recall_at_1000 86.97
type value
recall_at_3 33.395
type value
recall_at_5 39.931
task dataset metrics
type
Retrieval
type name config split revision
climate-fever MTEB ClimateFEVER default test None
type value
map_at_1 17.011000000000003
type value
map_at_10 28.816999999999997
type value
map_at_100 30.761
type value
map_at_1000 30.958000000000002
type value
map_at_3 24.044999999999998
type value
map_at_5 26.557
type value
mrr_at_1 38.696999999999996
type value
mrr_at_10 50.464
type value
mrr_at_100 51.193999999999996
type value
mrr_at_1000 51.219
type value
mrr_at_3 47.339999999999996
type value
mrr_at_5 49.346000000000004
type value
ndcg_at_1 38.696999999999996
type value
ndcg_at_10 38.53
type value
ndcg_at_100 45.525
type value
ndcg_at_1000 48.685
type value
ndcg_at_3 32.282
type value
ndcg_at_5 34.482
type value
precision_at_1 38.696999999999996
type value
precision_at_10 11.895999999999999
type value
precision_at_100 1.95
type value
precision_at_1000 0.254
type value
precision_at_3 24.038999999999998
type value
precision_at_5 18.332
type value
recall_at_1 17.011000000000003
type value
recall_at_10 44.452999999999996
type value
recall_at_100 68.223
type value
recall_at_1000 85.653
type value
recall_at_3 28.784
type value
recall_at_5 35.66
task dataset metrics
type
Retrieval
type name config split revision
dbpedia-entity MTEB DBPedia default test None
type value
map_at_1 9.516
type value
map_at_10 21.439
type value
map_at_100 31.517
type value
map_at_1000 33.267
type value
map_at_3 15.004999999999999
type value
map_at_5 17.793999999999997
type value
mrr_at_1 71.25
type value
mrr_at_10 79.071
type value
mrr_at_100 79.325
type value
mrr_at_1000 79.33
type value
mrr_at_3 77.708
type value
mrr_at_5 78.546
type value
ndcg_at_1 58.62500000000001
type value
ndcg_at_10 44.889
type value
ndcg_at_100 50.536
type value
ndcg_at_1000 57.724
type value
ndcg_at_3 49.32
type value
ndcg_at_5 46.775
type value
precision_at_1 71.25
type value
precision_at_10 36.175000000000004
type value
precision_at_100 11.940000000000001
type value
precision_at_1000 2.178
type value
precision_at_3 53.583000000000006
type value
precision_at_5 45.550000000000004
type value
recall_at_1 9.516
type value
recall_at_10 27.028000000000002
type value
recall_at_100 57.581
type value
recall_at_1000 80.623
type value
recall_at_3 16.313
type value
recall_at_5 20.674
task dataset metrics
type
Classification
type name config split revision
mteb/emotion MTEB EmotionClassification default test 4f58c6b202a23cf9a4da393831edf4f9183cad37
type value
accuracy 51.74999999999999
type value
f1 46.46706502669774
task dataset metrics
type
Retrieval
type name config split revision
fever MTEB FEVER default test None
type value
map_at_1 77.266
type value
map_at_10 84.89999999999999
type value
map_at_100 85.109
type value
map_at_1000 85.123
type value
map_at_3 83.898
type value
map_at_5 84.541
type value
mrr_at_1 83.138
type value
mrr_at_10 89.37
type value
mrr_at_100 89.432
type value
mrr_at_1000 89.43299999999999
type value
mrr_at_3 88.836
type value
mrr_at_5 89.21
type value
ndcg_at_1 83.138
type value
ndcg_at_10 88.244
type value
ndcg_at_100 88.98700000000001
type value
ndcg_at_1000 89.21900000000001
type value
ndcg_at_3 86.825
type value
ndcg_at_5 87.636
type value
precision_at_1 83.138
type value
precision_at_10 10.47
type value
precision_at_100 1.1079999999999999
type value
precision_at_1000 0.11499999999999999
type value
precision_at_3 32.933
type value
precision_at_5 20.36
type value
recall_at_1 77.266
type value
recall_at_10 94.063
type value
recall_at_100 96.993
type value
recall_at_1000 98.414
type value
recall_at_3 90.228
type value
recall_at_5 92.328
task dataset metrics
type
Retrieval
type name config split revision
fiqa MTEB FiQA2018 default test None
type value
map_at_1 22.319
type value
map_at_10 36.943
type value
map_at_100 38.951
type value
map_at_1000 39.114
type value
map_at_3 32.82
type value
map_at_5 34.945
type value
mrr_at_1 44.135999999999996
type value
mrr_at_10 53.071999999999996
type value
mrr_at_100 53.87
type value
mrr_at_1000 53.90200000000001
type value
mrr_at_3 50.77199999999999
type value
mrr_at_5 52.129999999999995
type value
ndcg_at_1 44.135999999999996
type value
ndcg_at_10 44.836
type value
ndcg_at_100 51.754
type value
ndcg_at_1000 54.36
type value
ndcg_at_3 41.658
type value
ndcg_at_5 42.354
type value
precision_at_1 44.135999999999996
type value
precision_at_10 12.284
type value
precision_at_100 1.952
type value
precision_at_1000 0.242
type value
precision_at_3 27.828999999999997
type value
precision_at_5 20.093
type value
recall_at_1 22.319
type value
recall_at_10 51.528
type value
recall_at_100 76.70700000000001
type value
recall_at_1000 92.143
type value
recall_at_3 38.641
type value
recall_at_5 43.653999999999996
task dataset metrics
type
Retrieval
type name config split revision
hotpotqa MTEB HotpotQA default test None
type value
map_at_1 40.182
type value
map_at_10 65.146
type value
map_at_100 66.023
type value
map_at_1000 66.078
type value
map_at_3 61.617999999999995
type value
map_at_5 63.82299999999999
type value
mrr_at_1 80.365
type value
mrr_at_10 85.79
type value
mrr_at_100 85.963
type value
mrr_at_1000 85.968
type value
mrr_at_3 84.952
type value
mrr_at_5 85.503
type value
ndcg_at_1 80.365
type value
ndcg_at_10 73.13499999999999
type value
ndcg_at_100 76.133
type value
ndcg_at_1000 77.151
type value
ndcg_at_3 68.255
type value
ndcg_at_5 70.978
type value
precision_at_1 80.365
type value
precision_at_10 15.359
type value
precision_at_100 1.7690000000000001
type value
precision_at_1000 0.19
type value
precision_at_3 44.024
type value
precision_at_5 28.555999999999997
type value
recall_at_1 40.182
type value
recall_at_10 76.793
type value
recall_at_100 88.474
type value
recall_at_1000 95.159
type value
recall_at_3 66.036
type value
recall_at_5 71.391
task dataset metrics
type
Classification
type name config split revision
mteb/imdb MTEB ImdbClassification default test 3d86128a09e091d6018b6d26cad27f2739fc2db7
type value
accuracy 92.7796
type value
ap 89.24883716810874
type value
f1 92.7706903433313
task dataset metrics
type
Retrieval
type name config split revision
msmarco MTEB MSMARCO default dev None
type value
map_at_1 22.016
type value
map_at_10 34.408
type value
map_at_100 35.592
type value
map_at_1000 35.64
type value
map_at_3 30.459999999999997
type value
map_at_5 32.721000000000004
type value
mrr_at_1 22.593
type value
mrr_at_10 34.993
type value
mrr_at_100 36.113
type value
mrr_at_1000 36.156
type value
mrr_at_3 31.101
type value
mrr_at_5 33.364
type value
ndcg_at_1 22.579
type value
ndcg_at_10 41.404999999999994
type value
ndcg_at_100 47.018
type value
ndcg_at_1000 48.211999999999996
type value
ndcg_at_3 33.389
type value
ndcg_at_5 37.425000000000004
type value
precision_at_1 22.579
type value
precision_at_10 6.59
type value
precision_at_100 0.938
type value
precision_at_1000 0.104
type value
precision_at_3 14.241000000000001
type value
precision_at_5 10.59
type value
recall_at_1 22.016
type value
recall_at_10 62.927
type value
recall_at_100 88.72
type value
recall_at_1000 97.80799999999999
type value
recall_at_3 41.229
type value
recall_at_5 50.88
task dataset metrics
type
Classification
type name config split revision
mteb/mtop_domain MTEB MTOPDomainClassification (en) en test d80d48c1eb48d3562165c59d59d0034df9fff0bf
type value
accuracy 94.01732786137711
type value
f1 93.76353126402202
task dataset metrics
type
Classification
type name config split revision
mteb/mtop_intent MTEB MTOPIntentClassification (en) en test ae001d0e6b1228650b7bd1c2c65fb50ad11a8aba
type value
accuracy 76.91746466028272
type value
f1 57.715651682646765
task dataset metrics
type
Classification
type name config split revision
mteb/amazon_massive_intent MTEB MassiveIntentClassification (en) en test 31efe3c427b0bae9c22cbb560b8f15491cc6bed7
type value
accuracy 76.5030262273033
type value
f1 74.6693629986121
task dataset metrics
type
Classification
type name config split revision
mteb/amazon_massive_scenario MTEB MassiveScenarioClassification (en) en test 7d571f92784cd94a019292a1f45445077d0ef634
type value
accuracy 79.74781439139207
type value
f1 79.96684171018774
task dataset metrics
type
Clustering
type name config split revision
mteb/medrxiv-clustering-p2p MTEB MedrxivClusteringP2P default test e7a26af6f3ae46b30dde8737f02c07b1505bcc73
type value
v_measure 33.2156206892017
task dataset metrics
type
Clustering
type name config split revision
mteb/medrxiv-clustering-s2s MTEB MedrxivClusteringS2S default test 35191c8c0dca72d8ff3efcd72aa802307d469663
type value
v_measure 31.180539484816137
task dataset metrics
type
Reranking
type name config split revision
mteb/mind_small MTEB MindSmallReranking default test 3bdac13927fdc888b903db93b2ffdbd90b295a69
type value
map 32.51125957874274
type value
mrr 33.777037359249995
task dataset metrics
type
Retrieval
type name config split revision
nfcorpus MTEB NFCorpus default test None
type value
map_at_1 7.248
type value
map_at_10 15.340000000000002
type value
map_at_100 19.591
type value
map_at_1000 21.187
type value
map_at_3 11.329
type value
map_at_5 13.209999999999999
type value
mrr_at_1 47.678
type value
mrr_at_10 57.493
type value
mrr_at_100 58.038999999999994
type value
mrr_at_1000 58.07
type value
mrr_at_3 55.36600000000001
type value
mrr_at_5 56.635999999999996
type value
ndcg_at_1 46.129999999999995
type value
ndcg_at_10 38.653999999999996
type value
ndcg_at_100 36.288
type value
ndcg_at_1000 44.765
type value
ndcg_at_3 43.553
type value
ndcg_at_5 41.317
type value
precision_at_1 47.368
type value
precision_at_10 28.669
type value
precision_at_100 9.158
type value
precision_at_1000 2.207
type value
precision_at_3 40.97
type value
precision_at_5 35.604
type value
recall_at_1 7.248
type value
recall_at_10 19.46
type value
recall_at_100 37.214000000000006
type value
recall_at_1000 67.64099999999999
type value
recall_at_3 12.025
type value
recall_at_5 15.443999999999999
task dataset metrics
type
Retrieval
type name config split revision
nq MTEB NQ default test None
type value
map_at_1 31.595000000000002
type value
map_at_10 47.815999999999995
type value
map_at_100 48.811
type value
map_at_1000 48.835
type value
map_at_3 43.225
type value
map_at_5 46.017
type value
mrr_at_1 35.689
type value
mrr_at_10 50.341
type value
mrr_at_100 51.044999999999995
type value
mrr_at_1000 51.062
type value
mrr_at_3 46.553
type value
mrr_at_5 48.918
type value
ndcg_at_1 35.66
type value
ndcg_at_10 55.859
type value
ndcg_at_100 59.864
type value
ndcg_at_1000 60.419999999999995
type value
ndcg_at_3 47.371
type value
ndcg_at_5 51.995000000000005
type value
precision_at_1 35.66
type value
precision_at_10 9.27
type value
precision_at_100 1.1520000000000001
type value
precision_at_1000 0.12
type value
precision_at_3 21.63
type value
precision_at_5 15.655
type value
recall_at_1 31.595000000000002
type value
recall_at_10 77.704
type value
recall_at_100 94.774
type value
recall_at_1000 98.919
type value
recall_at_3 56.052
type value
recall_at_5 66.623
task dataset metrics
type
Retrieval
type name config split revision
quora MTEB QuoraRetrieval default test None
type value
map_at_1 71.489
type value
map_at_10 85.411
type value
map_at_100 86.048
type value
map_at_1000 86.064
type value
map_at_3 82.587
type value
map_at_5 84.339
type value
mrr_at_1 82.28
type value
mrr_at_10 88.27199999999999
type value
mrr_at_100 88.362
type value
mrr_at_1000 88.362
type value
mrr_at_3 87.372
type value
mrr_at_5 87.995
type value
ndcg_at_1 82.27
type value
ndcg_at_10 89.023
type value
ndcg_at_100 90.191
type value
ndcg_at_1000 90.266
type value
ndcg_at_3 86.37
type value
ndcg_at_5 87.804
type value
precision_at_1 82.27
type value
precision_at_10 13.469000000000001
type value
precision_at_100 1.533
type value
precision_at_1000 0.157
type value
precision_at_3 37.797
type value
precision_at_5 24.734
type value
recall_at_1 71.489
type value
recall_at_10 95.824
type value
recall_at_100 99.70599999999999
type value
recall_at_1000 99.979
type value
recall_at_3 88.099
type value
recall_at_5 92.285
task dataset metrics
type
Clustering
type name config split revision
mteb/reddit-clustering MTEB RedditClustering default test 24640382cdbf8abc73003fb0fa6d111a705499eb
type value
v_measure 60.52398807444541
task dataset metrics
type
Clustering
type name config split revision
mteb/reddit-clustering-p2p MTEB RedditClusteringP2P default test 282350215ef01743dc01b456c7f5241fa8937f16
type value
v_measure 65.34855891507871
task dataset metrics
type
Retrieval
type name config split revision
scidocs MTEB SCIDOCS default test None
type value
map_at_1 5.188000000000001
type value
map_at_10 13.987
type value
map_at_100 16.438
type value
map_at_1000 16.829
type value
map_at_3 9.767000000000001
type value
map_at_5 11.912
type value
mrr_at_1 25.6
type value
mrr_at_10 37.744
type value
mrr_at_100 38.847
type value
mrr_at_1000 38.894
type value
mrr_at_3 34.166999999999994
type value
mrr_at_5 36.207
type value
ndcg_at_1 25.6
type value
ndcg_at_10 22.980999999999998
type value
ndcg_at_100 32.039
type value
ndcg_at_1000 38.157000000000004
type value
ndcg_at_3 21.567
type value
ndcg_at_5 19.070999999999998
type value
precision_at_1 25.6
type value
precision_at_10 12.02
type value
precision_at_100 2.5100000000000002
type value
precision_at_1000 0.396
type value
precision_at_3 20.333000000000002
type value
precision_at_5 16.98
type value
recall_at_1 5.188000000000001
type value
recall_at_10 24.372
type value
recall_at_100 50.934999999999995
type value
recall_at_1000 80.477
type value
recall_at_3 12.363
type value
recall_at_5 17.203
task dataset metrics
type
STS
type name config split revision
mteb/sickr-sts MTEB SICK-R default test a6ea5a8cab320b040a23452cc28066d9beae2cee
type value
cos_sim_pearson 87.24286275535398
type value
cos_sim_spearman 82.62333770991818
type value
euclidean_pearson 84.60353717637284
type value
euclidean_spearman 82.32990108810047
type value
manhattan_pearson 84.6089049738196
type value
manhattan_spearman 82.33361785438936
task dataset metrics
type
STS
type name config split revision
mteb/sts12-sts MTEB STS12 default test a0d554a64d88156834ff5ae9920b964011b16384
type value
cos_sim_pearson 87.87428858503165
type value
cos_sim_spearman 79.09145886519929
type value
euclidean_pearson 86.42669231664036
type value
euclidean_spearman 80.03127375435449
type value
manhattan_pearson 86.41330338305022
type value
manhattan_spearman 80.02492538673368
task dataset metrics
type
STS
type name config split revision
mteb/sts13-sts MTEB STS13 default test 7e90230a92c190f1bf69ae9002b8cea547a64cca
type value
cos_sim_pearson 88.67912277322645
type value
cos_sim_spearman 89.6171319711762
type value
euclidean_pearson 86.56571917398725
type value
euclidean_spearman 87.71216907898948
type value
manhattan_pearson 86.57459050182473
type value
manhattan_spearman 87.71916648349993
task dataset metrics
type
STS
type name config split revision
mteb/sts14-sts MTEB STS14 default test 6031580fec1f6af667f0bd2da0a551cf4f0b2375
type value
cos_sim_pearson 86.71957379085862
type value
cos_sim_spearman 85.01784075851465
type value
euclidean_pearson 84.7407848472801
type value
euclidean_spearman 84.61063091345538
type value
manhattan_pearson 84.71494352494403
type value
manhattan_spearman 84.58772077604254
task dataset metrics
type
STS
type name config split revision
mteb/sts15-sts MTEB STS15 default test ae752c7c21bf194d8b67fd573edf7ae58183cbe3
type value
cos_sim_pearson 88.40508326325175
type value
cos_sim_spearman 89.50912897763186
type value
euclidean_pearson 87.82349070086627
type value
euclidean_spearman 88.44179162727521
type value
manhattan_pearson 87.80181927025595
type value
manhattan_spearman 88.43205129636243
task dataset metrics
type
STS
type name config split revision
mteb/sts16-sts MTEB STS16 default test 4d8694f8f0e0100860b497b999b3dbed754a0513
type value
cos_sim_pearson 85.35846741715478
type value
cos_sim_spearman 86.61172476741842
type value
euclidean_pearson 84.60123125491637
type value
euclidean_spearman 85.3001948141827
type value
manhattan_pearson 84.56231142658329
type value
manhattan_spearman 85.23579900798813
task dataset metrics
type
STS
type name config split revision
mteb/sts17-crosslingual-sts MTEB STS17 (en-en) en-en test af5e6fb845001ecf41f4c1e033ce921939a2a68d
type value
cos_sim_pearson 88.94539129818824
type value
cos_sim_spearman 88.99349064256742
type value
euclidean_pearson 88.7142444640351
type value
euclidean_spearman 88.34120813505011
type value
manhattan_pearson 88.70363008238084
type value
manhattan_spearman 88.31952816956954
task dataset metrics
type
STS
type name config split revision
mteb/sts22-crosslingual-sts MTEB STS22 (en) en test 6d1ba47164174a496b7fa5d3569dae26a6813b80
type value
cos_sim_pearson 68.29910260369893
type value
cos_sim_spearman 68.79263346213466
type value
euclidean_pearson 68.41627521422252
type value
euclidean_spearman 66.61602587398579
type value
manhattan_pearson 68.49402183447361
type value
manhattan_spearman 66.80157792354453
task dataset metrics
type
STS
type name config split revision
mteb/stsbenchmark-sts MTEB STSBenchmark default test b0fddb56ed78048fa8b90373c8a3cfc37b684831
type value
cos_sim_pearson 87.43703906343708
type value
cos_sim_spearman 89.06081805093662
type value
euclidean_pearson 87.48311456299662
type value
euclidean_spearman 88.07417597580013
type value
manhattan_pearson 87.48202249768894
type value
manhattan_spearman 88.04758031111642
task dataset metrics
type
Reranking
type name config split revision
mteb/scidocs-reranking MTEB SciDocsRR default test d3c5e1fc0b855ab6097bf1cda04dd73947d7caab
type value
map 87.49080620485203
type value
mrr 96.19145378949301
task dataset metrics
type
Retrieval
type name config split revision
scifact MTEB SciFact default test None
type value
map_at_1 59.317
type value
map_at_10 69.296
type value
map_at_100 69.738
type value
map_at_1000 69.759
type value
map_at_3 66.12599999999999
type value
map_at_5 67.532
type value
mrr_at_1 62
type value
mrr_at_10 70.176
type value
mrr_at_100 70.565
type value
mrr_at_1000 70.583
type value
mrr_at_3 67.833
type value
mrr_at_5 68.93299999999999
type value
ndcg_at_1 62
type value
ndcg_at_10 74.069
type value
ndcg_at_100 76.037
type value
ndcg_at_1000 76.467
type value
ndcg_at_3 68.628
type value
ndcg_at_5 70.57600000000001
type value
precision_at_1 62
type value
precision_at_10 10
type value
precision_at_100 1.097
type value
precision_at_1000 0.11299999999999999
type value
precision_at_3 26.667
type value
precision_at_5 17.4
type value
recall_at_1 59.317
type value
recall_at_10 87.822
type value
recall_at_100 96.833
type value
recall_at_1000 100
type value
recall_at_3 73.06099999999999
type value
recall_at_5 77.928
task dataset metrics
type
PairClassification
type name config split revision
mteb/sprintduplicatequestions-pairclassification MTEB SprintDuplicateQuestions default test d66bd1f72af766a5cc4b0ca5e00c162f89e8cc46
type value
cos_sim_accuracy 99.88910891089108
type value
cos_sim_ap 97.236958456951
type value
cos_sim_f1 94.39999999999999
type value
cos_sim_precision 94.39999999999999
type value
cos_sim_recall 94.39999999999999
type value
dot_accuracy 99.82574257425742
type value
dot_ap 94.94344759441888
type value
dot_f1 91.17352056168507
type value
dot_precision 91.44869215291752
type value
dot_recall 90.9
type value
euclidean_accuracy 99.88415841584158
type value
euclidean_ap 97.2044250782305
type value
euclidean_f1 94.210786739238
type value
euclidean_precision 93.24191968658178
type value
euclidean_recall 95.19999999999999
type value
manhattan_accuracy 99.88613861386139
type value
manhattan_ap 97.20683205497689
type value
manhattan_f1 94.2643391521197
type value
manhattan_precision 94.02985074626866
type value
manhattan_recall 94.5
type value
max_accuracy 99.88910891089108
type value
max_ap 97.236958456951
type value
max_f1 94.39999999999999
task dataset metrics
type
Clustering
type name config split revision
mteb/stackexchange-clustering MTEB StackExchangeClustering default test 6cbc1f7b2bc0622f2e39d2c77fa502909748c259
type value
v_measure 66.53940781726187
task dataset metrics
type
Clustering
type name config split revision
mteb/stackexchange-clustering-p2p MTEB StackExchangeClusteringP2P default test 815ca46b2622cec33ccafc3735d572c266efdb44
type value
v_measure 36.71865011295108
task dataset metrics
type
Reranking
type name config split revision
mteb/stackoverflowdupquestions-reranking MTEB StackOverflowDupQuestions default test e185fbe320c72810689fc5848eb6114e1ef5ec69
type value
map 55.3218674533331
type value
mrr 56.28279910449028
task dataset metrics
type
Summarization
type name config split revision
mteb/summeval MTEB SummEval default test cda12ad7615edc362dbf25a00fdd61d3b1eaf93c
type value
cos_sim_pearson 30.723915667479673
type value
cos_sim_spearman 32.029070449745234
type value
dot_pearson 28.864944212481454
type value
dot_spearman 27.939266999596725
task dataset metrics
type
Retrieval
type name config split revision
trec-covid MTEB TRECCOVID default test None
type value
map_at_1 0.231
type value
map_at_10 1.949
type value
map_at_100 10.023
type value
map_at_1000 23.485
type value
map_at_3 0.652
type value
map_at_5 1.054
type value
mrr_at_1 86
type value
mrr_at_10 92.067
type value
mrr_at_100 92.067
type value
mrr_at_1000 92.067
type value
mrr_at_3 91.667
type value
mrr_at_5 92.067
type value
ndcg_at_1 83
type value
ndcg_at_10 76.32900000000001
type value
ndcg_at_100 54.662
type value
ndcg_at_1000 48.062
type value
ndcg_at_3 81.827
type value
ndcg_at_5 80.664
type value
precision_at_1 86
type value
precision_at_10 80
type value
precision_at_100 55.48
type value
precision_at_1000 20.938000000000002
type value
precision_at_3 85.333
type value
precision_at_5 84.39999999999999
type value
recall_at_1 0.231
type value
recall_at_10 2.158
type value
recall_at_100 13.344000000000001
type value
recall_at_1000 44.31
type value
recall_at_3 0.6779999999999999
type value
recall_at_5 1.13
task dataset metrics
type
Retrieval
type name config split revision
webis-touche2020 MTEB Touche2020 default test None
type value
map_at_1 2.524
type value
map_at_10 10.183
type value
map_at_100 16.625
type value
map_at_1000 18.017
type value
map_at_3 5.169
type value
map_at_5 6.772
type value
mrr_at_1 32.653
type value
mrr_at_10 47.128
type value
mrr_at_100 48.458
type value
mrr_at_1000 48.473
type value
mrr_at_3 44.897999999999996
type value
mrr_at_5 45.306000000000004
type value
ndcg_at_1 30.612000000000002
type value
ndcg_at_10 24.928
type value
ndcg_at_100 37.613
type value
ndcg_at_1000 48.528
type value
ndcg_at_3 28.829
type value
ndcg_at_5 25.237
type value
precision_at_1 32.653
type value
precision_at_10 22.448999999999998
type value
precision_at_100 8.02
type value
precision_at_1000 1.537
type value
precision_at_3 30.612000000000002
type value
precision_at_5 24.490000000000002
type value
recall_at_1 2.524
type value
recall_at_10 16.38
type value
recall_at_100 49.529
type value
recall_at_1000 83.598
type value
recall_at_3 6.411
type value
recall_at_5 8.932
task dataset metrics
type
Classification
type name config split revision
mteb/toxic_conversations_50k MTEB ToxicConversationsClassification default test d7c0de2777da35d6aae2200a62c6e0e5af397c4c
type value
accuracy 71.09020000000001
type value
ap 14.451710060978993
type value
f1 54.7874410609049
task dataset metrics
type
Classification
type name config split revision
mteb/tweet_sentiment_extraction MTEB TweetSentimentExtractionClassification default test d604517c81ca91fe16a244d1248fc021f9ecee7a
type value
accuracy 59.745331069609506
type value
f1 60.08387848592697
task dataset metrics
type
Clustering
type name config split revision
mteb/twentynewsgroups-clustering MTEB TwentyNewsgroupsClustering default test 6125ec4e24fa026cec8a478383ee943acfbd5449
type value
v_measure 51.71549485462037
task dataset metrics
type
PairClassification
type name config split revision
mteb/twittersemeval2015-pairclassification MTEB TwitterSemEval2015 default test 70970daeab8776df92f5ea462b6173c0b46fd2d1
type value
cos_sim_accuracy 87.39345532574357
type value
cos_sim_ap 78.16796549696478
type value
cos_sim_f1 71.27713276123171
type value
cos_sim_precision 68.3115626511853
type value
cos_sim_recall 74.51187335092348
type value
dot_accuracy 85.12248912201228
type value
dot_ap 69.26039256107077
type value
dot_f1 65.04294321240867
type value
dot_precision 63.251059586138126
type value
dot_recall 66.93931398416886
type value
euclidean_accuracy 87.07754664123503
type value
euclidean_ap 77.7872176038945
type value
euclidean_f1 70.85587801278899
type value
euclidean_precision 66.3519115614924
type value
euclidean_recall 76.01583113456465
type value
manhattan_accuracy 87.07754664123503
type value
manhattan_ap 77.7341400185556
type value
manhattan_f1 70.80310880829015
type value
manhattan_precision 69.54198473282443
type value
manhattan_recall 72.1108179419525
type value
max_accuracy 87.39345532574357
type value
max_ap 78.16796549696478
type value
max_f1 71.27713276123171
task dataset metrics
type
PairClassification
type name config split revision
mteb/twitterurlcorpus-pairclassification MTEB TwitterURLCorpus default test 8b6510b0b1fa4e4c4f879467980e9be563ec1cdf
type value
cos_sim_accuracy 89.09457833663213
type value
cos_sim_ap 86.33024314706873
type value
cos_sim_f1 78.59623733719248
type value
cos_sim_precision 74.13322413322413
type value
cos_sim_recall 83.63104404065291
type value
dot_accuracy 88.3086894089339
type value
dot_ap 83.92225241805097
type value
dot_f1 76.8721826377781
type value
dot_precision 72.8168044077135
type value
dot_recall 81.40591315060055
type value
euclidean_accuracy 88.77052043311213
type value
euclidean_ap 85.7410710218755
type value
euclidean_f1 77.97705489398781
type value
euclidean_precision 73.77713657598241
type value
euclidean_recall 82.68401601478288
type value
manhattan_accuracy 88.73753250281368
type value
manhattan_ap 85.72867199072802
type value
manhattan_f1 77.89774182922812
type value
manhattan_precision 74.23787931635857
type value
manhattan_recall 81.93717277486911
type value
max_accuracy 89.09457833663213
type value
max_ap 86.33024314706873
type value
max_f1 78.59623733719248
mit
en

Universal AnglE Embedding

📢 WhereIsAI/UAE-Large-V1 is licensed under MIT. Feel free to use it in any scenario. If you use it for academic papers, you could cite us via 👉 citation info.

🤝 Follow us on:

Welcome to using AnglE to train and infer powerful sentence embeddings.

🏆 Achievements

  • 📅 May 16, 2024 | AnglE's paper is accepted by ACL 2024 Main Conference
  • 📅 Dec 4, 2024 | 🔥 Our universal English sentence embedding WhereIsAI/UAE-Large-V1 achieves SOTA on the MTEB Leaderboard with an average score of 64.64!

image/jpeg

🧑‍🤝‍🧑 Siblings:

Usage

1. angle_emb

python -m pip install -U angle-emb
  1. Non-Retrieval Tasks

There is no need to specify any prompts.

from angle_emb import AnglE
from angle_emb.utils import cosine_similarity

angle = AnglE.from_pretrained('WhereIsAI/UAE-Large-V1', pooling_strategy='cls').cuda()
doc_vecs = angle.encode([
    'The weather is great!',
    'The weather is very good!',
    'i am going to bed'
], normalize_embedding=True)

for i, dv1 in enumerate(doc_vecs):
    for dv2 in doc_vecs[i+1:]:
        print(cosine_similarity(dv1, dv2))
  1. Retrieval Tasks

For retrieval purposes, please use the prompt Prompts.C for query (not for document).

from angle_emb import AnglE, Prompts
from angle_emb.utils import cosine_similarity

angle = AnglE.from_pretrained('WhereIsAI/UAE-Large-V1', pooling_strategy='cls').cuda()
qv = angle.encode(Prompts.C.format(text='what is the weather?'))
doc_vecs = angle.encode([
    'The weather is great!',
    'it is rainy today.',
    'i am going to bed'
])

for dv in doc_vecs:
    print(cosine_similarity(qv[0], dv))

2. sentence transformer

from angle_emb import Prompts
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("WhereIsAI/UAE-Large-V1").cuda()

qv = model.encode(Prompts.C.format(text='what is the weather?'))
doc_vecs = model.encode([
    'The weather is great!',
    'it is rainy today.',
    'i am going to bed'
])

for dv in doc_vecs:
    print(1 - spatial.distance.cosine(qv, dv))

3. Infinity

Infinity is a MIT licensed server for OpenAI-compatible deployment.

docker run --gpus all -v $PWD/data:/app/.cache -p "7997":"7997" \
michaelf34/infinity:latest \
v2 --model-id WhereIsAI/UAE-Large-V1 --revision "369c368f70f16a613f19f5598d4f12d9f44235d4" --dtype float16 --batch-size 32 --device cuda --engine torch --port 7997

Citation

If you use our pre-trained models, welcome to support us by citing our work:

@article{li2023angle,
  title={AnglE-optimized Text Embeddings},
  author={Li, Xianming and Li, Jing},
  journal={arXiv preprint arXiv:2309.12871},
  year={2023}
}