榜单总榜
大语言精调模型排行榜
排名 | 模型 | 模型格式 | 最新评测时间 | 类型 | 综合评测胜率 | 客观评测准确率 | 主观评测胜率 | 认知-知识 | 认知-推理 | 交互 | 中文 | 英文 | 简单 | 中等 | 困难 | MMLU(英) | BBH(英) | CMMLU(中) | CEval (中) |
---|
0.8262 | 0.6086 | 0.7582 | 0.8357 | 0.7725 | 0.7981 | 0.8476 | 0.8890 | 0.7622 | 0.7690 | 0.8981 | 0.8654 | 0.8388 | 0.7821 | ||||||
0.7783 | 0.5626 | 0.7920 | 0.7379 | 0.6836 | 0.7885 | 0.7706 | 0.9109 | 0.7954 | 0.5325 | 0.8636 | 0.8823 | 0.7642 | 0.7103 | ||||||
0.7713 | 0.5639 * | 0.8327 | 0.7271 | 0.6926 | 0.7557 | 0.7830 | 0.8824 | 0.8254 | 0.5325 | 0.8402 | 0.8760 | 0.7672 | 0.7846 | ||||||
0.7309 | 0.5708 | 0.7367 | 0.6935 | 0.7363 | 0.7448 | 0.7204 | 0.9127 | 0.7734 | 0.3781 | 0.8131 | 0.8343 | 0.8299 | 0.7333 | ||||||
0.7733 | 0.5572 * | 0.7440 | 0.7386 | 0.7861 | 0.7859 | 0.7638 | 0.9190 | 0.7939 | 0.5017 | 0.8393 | 0.8109 | 0.7672 | 0.6436 | ||||||
0.7581 | 0.5538 * | 0.7342 | 0.7317 | 0.7694 | 0.7661 | 0.7521 | 0.9050 | 0.8010 | 0.4661 | 0.8463 | 0.8567 | 0.7851 | 0.6385 | ||||||
0.7495 | 0.5559 | 0.7586 | 0.7327 | 0.6629 | 0.7643 | 0.7384 | 0.8791 | 0.7789 | 0.4989 | 0.7972 | 0.8088 | 0.8185 | 0.8020 | ||||||
0.7452 | 0.5547 * | 0.7055 | 0.7190 | 0.7339 | 0.7472 | 0.7436 | 0.8914 | 0.7552 | 0.4811 | 0.8290 | 0.7827 | 0.7761 | 0.5795 | ||||||
0.7339 | 0.5474 | 0.7252 | 0.7175 | 0.6469 | 0.7577 | 0.7159 | 0.8740 | 0.7394 | 0.4844 | 0.7879 | 0.8047 | 0.8084 | 0.7788 | ||||||
0.7632 | 0.5344 | 0.7207 | 0.7462 | 0.6829 | 0.7860 | 0.7460 | 0.9036 | 0.7788 | 0.5050 | 0.7551 | 0.6972 | 0.5851 | 0.4333 | ||||||
0.7207 | 0.5437 * | 0.7516 | 0.6963 | 0.6208 | 0.7291 | 0.7143 | 0.8782 | 0.7612 | 0.4119 | 0.8479 | 0.8239 | 0.7600 | 0.7763 | ||||||
0.7170 | 0.5389 | 0.7478 | 0.6826 | 0.6897 | 0.7489 | 0.6929 | 0.8845 | 0.7589 | 0.3898 | 0.8416 | 0.8342 | 0.8406 | 0.8353 | ||||||
0.7075 | 0.5376 * | 0.7502 | 0.6651 | 0.5939 | 0.7562 | 0.6706 | 0.8803 | 0.7850 | 0.3418 | 0.8523 | 0.7182 | 0.8209 | 0.6872 | ||||||
0.7569 | 0.5204 * | 0.7948 | 0.7041 | 0.6965 | 0.7957 | 0.7274 | 0.9039 | 0.8232 | 0.4453 | 0.7991 | 0.8657 | 0.8597 | 0.6923 | ||||||
0.7053 | 0.5136 | 0.7505 | 0.6637 | 0.6292 | 0.7387 | 0.6800 | 0.8824 | 0.7786 | 0.3354 | 0.8303 | 0.8093 | 0.8293 | 0.8170 | ||||||
0.7049 | 0.5108 * | 0.6712 | 0.7285 | 0.5915 | 0.7257 | 0.6891 | 0.8384 | 0.6505 | 0.5160 | 0.7243 | 0.6969 | 0.7481 | 0.7110 | ||||||
0.7424 | 0.5000 | 0.7686 | 0.7183 | 0.7223 | 0.7326 | 0.7498 | 0.8824 | 0.7616 | 0.4818 | 0.7989 | 0.4287 | 0.6719 | 0.6534 | ||||||
0.7626 | 0.4886 | 0.7757 | 0.7330 | 0.7499 | 0.7478 | 0.7739 | 0.8764 | 0.7667 | 0.5602 | 0.7421 | 0.7871 | 0.5943 | 0.6094 | ||||||
0.7078 | 0.5013 | 0.7292 | 0.6825 | 0.6667 | 0.7163 | 0.7013 | 0.8422 | 0.7268 | 0.4570 | 0.7521 | 0.7860 | 0.7510 | 0.7449 | ||||||
0.6914 | 0.5032 | 0.7367 | 0.6391 | 0.6335 | 0.7457 | 0.6503 | 0.8820 | 0.7329 | 0.3241 | 0.8076 | 0.7812 | 0.8672 | 0.8746 | ||||||
0.7263 | 0.4934 * | 0.6766 | 0.7525 | 0.6074 | 0.7822 | 0.6840 | 0.8654 | 0.6937 | 0.5099 | 0.7366 | 0.7432 | 0.7821 | 0.7217 | ||||||
0.7143 | 0.4920 * | 0.7576 | 0.6749 | 0.6873 | 0.7244 | 0.7067 | 0.8695 | 0.7590 | 0.4063 | 0.7892 | 0.4579 | 0.6767 | 0.6212 | ||||||
0.7509 | 0.4818 | 0.7430 | 0.7211 | 0.7633 | 0.7703 | 0.7362 | 0.9157 | 0.7605 | 0.4549 | 0.8009 | 0.7506 | 0.6896 | 0.5846 | ||||||
0.6670 | 0.5010 | 0.7127 | 0.6256 | 0.6403 | 0.6658 | 0.6679 | 0.8185 | 0.7528 | 0.3316 | 0.7888 | 0.7129 | 0.7761 | 0.5872 | ||||||
0.6850 | 0.4944 | 0.7492 | 0.6379 | 0.6321 | 0.7328 | 0.6487 | 0.8911 | 0.7164 | 0.2987 | 0.7664 | 0.7016 | 0.8000 | 0.5744 | ||||||
0.7133 | 0.4714 * | 0.6906 | 0.7111 | 0.6069 | 0.7605 | 0.6776 | 0.8494 | 0.6954 | 0.4900 | 0.7873 | 0.7589 | 0.7693 | 0.7172 | ||||||
0.6926 | 0.4763 * | 0.6595 | 0.6878 | 0.6617 | 0.7926 | 0.6167 | 0.8504 | 0.6928 | 0.4162 | 0.7505 | 0.7843 | 0.7851 | 0.5795 | ||||||
0.6744 | 0.4784 * | 0.6709 | 0.6335 | 0.6880 | 0.7093 | 0.6479 | 0.8645 | 0.6990 | 0.3216 | 0.7607 | 0.8054 | 0.7940 | 0.5308 | ||||||
0.7261 | 0.4649 | 0.7412 | 0.7028 | 0.6631 | 0.7752 | 0.6889 | 0.8489 | 0.7862 | 0.4620 | 0.7804 | 0.7610 | 0.8119 | 0.6282 | ||||||
0.6761 | 0.4761 | 0.6771 | 0.6650 | 0.6268 | 0.6948 | 0.6619 | 0.8623 | 0.7196 | 0.3148 | 0.7850 | 0.7569 | 0.8239 | 0.7667 | ||||||
0.6850 | 0.4729 * | 0.7098 | 0.6452 | 0.6200 | 0.7303 | 0.6506 | 0.8721 | 0.7312 | 0.3198 | 0.7925 | 0.7769 | 0.8090 | 0.7947 | ||||||
0.7262 | 0.4594 | 0.6948 | 0.7272 | 0.6337 | 0.7688 | 0.6940 | 0.8580 | 0.7097 | 0.5092 | 0.8006 | 0.7995 | 0.7746 | 0.7574 | ||||||
0.6607 | 0.4655 | 0.6843 | 0.6027 | 0.6421 | 0.7208 | 0.6153 | 0.8176 | 0.7657 | 0.3005 | 0.7327 | 0.7317 | 0.7970 | 0.5923 | ||||||
0.6563 | 0.4623 * | 0.6763 | 0.5936 | 0.6587 | 0.7062 | 0.6185 | 0.8155 | 0.7737 | 0.2817 | 0.7449 | 0.7907 | 0.7731 | 0.6795 | ||||||
0.6761 | 0.4573 | 0.6771 | 0.6650 | 0.6268 | 0.6948 | 0.6619 | 0.8623 | 0.7196 | 0.3148 | 0.7103 | 0.7624 | 0.7194 | 0.5641 | ||||||
0.7474 | 0.4324 | 0.7674 | 0.7204 | 0.7406 | 0.7376 | 0.7548 | 0.8720 | 0.7585 | 0.5203 | 0.8179 | 0.6906 | 0.6761 | 0.6269 | ||||||
0.7129 | 0.4408 * | 0.6764 | 0.7226 | 0.6076 | 0.7697 | 0.6699 | 0.8601 | 0.6916 | 0.4728 | 0.7022 | 0.7569 | 0.7334 | 0.7041 | ||||||
0.6814 | 0.4392 | 0.6853 | 0.6538 | 0.6351 | 0.7030 | 0.6651 | 0.8641 | 0.7082 | 0.3401 | 0.7645 | 0.7883 | 0.6836 | 0.4487 | ||||||
0.7070 | 0.4320 | 0.6593 | 0.6840 | 0.6196 | 0.7678 | 0.6609 | 0.8604 | 0.7164 | 0.4309 | 0.7579 | 0.7216 | 0.8985 | 0.8103 | ||||||
0.6550 | 0.4432 | 0.6800 | 0.6117 | 0.6146 | 0.6911 | 0.6276 | 0.8587 | 0.6110 | 0.3348 | 0.7299 | 0.8180 | 0.6985 | 0.5769 | ||||||
0.7062 | 0.4203 * | 0.7460 | 0.6659 | 0.6321 | 0.7198 | 0.6958 | 0.8684 | 0.6919 | 0.4341 | 0.8595 | 0.8706 | 0.7546 | 0.7305 | ||||||
0.6475 | 0.4328 * | 0.7039 | 0.5930 | 0.5891 | 0.7006 | 0.6073 | 0.8238 | 0.7255 | 0.2753 | 0.7505 | 0.7683 | 0.7970 | 0.5462 | ||||||
0.5943 | 0.4414 * | 0.6269 | 0.5488 | 0.5603 | 0.6147 | 0.5788 | 0.7715 | 0.6142 | 0.2680 | 0.7013 | 0.5602 | 0.6890 | 0.6310 | ||||||
0.6481 | 0.4239 * | 0.6689 | 0.6254 | 0.5889 | 0.6864 | 0.6191 | 0.8437 | 0.6844 | 0.2763 | 0.7510 | 0.6721 | 0.7761 | 0.7459 | ||||||
0.7123 | 0.4063 * | 0.7136 | 0.6909 | 0.6648 | 0.6888 | 0.7300 | 0.8378 | 0.7235 | 0.4834 | 0.8244 | 0.8059 | 0.6636 | 0.6432 | ||||||
0.6877 | 0.4010 | 0.6981 | 0.6610 | 0.6600 | 0.6714 | 0.7001 | 0.8592 | 0.6738 | 0.3993 | 0.7429 | 0.7885 | 0.5988 | 0.5956 | ||||||
0.6373 | 0.4135 | 0.6610 | 0.6021 | 0.6071 | 0.6948 | 0.5938 | 0.8457 | 0.6432 | 0.2680 | 0.7565 | 0.6406 | 0.8272 | 0.7750 | ||||||
0.6359 | 0.4113 * | 0.6837 | 0.5979 | 0.5882 | 0.6619 | 0.6161 | 0.8211 | 0.6287 | 0.3178 | 0.7357 | 0.6542 | 0.5884 | 0.5727 | ||||||
0.6449 | 0.4089 * | 0.5638 | 0.6679 | 0.5449 | 0.7290 | 0.5811 | 0.8102 | 0.6026 | 0.3903 | 0.6560 | 0.6321 | 0.6887 | 0.6837 | ||||||
0.7082 | 0.3807 | 0.7066 | 0.7044 | 0.6268 | 0.7628 | 0.6668 | 0.8261 | 0.7420 | 0.4741 | 0.6486 | 0.7045 | 0.6597 | 0.4513 | ||||||
0.6263 | 0.3929 * | 0.6417 | 0.5679 | 0.6036 | 0.6225 | 0.6292 | 0.7874 | 0.6545 | 0.3214 | 0.6841 | 0.6511 | 0.7194 | 0.6077 | ||||||
0.5609 | 0.4066 | 0.6488 | 0.5015 | 0.5322 | 0.6225 | 0.5142 | 0.7461 | 0.6039 | 0.2017 | 0.7256 | 0.5668 | 0.7675 | 0.7459 | ||||||
0.5902 | 0.3956 * | 0.6516 | 0.5407 | 0.5586 | 0.6090 | 0.5760 | 0.7691 | 0.6212 | 0.2519 | 0.7167 | 0.6359 | 0.7260 | 0.6814 | ||||||
0.5568 | 0.3858 | 0.6566 | 0.4876 | 0.4987 | 0.5810 | 0.5385 | 0.7033 | 0.5804 | 0.2813 | 0.8004 | 0.7614 | 0.5313 | 0.5027 | ||||||
0.6138 | 0.3509 | 0.6748 | 0.6030 | 0.5441 | 0.6268 | 0.6039 | 0.7943 | 0.6101 | 0.3010 | 0.7094 | 0.4581 | 0.5003 | 0.4591 | ||||||
0.5813 | 0.3302 * | 0.5616 | 0.5692 | 0.5440 | 0.6433 | 0.5343 | 0.7724 | 0.5909 | 0.2392 | 0.6934 | 0.5493 | 0.6790 | 0.6453 | ||||||
0.4956 | 0.3514 * | 0.5065 | 0.5028 | 0.4213 | 0.5475 | 0.4562 | 0.6509 | 0.4670 | 0.2473 | 0.3730 | 0.4203 | 0.4618 | 0.1427 | ||||||
0.5462 | 0.3342 * | 0.5717 | 0.5126 | 0.4668 | 0.5090 | 0.5745 | 0.6861 | 0.5255 | 0.3186 | 0.6935 | 0.6591 | 0.5084 | 0.5296 | ||||||
0.5690 | 0.3188 * | 0.6369 | 0.5242 | 0.5215 | 0.5864 | 0.5558 | 0.7765 | 0.5405 | 0.2293 | 0.7006 | 0.5925 | 0.4525 | 0.4833 | ||||||
0.4769 | 0.3414 * | 0.5072 | 0.4289 | 0.5086 | 0.5622 | 0.4123 | 0.6980 | 0.4629 | 0.1016 | 0.5800 | 0.4101 | 0.5373 | 0.5453 | ||||||
0.3724 | 0.3526 * | 0.4051 | 0.3309 | 0.3877 | 0.4660 | 0.3014 | 0.5064 | 0.3970 | 0.1179 | 0.0245 | 0.2687 | 0.5866 | 0.5306 | ||||||
0.4892 | 0.3161 * | 0.5666 | 0.4502 | 0.4281 | 0.5278 | 0.4600 | 0.6676 | 0.4869 | 0.1792 | 0.6583 | 0.6077 | 0.5152 | 0.4713 | ||||||
0.5639 | 0.2946 * | 0.5605 | 0.5502 | 0.6024 | 0.6353 | 0.5098 | 0.7851 | 0.5394 | 0.1970 | 0.6108 | 0.4780 | 0.5594 | 0.4905 | ||||||
0.2981 | 0.3257 * | 0.3475 | 0.2488 | 0.3387 | 0.3742 | 0.2404 | 0.4364 | 0.2724 | 0.0773 | 0.1513 | 0.3129 | 0.4651 | 0.0189 | ||||||
0.4850 | 0.2788 * | 0.5208 | 0.4528 | 0.4823 | 0.5821 | 0.4114 | 0.6773 | 0.5124 | 0.1263 | 0.6236 | 0.4396 | 0.6230 | 0.5865 | ||||||
0.4270 | 0.2927 * | 0.5019 | 0.3745 | 0.4219 | 0.4391 | 0.4178 | 0.6062 | 0.4034 | 0.1328 | 0.5748 | 0.4813 | 0.2842 | 0.2924 | ||||||
0.5017 | 0.2689 * | 0.5902 | 0.4522 | 0.4677 | 0.5629 | 0.4554 | 0.6953 | 0.4968 | 0.1672 | 0.6982 | 0.4863 | 0.7143 | 0.7146 | ||||||
0.5741 | 0.2370 * | 0.5630 | 0.5629 | 0.6023 | 0.6383 | 0.5254 | 0.7956 | 0.5362 | 0.2177 | 0.6218 | 0.4769 | 0.5582 | 0.5146 | ||||||
0.5083 | 0.2455 * | 0.5248 | 0.4778 | 0.5241 | 0.5735 | 0.4590 | 0.7451 | 0.4830 | 0.1150 | 0.5732 | 0.3731 | 0.5460 | 0.5321 | ||||||
0.4651 | 0.2469 * | 0.5727 | 0.4103 | 0.4625 | 0.5060 | 0.4341 | 0.6398 | 0.4791 | 0.1480 | 0.5564 | 0.3341 | 0.6448 | 0.6345 | ||||||
0.3829 | 0.2036 * | 0.5384 | 0.3323 | 0.3285 | 0.3394 | 0.4158 | 0.4567 | 0.4398 | 0.2070 | 0.6941 | 0.5629 | 0.4531 | 0.4669 | ||||||
0.3463 | 0.2088 * | 0.3904 | 0.3223 | 0.3568 | 0.4170 | 0.2927 | 0.4858 | 0.3515 | 0.0980 | 0.4624 | 0.3173 | 0.4421 | 0.4019 | ||||||
0.2456 | 0.2250 * | 0.3456 | 0.2018 | 0.2069 | 0.2468 | 0.2448 | 0.3369 | 0.2552 | 0.0782 | 0.4364 | 0.2264 | 0.4475 | 0.4697 | ||||||
0.7248 | 评测中 | 0.7471 | 0.7066 | 0.5492 | 0.7360 | 0.7163 | 0.8451 | 0.7199 | 0.5184 | 0.8033 | 0.7629 | 0.6245 | 0.6228 | ||||||
评测中 | 0.4040 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 0.7151 | 0.6616 | 0.7412 | 0.7358 | ||||||
评测中 | 0.3815 * | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 0.5784 | 0.4328 | 0.6355 | 0.5754 | ||||||
评测中 | 0.3722 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 0.7058 | 0.6246 | 0.5881 | 0.5715 | ||||||
评测中 | 0.4210 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 评测中 | 0.7207 | 0.7098 | 0.6812 | 0.6447 |
暂无数据
排名 | 模型 | 模型格式 | 最新评测时间 | 类型 | 综合评测胜率 |
---|
1 | OpenAI/o1 | API | 2024-11-07 | 闭源模型 | 0.5864 |
2 | Anthropic/Claude-3.5-Sonnet-20240620 | API | 2024-11-28 | 闭源模型 | 0.5477 |
3 | Anthropic/Claude-3.5-Sonnet-20241022 | API | 2024-12-02 | 闭源模型 | 0.5474 |
4 | 智谱/GLM4-Plus | API | 2024-11-28 | 闭源模型 | 0.5453 |
5 | OpenAI/GPT-4o-20240806 | API | 2024-11-13 | 闭源模型 | 0.5433 |
6 | OpenAI/GPT-4o-20241120 | API | 2024-11-28 | 闭源模型 | 0.5385 |
7 | LongCat-Large-Friday | FT | 2024-12-02 | 自研模型 | 0.5385 |
8 | OpenAI/GPT-4o-20240513 | API | 2024-11-13 | 闭源模型 | 0.5369 |
9 | LongCat-Large(量化前DPO) | HF | 2024-10-29 | 自研模型 | 0.5302 |
10 | 百度/ERNIE 4.0 | API | 2024-11-28 | 闭源模型 | 0.5264 |
11 | Alphabet/Gemini-1.5-pro-002 | API | 2024-11-28 | 闭源模型 | 0.5255 |
12 | Qwen2.5-72B-Instruct | HF | 2024-11-28 | 开源模型 | 0.5217 |
13 | 零一万物/Yi-Lightning | API | 2024-11-28 | 闭源模型 | 0.5193 |
14 | 阶跃星辰/StepChat-2-16k | API | 2024-11-28 | 闭源模型 | 0.5160 |
15 | Qwen2.5-32B-Instruct | HF | 2024-11-28 | 开源模型 | 0.5029 |
16 | DeepSeek-V2.5-Chat | HF | 2024-11-28 | 开源模型 | 0.5010 |
17 | OpenAI/GPT4-turbo-0125 | API | 2024-11-28 | 闭源模型 | 0.5000 |
18 | Anthropic/Claude3-Opus | API | 2024-11-07 | 闭源模型 | 0.4958 |
19 | DeepSeek-V2-Chat | HF | 2024-11-28 | 开源模型 | 0.4951 |
20 | Qwen2-72B-instruct | HF | 2024-11-28 | 开源模型 | 0.4936 |
21 | LongCat-Medium-Friday | FT | 2024-11-29 | 自研模型 | 0.4929 |
22 | Mistral-Large2 | HF | 2024-11-28 | 开源模型 | 0.4900 |
23 | OpenAI/GPT4-turbo-0409 | API | 2024-11-28 | 闭源模型 | 0.4893 |
24 | 零一万物/Yi-Large | API | 2024-11-28 | 闭源模型 | 0.4881 |
25 | 百川/baichuan4 | API | 2024-11-28 | 闭源模型 | 0.4867 |
26 | LongCat-120B-DPO-202410 | HF | 2024-10-29 | 自研模型 | 0.4761 |
27 | 讯飞/spark-4.0-Ultra | API | 2024-11-28 | 闭源模型 | 0.4759 |
28 | 智谱/GLM4-0520 | API | 2024-11-28 | 闭源模型 | 0.4743 |
29 | 字节/doubao-pro-4k | API | 2024-11-07 | 闭源模型 | 0.4739 |
30 | 阿里/Qwen-Max | API | 2024-10-29 | 闭源模型 | 0.4730 |
31 | Qwen2.5-14B-Instruct | HF | 2024-11-28 | 开源模型 | 0.4724 |
32 | LongCat-Prime-8K-Chat-preview | FT | 2024-10-29 | 自研模型 | 0.4702 |
33 | 智谱/ChatGLM4 | API | 2024-11-28 | 闭源模型 | 0.4634 |
34 | 智谱/GLM4-air | API | 2024-11-28 | 闭源模型 | 0.4605 |
35 | 月之暗面/moonshot-v1-8k | API | 2024-11-28 | 闭源模型 | 0.4605 |
36 | OpenAI/GPT4-0613 | API | 2024-11-28 | 闭源模型 | 0.4558 |
37 | LongCat-120B-SFT-202410 | FT | 2024-10-29 | 自研模型 | 0.4556 |
38 | OpenAI/GPT-4o-mini | API | 2024-11-28 | 闭源模型 | 0.4493 |
39 | 腾讯/hunyuan-pro | API | 2024-11-28 | 闭源模型 | 0.4488 |
40 | Minimax/abab6.5 | API | 2024-11-28 | 闭源模型 | 0.4476 |
41 | LLama3.1-405B-instruct | HF | 2024-11-28 | 开源模型 | 0.4408 |
42 | 智谱/GLM4-Long | API | 2024-11-28 | 闭源模型 | 0.4394 |
43 | GLM4-9B-Chat | HF | 2024-11-28 | 开源模型 | 0.4363 |
44 | Qwen2.5-7B-Instruct | HF | 2024-11-28 | 开源模型 | 0.4335 |
45 | LLama3.1-70B-instruct | HF | 2024-11-07 | 开源模型 | 0.4325 |
46 | Anthropic/Claude3-Sonnet | API | 2024-11-28 | 闭源模型 | 0.4249 |
47 | Qwen2-57B-A14B-instruct | HF | 2024-11-28 | 开源模型 | 0.4248 |
48 | Gemma2-9B-instruction | HF | 2024-11-28 | 开源模型 | 0.4231 |
49 | LongCat-8B-128K-Friday-0902 | FT | 2024-11-13 | 自研模型 | 0.4230 |
50 | 阶跃星辰/StepChat-1 | API | 2024-11-07 | 闭源模型 | 0.4148 |
51 | 智谱/GLM4-Flash | API | 2024-11-28 | 闭源模型 | 0.4092 |
52 | Yi1.5-34B-Chat | HF | 2024-11-28 | 开源模型 | 0.4075 |
53 | GLM-9B-CHAT-1M | HF | 2024-11-28 | 开源模型 | 0.4050 |
54 | LLama3-70B-instruct | HF | 2024-11-28 | 开源模型 | 0.3929 |
55 | OpenAI/ChatGPT | API | 2024-11-28 | 闭源模型 | 0.3792 |
56 | Qwen2.5-3B-Instruct | HF | 2024-11-28 | 开源模型 | 0.3599 |
57 | DeepSeek-V2-Lite-Chat | HF | 2024-11-28 | 开源模型 | 0.3598 |
58 | LLama3.1-8B-instruct | HF | 2024-11-28 | 开源模型 | 0.3568 |
59 | Mistral-8x7B_instruct-v0.1 | HF | 2024-11-28 | 开源模型 | 0.3503 |
60 | Longcat-MoE-3B-RL(量化) | FT | 2024-10-29 | 自研模型 | 0.3500 |
61 | Baichuan2-13B-Chat-v2 | HF | 2024-11-28 | 开源模型 | 0.3401 |
62 | LLama3-8B-instruct | HF | 2024-11-28 | 开源模型 | 0.3352 |
63 | LongCat-13B-Friday | HF | 2024-11-13 | 自研模型 | 0.3333 |
64 | Baichuan2-7B-Chat | HF | 2024-11-07 | 开源模型 | 0.3098 |
65 | Qwen2.5-1.5B-Instruct | HF | 2024-11-28 | 开源模型 | 0.3096 |
66 | mistral-7B-instruct-v0.2 | HF | 2024-11-28 | 开源模型 | 0.3092 |
67 | Yi1.5-9B-Chat | HF | 2024-11-28 | 开源模型 | 0.3058 |
68 | LongCat-13B-SFT | HF | 2024-10-29 | 自研模型 | 0.2966 |
69 | LongCat-7B-SFT | HF | 2024-10-29 | 自研模型 | 0.2913 |
70 | Yi1.5-6B-Chat | HF | 2024-11-28 | 开源模型 | 0.2850 |
71 | DBRX-instruct | HF | 2024-11-07 | 开源模型 | 0.2425 |
72 | Qwen2.5-0.5B-Instruct | HF | 2024-11-28 | 开源模型 | 0.2398 |
73 | Qwen2-0.5B-Chat | HF | 2024-11-28 | 开源模型 | 0.2339 |
74 | OpenAI/o1-mini-2024-09-12 | API | 2024-11-22 | 闭源模型 | 评测中 |
75 | LongCat-Lite-Chat(MOE-6B-85B) | FT | 2024-10-29 | 自研模型 | 评测中 |
76 | LongCat-33B-SFT-量化 | FT | 2024-10-29 | 自研模型 | 评测中 |
77 | LongCat-70B-SFT-DPO | HF | 2024-10-29 | 自研模型 | 评测中 |
78 | LongCat-Prime-Friday | FT | 2024-10-29 | 自研模型 | 评测中 |
暂无数据
加载中...