opencompass 0.2.6__tar.gz → 0.3.0__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (2191) hide show
  1. opencompass-0.3.0/MANIFEST.in +2 -0
  2. opencompass-0.3.0/PKG-INFO +613 -0
  3. opencompass-0.3.0/README.md +594 -0
  4. opencompass-0.3.0/opencompass/__init__.py +1 -0
  5. opencompass-0.3.0/opencompass/cli/main.py +391 -0
  6. opencompass-0.3.0/opencompass/configs/datasets/ARC_c/ARC_c_clean_ppl.py +55 -0
  7. opencompass-0.3.0/opencompass/configs/datasets/ARC_c/ARC_c_gen.py +4 -0
  8. opencompass-0.3.0/opencompass/configs/datasets/ARC_c/ARC_c_gen_1e0de5.py +44 -0
  9. opencompass-0.3.0/opencompass/configs/datasets/ARC_c/ARC_c_ppl.py +4 -0
  10. opencompass-0.3.0/opencompass/configs/datasets/ARC_c/ARC_c_ppl_2ef631.py +37 -0
  11. opencompass-0.3.0/opencompass/configs/datasets/ARC_c/ARC_c_ppl_a450bd.py +54 -0
  12. opencompass-0.3.0/opencompass/configs/datasets/ARC_c/ARC_c_ppl_d52a21.py +36 -0
  13. opencompass-0.3.0/opencompass/configs/datasets/ARC_e/ARC_e_gen.py +4 -0
  14. opencompass-0.3.0/opencompass/configs/datasets/ARC_e/ARC_e_gen_1e0de5.py +44 -0
  15. opencompass-0.3.0/opencompass/configs/datasets/ARC_e/ARC_e_ppl.py +4 -0
  16. opencompass-0.3.0/opencompass/configs/datasets/ARC_e/ARC_e_ppl_2ef631.py +37 -0
  17. opencompass-0.3.0/opencompass/configs/datasets/ARC_e/ARC_e_ppl_a450bd.py +54 -0
  18. opencompass-0.3.0/opencompass/configs/datasets/ARC_e/ARC_e_ppl_d52a21.py +34 -0
  19. opencompass-0.3.0/opencompass/configs/datasets/CHARM/README.md +164 -0
  20. opencompass-0.3.0/opencompass/configs/datasets/CHARM/README_ZH.md +162 -0
  21. opencompass-0.3.0/opencompass/configs/datasets/CHARM/charm_memory_gen_bbbd53.py +63 -0
  22. opencompass-0.3.0/opencompass/configs/datasets/CHARM/charm_memory_settings.py +31 -0
  23. opencompass-0.3.0/opencompass/configs/datasets/CHARM/charm_reason_cot_only_gen_f7b7d3.py +50 -0
  24. opencompass-0.3.0/opencompass/configs/datasets/CHARM/charm_reason_gen.py +4 -0
  25. opencompass-0.3.0/opencompass/configs/datasets/CHARM/charm_reason_gen_f8fca2.py +49 -0
  26. opencompass-0.3.0/opencompass/configs/datasets/CHARM/charm_reason_ppl_3da4de.py +57 -0
  27. opencompass-0.3.0/opencompass/configs/datasets/CHARM/charm_reason_settings.py +36 -0
  28. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Anachronisms_Judgment_Direct.txt +22 -0
  29. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Anachronisms_Judgment_EN-CoT.txt +25 -0
  30. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Anachronisms_Judgment_XLT.txt +63 -0
  31. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Anachronisms_Judgment_ZH-CoT.txt +25 -0
  32. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Movie_and_Music_Recommendation_Direct.txt +25 -0
  33. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Movie_and_Music_Recommendation_EN-CoT.txt +40 -0
  34. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Movie_and_Music_Recommendation_XLT.txt +76 -0
  35. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Movie_and_Music_Recommendation_ZH-CoT.txt +40 -0
  36. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Natural_Language_Inference_Direct.txt +25 -0
  37. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Natural_Language_Inference_EN-CoT.txt +28 -0
  38. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Natural_Language_Inference_XLT.txt +67 -0
  39. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Natural_Language_Inference_ZH-CoT.txt +28 -0
  40. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Reading_Comprehension_Direct.txt +23 -0
  41. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Reading_Comprehension_EN-CoT.txt +25 -0
  42. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Reading_Comprehension_XLT.txt +62 -0
  43. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Reading_Comprehension_ZH-CoT.txt +26 -0
  44. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Sequence_Understanding_Direct.txt +22 -0
  45. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Sequence_Understanding_EN-CoT.txt +25 -0
  46. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Sequence_Understanding_XLT.txt +62 -0
  47. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Sequence_Understanding_ZH-CoT.txt +25 -0
  48. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Sport_Understanding_Direct.txt +19 -0
  49. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Sport_Understanding_EN-CoT.txt +22 -0
  50. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Sport_Understanding_XLT.txt +56 -0
  51. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Sport_Understanding_ZH-CoT.txt +22 -0
  52. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Time_Understanding_Direct.txt +25 -0
  53. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Time_Understanding_EN-CoT.txt +28 -0
  54. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Time_Understanding_XLT.txt +68 -0
  55. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Chinese_Time_Understanding_ZH-CoT.txt +28 -0
  56. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Anachronisms_Judgment_Direct.txt +22 -0
  57. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Anachronisms_Judgment_EN-CoT.txt +25 -0
  58. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Anachronisms_Judgment_XLT.txt +61 -0
  59. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Anachronisms_Judgment_ZH-CoT.txt +25 -0
  60. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Movie_and_Music_Recommendation_Direct.txt +25 -0
  61. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Movie_and_Music_Recommendation_EN-CoT.txt +40 -0
  62. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Movie_and_Music_Recommendation_XLT.txt +76 -0
  63. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Movie_and_Music_Recommendation_ZH-CoT.txt +40 -0
  64. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Natural_Language_Inference_Direct.txt +25 -0
  65. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Natural_Language_Inference_EN-CoT.txt +28 -0
  66. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Natural_Language_Inference_XLT.txt +69 -0
  67. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Natural_Language_Inference_ZH-CoT.txt +28 -0
  68. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Reading_Comprehension_Direct.txt +22 -0
  69. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Reading_Comprehension_EN-CoT.txt +25 -0
  70. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Reading_Comprehension_XLT.txt +61 -0
  71. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Reading_Comprehension_ZH-CoT.txt +25 -0
  72. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Sequence_Understanding_Direct.txt +22 -0
  73. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Sequence_Understanding_EN-CoT.txt +25 -0
  74. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Sequence_Understanding_XLT.txt +60 -0
  75. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Sequence_Understanding_ZH-CoT.txt +25 -0
  76. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Sport_Understanding_Direct.txt +19 -0
  77. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Sport_Understanding_EN-CoT.txt +22 -0
  78. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Sport_Understanding_XLT.txt +57 -0
  79. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Sport_Understanding_ZH-CoT.txt +22 -0
  80. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Time_Understanding_Direct.txt +27 -0
  81. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Time_Understanding_EN-CoT.txt +30 -0
  82. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Time_Understanding_XLT.txt +71 -0
  83. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples/Global_Time_Understanding_ZH-CoT.txt +30 -0
  84. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Chinese_Anachronisms_Judgment_Translate-EN.txt +25 -0
  85. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Chinese_Movie_and_Music_Recommendation_Translate-EN.txt +40 -0
  86. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Chinese_Natural_Language_Inference_Translate-EN.txt +28 -0
  87. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Chinese_Reading_Comprehension_Translate-EN.txt +26 -0
  88. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Chinese_Sequence_Understanding_Translate-EN.txt +25 -0
  89. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Chinese_Sport_Understanding_Translate-EN.txt +22 -0
  90. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Chinese_Time_Understanding_Translate-EN.txt +28 -0
  91. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Global_Anachronisms_Judgment_Translate-EN.txt +25 -0
  92. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Global_Movie_and_Music_Recommendation_Translate-EN.txt +40 -0
  93. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Global_Natural_Language_Inference_Translate-EN.txt +28 -0
  94. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Global_Reading_Comprehension_Translate-EN.txt +25 -0
  95. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Global_Sequence_Understanding_Translate-EN.txt +25 -0
  96. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Global_Sport_Understanding_Translate-EN.txt +22 -0
  97. opencompass-0.3.0/opencompass/configs/datasets/CHARM/few-shot-examples_Translate-EN/Global_Time_Understanding_Translate-EN.txt +30 -0
  98. opencompass-0.3.0/opencompass/configs/datasets/CIBench/CIBench_generation_gen_8ab0dc.py +35 -0
  99. opencompass-0.3.0/opencompass/configs/datasets/CIBench/CIBench_generation_oracle_gen_c4a7c1.py +35 -0
  100. opencompass-0.3.0/opencompass/configs/datasets/CIBench/CIBench_template_gen_e6b12a.py +39 -0
  101. opencompass-0.3.0/opencompass/configs/datasets/CIBench/CIBench_template_oracle_gen_fecda1.py +39 -0
  102. opencompass-0.3.0/opencompass/configs/datasets/CLUE_C3/CLUE_C3_gen.py +4 -0
  103. opencompass-0.3.0/opencompass/configs/datasets/CLUE_C3/CLUE_C3_gen_8c358f.py +51 -0
  104. opencompass-0.3.0/opencompass/configs/datasets/CLUE_C3/CLUE_C3_ppl.py +4 -0
  105. opencompass-0.3.0/opencompass/configs/datasets/CLUE_C3/CLUE_C3_ppl_56b537.py +36 -0
  106. opencompass-0.3.0/opencompass/configs/datasets/CLUE_C3/CLUE_C3_ppl_e24a31.py +37 -0
  107. opencompass-0.3.0/opencompass/configs/datasets/CLUE_CMRC/CLUE_CMRC_gen.py +4 -0
  108. opencompass-0.3.0/opencompass/configs/datasets/CLUE_CMRC/CLUE_CMRC_gen_1bd3c8.py +35 -0
  109. opencompass-0.3.0/opencompass/configs/datasets/CLUE_CMRC/CLUE_CMRC_gen_3749cd.py +33 -0
  110. opencompass-0.3.0/opencompass/configs/datasets/CLUE_CMRC/CLUE_CMRC_gen_8484b9.py +27 -0
  111. opencompass-0.3.0/opencompass/configs/datasets/CLUE_CMRC/CLUE_CMRC_gen_941108.py +34 -0
  112. opencompass-0.3.0/opencompass/configs/datasets/CLUE_DRCD/CLUE_DRCD_gen.py +4 -0
  113. opencompass-0.3.0/opencompass/configs/datasets/CLUE_DRCD/CLUE_DRCD_gen_1bd3c8.py +36 -0
  114. opencompass-0.3.0/opencompass/configs/datasets/CLUE_DRCD/CLUE_DRCD_gen_3749cd.py +33 -0
  115. opencompass-0.3.0/opencompass/configs/datasets/CLUE_DRCD/CLUE_DRCD_gen_8484b9.py +27 -0
  116. opencompass-0.3.0/opencompass/configs/datasets/CLUE_DRCD/CLUE_DRCD_gen_941108.py +34 -0
  117. opencompass-0.3.0/opencompass/configs/datasets/CLUE_afqmc/CLUE_afqmc_gen.py +4 -0
  118. opencompass-0.3.0/opencompass/configs/datasets/CLUE_afqmc/CLUE_afqmc_gen_901306.py +43 -0
  119. opencompass-0.3.0/opencompass/configs/datasets/CLUE_afqmc/CLUE_afqmc_ppl.py +4 -0
  120. opencompass-0.3.0/opencompass/configs/datasets/CLUE_afqmc/CLUE_afqmc_ppl_378c5b.py +44 -0
  121. opencompass-0.3.0/opencompass/configs/datasets/CLUE_afqmc/CLUE_afqmc_ppl_6507d7.py +50 -0
  122. opencompass-0.3.0/opencompass/configs/datasets/CLUE_afqmc/CLUE_afqmc_ppl_7b0c1e.py +34 -0
  123. opencompass-0.3.0/opencompass/configs/datasets/CLUE_cmnli/CLUE_cmnli_gen.py +4 -0
  124. opencompass-0.3.0/opencompass/configs/datasets/CLUE_cmnli/CLUE_cmnli_gen_1abf97.py +43 -0
  125. opencompass-0.3.0/opencompass/configs/datasets/CLUE_cmnli/CLUE_cmnli_gen_51e956.py +43 -0
  126. opencompass-0.3.0/opencompass/configs/datasets/CLUE_cmnli/CLUE_cmnli_ppl.py +4 -0
  127. opencompass-0.3.0/opencompass/configs/datasets/CLUE_cmnli/CLUE_cmnli_ppl_98dd6e.py +34 -0
  128. opencompass-0.3.0/opencompass/configs/datasets/CLUE_cmnli/CLUE_cmnli_ppl_ef69e7.py +50 -0
  129. opencompass-0.3.0/opencompass/configs/datasets/CLUE_cmnli/CLUE_cmnli_ppl_fdc6de.py +54 -0
  130. opencompass-0.3.0/opencompass/configs/datasets/CLUE_ocnli/CLUE_ocnli_gen.py +4 -0
  131. opencompass-0.3.0/opencompass/configs/datasets/CLUE_ocnli/CLUE_ocnli_gen_51e956.py +44 -0
  132. opencompass-0.3.0/opencompass/configs/datasets/CLUE_ocnli/CLUE_ocnli_gen_c4cb6c.py +44 -0
  133. opencompass-0.3.0/opencompass/configs/datasets/CLUE_ocnli/CLUE_ocnli_ppl.py +4 -0
  134. opencompass-0.3.0/opencompass/configs/datasets/CLUE_ocnli/CLUE_ocnli_ppl_98dd6e.py +35 -0
  135. opencompass-0.3.0/opencompass/configs/datasets/CLUE_ocnli/CLUE_ocnli_ppl_ef69e7.py +51 -0
  136. opencompass-0.3.0/opencompass/configs/datasets/CLUE_ocnli/CLUE_ocnli_ppl_fdc6de.py +55 -0
  137. opencompass-0.3.0/opencompass/configs/datasets/ChemBench/ChemBench_gen.py +77 -0
  138. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_bustm/FewCLUE_bustm_gen.py +4 -0
  139. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_bustm/FewCLUE_bustm_gen_634f41.py +53 -0
  140. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_bustm/FewCLUE_bustm_ppl.py +4 -0
  141. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_bustm/FewCLUE_bustm_ppl_4b16c0.py +65 -0
  142. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_bustm/FewCLUE_bustm_ppl_9ef540.py +43 -0
  143. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_bustm/FewCLUE_bustm_ppl_e53034.py +59 -0
  144. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_chid/FewCLUE_chid_gen.py +4 -0
  145. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_chid/FewCLUE_chid_gen_0a29a2.py +51 -0
  146. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_chid/FewCLUE_chid_ppl.py +4 -0
  147. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_chid/FewCLUE_chid_ppl_8f2872.py +45 -0
  148. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_chid/FewCLUE_chid_ppl_acccb5.py +39 -0
  149. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_cluewsc/FewCLUE_cluewsc_gen.py +4 -0
  150. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_cluewsc/FewCLUE_cluewsc_gen_c68933.py +51 -0
  151. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_cluewsc/FewCLUE_cluewsc_ppl.py +4 -0
  152. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_cluewsc/FewCLUE_cluewsc_ppl_12e4e0.py +58 -0
  153. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_cluewsc/FewCLUE_cluewsc_ppl_4284a0.py +44 -0
  154. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_cluewsc/FewCLUE_cluewsc_ppl_868415.py +54 -0
  155. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_csl/FewCLUE_csl_gen.py +4 -0
  156. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_csl/FewCLUE_csl_gen_28b223.py +51 -0
  157. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_csl/FewCLUE_csl_gen_87f4a8.py +51 -0
  158. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_csl/FewCLUE_csl_ppl.py +4 -0
  159. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_csl/FewCLUE_csl_ppl_769f8d.py +45 -0
  160. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_csl/FewCLUE_csl_ppl_841b62.py +41 -0
  161. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_eprstmt/FewCLUE_eprstmt_gen.py +4 -0
  162. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_eprstmt/FewCLUE_eprstmt_gen_740ea0.py +49 -0
  163. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_eprstmt/FewCLUE_eprstmt_ppl.py +4 -0
  164. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_eprstmt/FewCLUE_eprstmt_ppl_1ce587.py +41 -0
  165. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_eprstmt/FewCLUE_eprstmt_ppl_f1e631.py +49 -0
  166. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_ocnli_fc/FewCLUE_ocnli_fc_gen.py +4 -0
  167. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_ocnli_fc/FewCLUE_ocnli_fc_gen_f97a97.py +52 -0
  168. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_ocnli_fc/FewCLUE_ocnli_fc_ppl.py +4 -0
  169. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_ocnli_fc/FewCLUE_ocnli_fc_ppl_9e8b3d.py +60 -0
  170. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_ocnli_fc/FewCLUE_ocnli_fc_ppl_c08300.py +44 -0
  171. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_tnews/FewCLUE_tnews_gen.py +4 -0
  172. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_tnews/FewCLUE_tnews_gen_b90e4a.py +75 -0
  173. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_tnews/FewCLUE_tnews_ppl.py +4 -0
  174. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_tnews/FewCLUE_tnews_ppl_7d1c07.py +43 -0
  175. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_tnews/FewCLUE_tnews_ppl_d10e8a.py +48 -0
  176. opencompass-0.3.0/opencompass/configs/datasets/FewCLUE_tnews/FewCLUE_tnews_ppl_fff486.py +48 -0
  177. opencompass-0.3.0/opencompass/configs/datasets/FinanceIQ/FinanceIQ_gen.py +4 -0
  178. opencompass-0.3.0/opencompass/configs/datasets/FinanceIQ/FinanceIQ_gen_e0e6b5.py +77 -0
  179. opencompass-0.3.0/opencompass/configs/datasets/FinanceIQ/FinanceIQ_ppl.py +4 -0
  180. opencompass-0.3.0/opencompass/configs/datasets/FinanceIQ/FinanceIQ_ppl_42b9bd.py +76 -0
  181. opencompass-0.3.0/opencompass/configs/datasets/GLUE_CoLA/GLUE_CoLA_ppl.py +4 -0
  182. opencompass-0.3.0/opencompass/configs/datasets/GLUE_CoLA/GLUE_CoLA_ppl_77d0df.py +50 -0
  183. opencompass-0.3.0/opencompass/configs/datasets/GLUE_MRPC/GLUE_MRPC_ppl.py +4 -0
  184. opencompass-0.3.0/opencompass/configs/datasets/GLUE_MRPC/GLUE_MRPC_ppl_96564c.py +51 -0
  185. opencompass-0.3.0/opencompass/configs/datasets/GLUE_QQP/GLUE_QQP_ppl.py +4 -0
  186. opencompass-0.3.0/opencompass/configs/datasets/GLUE_QQP/GLUE_QQP_ppl_250d00.py +51 -0
  187. opencompass-0.3.0/opencompass/configs/datasets/GaokaoBench/GaokaoBench_gen.py +4 -0
  188. opencompass-0.3.0/opencompass/configs/datasets/GaokaoBench/GaokaoBench_gen_5cfe9e.py +303 -0
  189. opencompass-0.3.0/opencompass/configs/datasets/GaokaoBench/GaokaoBench_mixed.py +4 -0
  190. opencompass-0.3.0/opencompass/configs/datasets/GaokaoBench/GaokaoBench_mixed_9af5ee.py +354 -0
  191. opencompass-0.3.0/opencompass/configs/datasets/GaokaoBench/GaokaoBench_no_subjective_gen_4c31db.py +43 -0
  192. opencompass-0.3.0/opencompass/configs/datasets/GaokaoBench/GaokaoBench_no_subjective_gen_d21e37.py +42 -0
  193. opencompass-0.3.0/opencompass/configs/datasets/GaokaoBench/GaokaoBench_prompts.py +191 -0
  194. opencompass-0.3.0/opencompass/configs/datasets/GaokaoBench/README.md +191 -0
  195. opencompass-0.3.0/opencompass/configs/datasets/IFEval/IFEval.md +55 -0
  196. opencompass-0.3.0/opencompass/configs/datasets/IFEval/IFEval_gen.py +4 -0
  197. opencompass-0.3.0/opencompass/configs/datasets/IFEval/IFEval_gen_3321a3.py +33 -0
  198. opencompass-0.3.0/opencompass/configs/datasets/IFEval/README.md +31 -0
  199. opencompass-0.3.0/opencompass/configs/datasets/LCBench/README.md +66 -0
  200. opencompass-0.3.0/opencompass/configs/datasets/LCBench/lcbench_gen.py +4 -0
  201. opencompass-0.3.0/opencompass/configs/datasets/LCBench/lcbench_gen_5ff288.py +107 -0
  202. opencompass-0.3.0/opencompass/configs/datasets/LCBench/lcbench_levels_gen_bb665f.py +77 -0
  203. opencompass-0.3.0/opencompass/configs/datasets/LCBench/lcbench_repeat10_gen.py +4 -0
  204. opencompass-0.3.0/opencompass/configs/datasets/LCBench/lcbench_repeat10_gen_5ff288.py +106 -0
  205. opencompass-0.3.0/opencompass/configs/datasets/MMLUArabic/MMLUArabic_gen.py +4 -0
  206. opencompass-0.3.0/opencompass/configs/datasets/MMLUArabic/MMLUArabic_gen_326684.py +59 -0
  207. opencompass-0.3.0/opencompass/configs/datasets/MMLUArabic/MMLUArabic_ppl.py +4 -0
  208. opencompass-0.3.0/opencompass/configs/datasets/MMLUArabic/MMLUArabic_ppl_d2333a.py +51 -0
  209. opencompass-0.3.0/opencompass/configs/datasets/MMLUArabic/MMLUArabic_zero_shot_gen.py +4 -0
  210. opencompass-0.3.0/opencompass/configs/datasets/MMLUArabic/MMLUArabic_zero_shot_gen_3523e0.py +53 -0
  211. opencompass-0.3.0/opencompass/configs/datasets/MMLUArabic/README.md +26 -0
  212. opencompass-0.3.0/opencompass/configs/datasets/MathBench/deprecated_mathbench_2024_gen_de9ff9.py +108 -0
  213. opencompass-0.3.0/opencompass/configs/datasets/MathBench/deprecated_mathbench_agent_gen_48ec47.py +128 -0
  214. opencompass-0.3.0/opencompass/configs/datasets/MathBench/deprecated_mathbench_agent_gen_fbe13b.py +130 -0
  215. opencompass-0.3.0/opencompass/configs/datasets/MathBench/deprecated_mathbench_arith_gen_ccd638.py +58 -0
  216. opencompass-0.3.0/opencompass/configs/datasets/MathBench/deprecated_mathbench_cot_gen_66f329.py +110 -0
  217. opencompass-0.3.0/opencompass/configs/datasets/MathBench/deprecated_mathbench_gen_7b734b.py +110 -0
  218. opencompass-0.3.0/opencompass/configs/datasets/MathBench/mathbench_2024_gen_19e486.py +114 -0
  219. opencompass-0.3.0/opencompass/configs/datasets/MathBench/mathbench_2024_gen_1dc21d.py +81 -0
  220. opencompass-0.3.0/opencompass/configs/datasets/MathBench/mathbench_2024_gen_fc2a24.py +81 -0
  221. opencompass-0.3.0/opencompass/configs/datasets/MathBench/mathbench_2024_wocircular_gen_1dc21d.py +81 -0
  222. opencompass-0.3.0/opencompass/configs/datasets/MathBench/mathbench_2024_wocircular_mixed_8eb12b.py +81 -0
  223. opencompass-0.3.0/opencompass/configs/datasets/MathBench/mathbench_gen.py +4 -0
  224. opencompass-0.3.0/opencompass/configs/datasets/MathBench/mathbench_prompt.py +103 -0
  225. opencompass-0.3.0/opencompass/configs/datasets/MedBench/medbench_gen.py +4 -0
  226. opencompass-0.3.0/opencompass/configs/datasets/MedBench/medbench_gen_0b4fff.py +119 -0
  227. opencompass-0.3.0/opencompass/configs/datasets/NPHardEval/NPHardEval_gen.py +4 -0
  228. opencompass-0.3.0/opencompass/configs/datasets/NPHardEval/NPHardEval_gen_22aac5.py +59 -0
  229. opencompass-0.3.0/opencompass/configs/datasets/NPHardEval/README.md +126 -0
  230. opencompass-0.3.0/opencompass/configs/datasets/OpenFinData/OpenFinData_gen.py +4 -0
  231. opencompass-0.3.0/opencompass/configs/datasets/OpenFinData/OpenFinData_gen_46dedb.py +99 -0
  232. opencompass-0.3.0/opencompass/configs/datasets/OpenFinData/README.md +64 -0
  233. opencompass-0.3.0/opencompass/configs/datasets/PJExam/PJExam_gen.py +4 -0
  234. opencompass-0.3.0/opencompass/configs/datasets/PJExam/PJExam_gen_8cd97c.py +54 -0
  235. opencompass-0.3.0/opencompass/configs/datasets/QuALITY/QuALITY.md +56 -0
  236. opencompass-0.3.0/opencompass/configs/datasets/QuALITY/QuALITY_gen.py +4 -0
  237. opencompass-0.3.0/opencompass/configs/datasets/QuALITY/QuALITY_gen_c407cb.py +38 -0
  238. opencompass-0.3.0/opencompass/configs/datasets/SVAMP/svamp_gen.py +4 -0
  239. opencompass-0.3.0/opencompass/configs/datasets/SVAMP/svamp_gen_fb25e4.py +36 -0
  240. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_AX_b/SuperGLUE_AX_b_gen.py +4 -0
  241. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_AX_b/SuperGLUE_AX_b_gen_4dfefa.py +43 -0
  242. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_AX_b/SuperGLUE_AX_b_ppl.py +4 -0
  243. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_AX_b/SuperGLUE_AX_b_ppl_0748aa.py +34 -0
  244. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_AX_b/SuperGLUE_AX_b_ppl_6db806.py +53 -0
  245. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_AX_g/SuperGLUE_AX_g_gen.py +4 -0
  246. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_AX_g/SuperGLUE_AX_g_gen_68aac7.py +43 -0
  247. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_AX_g/SuperGLUE_AX_g_ppl.py +4 -0
  248. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_AX_g/SuperGLUE_AX_g_ppl_50f8f6.py +34 -0
  249. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_AX_g/SuperGLUE_AX_g_ppl_66caf3.py +53 -0
  250. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_BoolQ/SuperGLUE_BoolQ_gen.py +4 -0
  251. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_BoolQ/SuperGLUE_BoolQ_gen_883d50.py +41 -0
  252. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_BoolQ/SuperGLUE_BoolQ_ppl.py +4 -0
  253. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_BoolQ/SuperGLUE_BoolQ_ppl_314797.py +43 -0
  254. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_BoolQ/SuperGLUE_BoolQ_ppl_314b96.py +45 -0
  255. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_BoolQ/SuperGLUE_BoolQ_ppl_4da4db.py +45 -0
  256. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_BoolQ/SuperGLUE_BoolQ_ppl_9619db.py +34 -0
  257. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_CB/SuperGLUE_CB_gen.py +4 -0
  258. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_CB/SuperGLUE_CB_gen_854c6c.py +44 -0
  259. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_CB/SuperGLUE_CB_ppl.py +4 -0
  260. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_CB/SuperGLUE_CB_ppl_0143fe.py +62 -0
  261. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_CB/SuperGLUE_CB_ppl_11c175.py +33 -0
  262. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_COPA/SuperGLUE_COPA_gen.py +4 -0
  263. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_COPA/SuperGLUE_COPA_gen_91ca53.py +44 -0
  264. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_COPA/SuperGLUE_COPA_ppl.py +4 -0
  265. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_COPA/SuperGLUE_COPA_ppl_54058d.py +34 -0
  266. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_COPA/SuperGLUE_COPA_ppl_5c24f1.py +45 -0
  267. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_COPA/SuperGLUE_COPA_ppl_9f3618.py +49 -0
  268. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_MultiRC/SuperGLUE_MultiRC_gen.py +4 -0
  269. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_MultiRC/SuperGLUE_MultiRC_gen_27071f.py +43 -0
  270. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_MultiRC/SuperGLUE_MultiRC_ppl.py +4 -0
  271. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_MultiRC/SuperGLUE_MultiRC_ppl_866273.py +30 -0
  272. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_MultiRC/SuperGLUE_MultiRC_ppl_ced824.py +47 -0
  273. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_RTE/SuperGLUE_RTE_gen.py +4 -0
  274. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_RTE/SuperGLUE_RTE_gen_68aac7.py +43 -0
  275. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_RTE/SuperGLUE_RTE_ppl.py +4 -0
  276. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_RTE/SuperGLUE_RTE_ppl_50f8f6.py +34 -0
  277. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_RTE/SuperGLUE_RTE_ppl_66caf3.py +53 -0
  278. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_ReCoRD/SuperGLUE_ReCoRD_gen.py +4 -0
  279. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_ReCoRD/SuperGLUE_ReCoRD_gen_0f7784.py +29 -0
  280. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_ReCoRD/SuperGLUE_ReCoRD_gen_30dea0.py +42 -0
  281. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_ReCoRD/SuperGLUE_ReCoRD_gen_a69961.py +35 -0
  282. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WSC/SuperGLUE_WSC_gen.py +4 -0
  283. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WSC/SuperGLUE_WSC_gen_7902a7.py +43 -0
  284. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WSC/SuperGLUE_WSC_gen_fe4bf3.py +43 -0
  285. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WSC/SuperGLUE_WSC_ppl.py +4 -0
  286. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WSC/SuperGLUE_WSC_ppl_003529.py +41 -0
  287. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WSC/SuperGLUE_WSC_ppl_1c4a90.py +49 -0
  288. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WSC/SuperGLUE_WSC_ppl_d0f531.py +51 -0
  289. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WSC/SuperGLUE_WSC_ppl_f37e78.py +34 -0
  290. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WiC/SuperGLUE_WiC_gen.py +4 -0
  291. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WiC/SuperGLUE_WiC_gen_d06864.py +47 -0
  292. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WiC/SuperGLUE_WiC_ppl.py +4 -0
  293. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WiC/SuperGLUE_WiC_ppl_312de9.py +55 -0
  294. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WiC/SuperGLUE_WiC_ppl_3fb6fd.py +38 -0
  295. opencompass-0.3.0/opencompass/configs/datasets/SuperGLUE_WiC/SuperGLUE_WiC_ppl_c926be.py +49 -0
  296. opencompass-0.3.0/opencompass/configs/datasets/TabMWP/TabMWP_gen.py +4 -0
  297. opencompass-0.3.0/opencompass/configs/datasets/TabMWP/TabMWP_gen_2aef96.py +52 -0
  298. opencompass-0.3.0/opencompass/configs/datasets/TheoremQA/README.md +69 -0
  299. opencompass-0.3.0/opencompass/configs/datasets/TheoremQA/TheoremQA_5shot_gen_6f0af8.py +45 -0
  300. opencompass-0.3.0/opencompass/configs/datasets/TheoremQA/TheoremQA_few_shot_examples.py +22 -0
  301. opencompass-0.3.0/opencompass/configs/datasets/TheoremQA/TheoremQA_few_shot_examples_official.py +22 -0
  302. opencompass-0.3.0/opencompass/configs/datasets/TheoremQA/TheoremQA_gen.py +4 -0
  303. opencompass-0.3.0/opencompass/configs/datasets/TheoremQA/deprecated_TheoremQA_gen_424e0a.py +39 -0
  304. opencompass-0.3.0/opencompass/configs/datasets/TheoremQA/deprecated_TheoremQA_gen_7009de.py +44 -0
  305. opencompass-0.3.0/opencompass/configs/datasets/TheoremQA/deprecated_TheoremQA_gen_ef26ca.py +44 -0
  306. opencompass-0.3.0/opencompass/configs/datasets/TheoremQA/deprecated_TheoremQA_post_v2_gen_2c2583.py +38 -0
  307. opencompass-0.3.0/opencompass/configs/datasets/TheoremQA/deprecated_TheoremQA_post_v2_gen_ef26ca.py +45 -0
  308. opencompass-0.3.0/opencompass/configs/datasets/XCOPA/XCOPA_ppl.py +4 -0
  309. opencompass-0.3.0/opencompass/configs/datasets/XCOPA/XCOPA_ppl_54058d.py +31 -0
  310. opencompass-0.3.0/opencompass/configs/datasets/XLSum/XLSum_gen.py +4 -0
  311. opencompass-0.3.0/opencompass/configs/datasets/XLSum/XLSum_gen_2bb71c.py +29 -0
  312. opencompass-0.3.0/opencompass/configs/datasets/Xsum/Xsum_gen.py +4 -0
  313. opencompass-0.3.0/opencompass/configs/datasets/Xsum/Xsum_gen_31397e.py +39 -0
  314. opencompass-0.3.0/opencompass/configs/datasets/Xsum/Xsum_gen_8ea5f8.py +30 -0
  315. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/__init__.py +11 -0
  316. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_mnli/adv_glue_mnli_gen.py +4 -0
  317. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_mnli/adv_glue_mnli_gen_bd8ef0.py +42 -0
  318. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_mnli_mm/adv_glue_mnli_mm_gen.py +4 -0
  319. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_mnli_mm/adv_glue_mnli_mm_gen_bd8ef0.py +42 -0
  320. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_qnli/adv_glue_qnli_gen.py +4 -0
  321. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_qnli/adv_glue_qnli_gen_0b7326.py +42 -0
  322. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_qqp/adv_glue_qqp_gen.py +4 -0
  323. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_qqp/adv_glue_qqp_gen_cdc277.py +42 -0
  324. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_rte/adv_glue_rte_gen.py +4 -0
  325. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_rte/adv_glue_rte_gen_8cc547.py +42 -0
  326. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_sst2/adv_glue_sst2_gen.py +4 -0
  327. opencompass-0.3.0/opencompass/configs/datasets/adv_glue/adv_glue_sst2/adv_glue_sst2_gen_ee8d3b.py +41 -0
  328. opencompass-0.3.0/opencompass/configs/datasets/agieval/agieval_gen.py +4 -0
  329. opencompass-0.3.0/opencompass/configs/datasets/agieval/agieval_gen_397d81.py +204 -0
  330. opencompass-0.3.0/opencompass/configs/datasets/agieval/agieval_gen_617738.py +209 -0
  331. opencompass-0.3.0/opencompass/configs/datasets/agieval/agieval_gen_64afd3.py +207 -0
  332. opencompass-0.3.0/opencompass/configs/datasets/agieval/agieval_gen_a0c741.py +85 -0
  333. opencompass-0.3.0/opencompass/configs/datasets/agieval/agieval_mixed.py +4 -0
  334. opencompass-0.3.0/opencompass/configs/datasets/agieval/agieval_mixed_0fa998.py +220 -0
  335. opencompass-0.3.0/opencompass/configs/datasets/anli/anli_gen.py +4 -0
  336. opencompass-0.3.0/opencompass/configs/datasets/anli/anli_gen_fc7328.py +42 -0
  337. opencompass-0.3.0/opencompass/configs/datasets/anli/anli_ppl.py +4 -0
  338. opencompass-0.3.0/opencompass/configs/datasets/anli/anli_ppl_1d290e.py +50 -0
  339. opencompass-0.3.0/opencompass/configs/datasets/anthropics_evals/airisk_gen.py +4 -0
  340. opencompass-0.3.0/opencompass/configs/datasets/anthropics_evals/airisk_gen_ba66fc.py +66 -0
  341. opencompass-0.3.0/opencompass/configs/datasets/anthropics_evals/persona_gen.py +4 -0
  342. opencompass-0.3.0/opencompass/configs/datasets/anthropics_evals/persona_gen_cc72e2.py +184 -0
  343. opencompass-0.3.0/opencompass/configs/datasets/anthropics_evals/sycophancy_gen.py +4 -0
  344. opencompass-0.3.0/opencompass/configs/datasets/anthropics_evals/sycophancy_gen_4bba45.py +50 -0
  345. opencompass-0.3.0/opencompass/configs/datasets/apps/README.md +43 -0
  346. opencompass-0.3.0/opencompass/configs/datasets/apps/apps_gen.py +4 -0
  347. opencompass-0.3.0/opencompass/configs/datasets/apps/apps_gen_c7893a.py +28 -0
  348. opencompass-0.3.0/opencompass/configs/datasets/apps/apps_mini_gen.py +4 -0
  349. opencompass-0.3.0/opencompass/configs/datasets/apps/apps_mini_gen_c7893a.py +28 -0
  350. opencompass-0.3.0/opencompass/configs/datasets/apps/deprecated_apps_gen_5b4254.py +33 -0
  351. opencompass-0.3.0/opencompass/configs/datasets/apps/deprecated_apps_gen_7fbb95.py +40 -0
  352. opencompass-0.3.0/opencompass/configs/datasets/apps/deprecated_apps_gen_b4dee3.py +30 -0
  353. opencompass-0.3.0/opencompass/configs/datasets/bbh/README.md +250 -0
  354. opencompass-0.3.0/opencompass/configs/datasets/bbh/bbh_gen.py +4 -0
  355. opencompass-0.3.0/opencompass/configs/datasets/bbh/bbh_gen_2879b0.py +56 -0
  356. opencompass-0.3.0/opencompass/configs/datasets/bbh/bbh_gen_4a31fa.py +99 -0
  357. opencompass-0.3.0/opencompass/configs/datasets/bbh/bbh_gen_5b92b0.py +99 -0
  358. opencompass-0.3.0/opencompass/configs/datasets/bbh/bbh_gen_5bf00b.py +99 -0
  359. opencompass-0.3.0/opencompass/configs/datasets/bbh/bbh_gen_98fba6.py +90 -0
  360. opencompass-0.3.0/opencompass/configs/datasets/bbh/bbh_subset_settings.py +29 -0
  361. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/boolean_expressions.txt +23 -0
  362. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/causal_judgement.txt +25 -0
  363. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/date_understanding.txt +33 -0
  364. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/disambiguation_qa.txt +37 -0
  365. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/dyck_languages.txt +72 -0
  366. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/formal_fallacies.txt +44 -0
  367. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/geometric_shapes.txt +78 -0
  368. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/hyperbaton.txt +28 -0
  369. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/logical_deduction_five_objects.txt +37 -0
  370. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/logical_deduction_seven_objects.txt +37 -0
  371. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/logical_deduction_three_objects.txt +37 -0
  372. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/movie_recommendation.txt +42 -0
  373. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/multistep_arithmetic_two.txt +25 -0
  374. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/navigate.txt +43 -0
  375. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/object_counting.txt +37 -0
  376. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/penguins_in_a_table.txt +41 -0
  377. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/reasoning_about_colored_objects.txt +63 -0
  378. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/ruin_names.txt +44 -0
  379. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/salient_translation_error_detection.txt +40 -0
  380. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/snarks.txt +30 -0
  381. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/sports_understanding.txt +10 -0
  382. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/temporal_sequences.txt +77 -0
  383. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/tracking_shuffled_objects_five_objects.txt +40 -0
  384. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/tracking_shuffled_objects_seven_objects.txt +40 -0
  385. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/tracking_shuffled_objects_three_objects.txt +40 -0
  386. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/web_of_lies.txt +28 -0
  387. opencompass-0.3.0/opencompass/configs/datasets/bbh/lib_prompt/word_sorting.txt +17 -0
  388. opencompass-0.3.0/opencompass/configs/datasets/calm/README.md +117 -0
  389. opencompass-0.3.0/opencompass/configs/datasets/calm/calm.py +160 -0
  390. opencompass-0.3.0/opencompass/configs/datasets/ceval/README.md +372 -0
  391. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_clean_ppl.py +108 -0
  392. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_gen.py +4 -0
  393. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_gen_2daf24.py +107 -0
  394. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_gen_5f30c7.py +108 -0
  395. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_internal_ppl_1cd8bf.py +103 -0
  396. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_internal_ppl_93e5ce.py +108 -0
  397. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_ppl.py +4 -0
  398. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_ppl_1cd8bf.py +103 -0
  399. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_ppl_578f8d.py +108 -0
  400. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_ppl_93e5ce.py +108 -0
  401. opencompass-0.3.0/opencompass/configs/datasets/ceval/ceval_zero_shot_gen_bd40ef.py +106 -0
  402. opencompass-0.3.0/opencompass/configs/datasets/civilcomments/civilcomments_clp.py +4 -0
  403. opencompass-0.3.0/opencompass/configs/datasets/civilcomments/civilcomments_clp_6a2561.py +31 -0
  404. opencompass-0.3.0/opencompass/configs/datasets/civilcomments/civilcomments_clp_a3c5fd.py +35 -0
  405. opencompass-0.3.0/opencompass/configs/datasets/clozeTest_maxmin/clozeTest_maxmin_gen.py +4 -0
  406. opencompass-0.3.0/opencompass/configs/datasets/clozeTest_maxmin/clozeTest_maxmin_gen_c205fb.py +42 -0
  407. opencompass-0.3.0/opencompass/configs/datasets/cmb/cmb_gen.py +4 -0
  408. opencompass-0.3.0/opencompass/configs/datasets/cmb/cmb_gen_dfb5c4.py +49 -0
  409. opencompass-0.3.0/opencompass/configs/datasets/cmmlu/cmmlu_0shot_cot_gen_305931.py +130 -0
  410. opencompass-0.3.0/opencompass/configs/datasets/cmmlu/cmmlu_gen.py +4 -0
  411. opencompass-0.3.0/opencompass/configs/datasets/cmmlu/cmmlu_gen_c13365.py +123 -0
  412. opencompass-0.3.0/opencompass/configs/datasets/cmmlu/cmmlu_ppl.py +4 -0
  413. opencompass-0.3.0/opencompass/configs/datasets/cmmlu/cmmlu_ppl_041cbf.py +117 -0
  414. opencompass-0.3.0/opencompass/configs/datasets/cmmlu/cmmlu_ppl_8b9c76.py +122 -0
  415. opencompass-0.3.0/opencompass/configs/datasets/collections/base_core.py +20 -0
  416. opencompass-0.3.0/opencompass/configs/datasets/collections/base_medium.py +56 -0
  417. opencompass-0.3.0/opencompass/configs/datasets/collections/base_medium_llama.py +56 -0
  418. opencompass-0.3.0/opencompass/configs/datasets/collections/base_small.py +38 -0
  419. opencompass-0.3.0/opencompass/configs/datasets/collections/chat_core.py +20 -0
  420. opencompass-0.3.0/opencompass/configs/datasets/collections/chat_medium.py +56 -0
  421. opencompass-0.3.0/opencompass/configs/datasets/collections/chat_small.py +39 -0
  422. opencompass-0.3.0/opencompass/configs/datasets/collections/example.py +7 -0
  423. opencompass-0.3.0/opencompass/configs/datasets/collections/leaderboard/qwen.py +51 -0
  424. opencompass-0.3.0/opencompass/configs/datasets/collections/leaderboard/qwen_chat.py +51 -0
  425. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa/commonsenseqa_gen.py +4 -0
  426. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa/commonsenseqa_gen_1da2d0.py +55 -0
  427. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa/commonsenseqa_gen_c946f2.py +62 -0
  428. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa/commonsenseqa_ppl.py +4 -0
  429. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa/commonsenseqa_ppl_3e9f2d.py +56 -0
  430. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa/commonsenseqa_ppl_5545e2.py +49 -0
  431. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa/commonsenseqa_ppl_716f78.py +45 -0
  432. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa/commonsenseqa_ppl_c49e77.py +41 -0
  433. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa/commonsenseqa_ppl_e51e32.py +42 -0
  434. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa_cn/commonsenseqacn_gen.py +4 -0
  435. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa_cn/commonsenseqacn_gen_d380d0.py +50 -0
  436. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa_cn/commonsenseqacn_ppl.py +4 -0
  437. opencompass-0.3.0/opencompass/configs/datasets/commonsenseqa_cn/commonsenseqacn_ppl_971f48.py +52 -0
  438. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1/agent/cibench_template_gen_e6b12a.py +57 -0
  439. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1/agent/mus_teval_gen_105c48.py +56 -0
  440. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1/code/compassbench_v1_1_code_gen_986f01.py +291 -0
  441. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1/knowledge/compassbench_v1_knowledge_gen_bd74e0.py +133 -0
  442. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1/language/compassbench_v1_language_gen_7aa06d.py +46 -0
  443. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1/math/compassbench_v1_1_math_gen_1dc21d.py +81 -0
  444. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1/math/mathbench_prompt.py +103 -0
  445. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1/reason/compassbench_v1_reason_gen_d26d08.py +28 -0
  446. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1_public/agent/cibench_template_gen_e6b12a.py +57 -0
  447. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1_public/agent/mus_teval_gen_105c48.py +56 -0
  448. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1_public/code/compassbench_v1_1_code_gen_986f01.py +291 -0
  449. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1_public/knowledge/compassbench_v1_knowledge_gen_bd74e0.py +133 -0
  450. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1_public/language/compassbench_v1_language_gen_7aa06d.py +46 -0
  451. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1_public/math/compassbench_v1_1_math_gen_1dc21d.py +81 -0
  452. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1_public/math/mathbench_prompt.py +103 -0
  453. opencompass-0.3.0/opencompass/configs/datasets/compassbench_20_v1_1_public/reason/compassbench_v1_reason_gen_d26d08.py +28 -0
  454. opencompass-0.3.0/opencompass/configs/datasets/compassbench_v1_3/compassbench_v1_3_objective_gen.py +4 -0
  455. opencompass-0.3.0/opencompass/configs/datasets/compassbench_v1_3/compassbench_v1_3_objective_gen_068af0.py +74 -0
  456. opencompass-0.3.0/opencompass/configs/datasets/contamination/ceval_contamination_ppl_810ec6.py +55 -0
  457. opencompass-0.3.0/opencompass/configs/datasets/contamination/mbpp_contamination_ppl_f01cb6.py +57 -0
  458. opencompass-0.3.0/opencompass/configs/datasets/contamination/mmlu_contamination_ppl_810ec6.py +55 -0
  459. opencompass-0.3.0/opencompass/configs/datasets/crowspairs/crowspairs_gen.py +4 -0
  460. opencompass-0.3.0/opencompass/configs/datasets/crowspairs/crowspairs_gen_02b6c1.py +40 -0
  461. opencompass-0.3.0/opencompass/configs/datasets/crowspairs/crowspairs_gen_381af0.py +49 -0
  462. opencompass-0.3.0/opencompass/configs/datasets/crowspairs/crowspairs_ppl.py +4 -0
  463. opencompass-0.3.0/opencompass/configs/datasets/crowspairs/crowspairs_ppl_47f211.py +32 -0
  464. opencompass-0.3.0/opencompass/configs/datasets/crowspairs/crowspairs_ppl_e811e1.py +40 -0
  465. opencompass-0.3.0/opencompass/configs/datasets/crowspairs_cn/crowspairscn_gen.py +4 -0
  466. opencompass-0.3.0/opencompass/configs/datasets/crowspairs_cn/crowspairscn_gen_556dc9.py +64 -0
  467. opencompass-0.3.0/opencompass/configs/datasets/crowspairs_cn/crowspairscn_ppl.py +4 -0
  468. opencompass-0.3.0/opencompass/configs/datasets/crowspairs_cn/crowspairscn_ppl_f53575.py +39 -0
  469. opencompass-0.3.0/opencompass/configs/datasets/cvalues/cvalues_responsibility_gen.py +4 -0
  470. opencompass-0.3.0/opencompass/configs/datasets/cvalues/cvalues_responsibility_gen_543378.py +37 -0
  471. opencompass-0.3.0/opencompass/configs/datasets/demo/demo_cmmlu_base_ppl.py +8 -0
  472. opencompass-0.3.0/opencompass/configs/datasets/demo/demo_cmmlu_chat_gen.py +8 -0
  473. opencompass-0.3.0/opencompass/configs/datasets/demo/demo_gsm8k_base_gen.py +7 -0
  474. opencompass-0.3.0/opencompass/configs/datasets/demo/demo_gsm8k_chat_gen.py +7 -0
  475. opencompass-0.3.0/opencompass/configs/datasets/demo/demo_math_base_gen.py +7 -0
  476. opencompass-0.3.0/opencompass/configs/datasets/demo/demo_math_chat_gen.py +7 -0
  477. opencompass-0.3.0/opencompass/configs/datasets/drop/deprecated_drop_gen_8a9ed9.py +44 -0
  478. opencompass-0.3.0/opencompass/configs/datasets/drop/drop_examples.py +16 -0
  479. opencompass-0.3.0/opencompass/configs/datasets/drop/drop_gen.py +4 -0
  480. opencompass-0.3.0/opencompass/configs/datasets/drop/drop_gen_a2697c.py +43 -0
  481. opencompass-0.3.0/opencompass/configs/datasets/drop/drop_gen_eb14af.py +34 -0
  482. opencompass-0.3.0/opencompass/configs/datasets/drop/drop_openai_simple_evals_gen_3857b0.py +34 -0
  483. opencompass-0.3.0/opencompass/configs/datasets/ds1000/ds1000_compl_gen_cbc84f.py +69 -0
  484. opencompass-0.3.0/opencompass/configs/datasets/ds1000/ds1000_compl_service_eval_gen_cbc84f.py +68 -0
  485. opencompass-0.3.0/opencompass/configs/datasets/ds1000/ds1000_gen_5c4bec.py +84 -0
  486. opencompass-0.3.0/opencompass/configs/datasets/ds1000/ds1000_gen_cbc84f.py +67 -0
  487. opencompass-0.3.0/opencompass/configs/datasets/ds1000/ds1000_service_eval_gen_cbc84f.py +67 -0
  488. opencompass-0.3.0/opencompass/configs/datasets/flames/README.md +86 -0
  489. opencompass-0.3.0/opencompass/configs/datasets/flames/flames_gen.py +4 -0
  490. opencompass-0.3.0/opencompass/configs/datasets/flames/flames_gen_1a58bb.py +62 -0
  491. opencompass-0.3.0/opencompass/configs/datasets/flores/flores_gen.py +4 -0
  492. opencompass-0.3.0/opencompass/configs/datasets/flores/flores_gen_806ede.py +162 -0
  493. opencompass-0.3.0/opencompass/configs/datasets/flores/flores_gen_aad4fd.py +155 -0
  494. opencompass-0.3.0/opencompass/configs/datasets/game24/game24_gen.py +4 -0
  495. opencompass-0.3.0/opencompass/configs/datasets/game24/game24_gen_52a460.py +34 -0
  496. opencompass-0.3.0/opencompass/configs/datasets/govrepcrs/govrepcrs_gen.py +4 -0
  497. opencompass-0.3.0/opencompass/configs/datasets/govrepcrs/govrepcrs_gen_aa5eb3.py +36 -0
  498. opencompass-0.3.0/opencompass/configs/datasets/govrepcrs/govrepcrs_gen_db7930.py +48 -0
  499. opencompass-0.3.0/opencompass/configs/datasets/gpqa/README.md +69 -0
  500. opencompass-0.3.0/opencompass/configs/datasets/gpqa/gpqa_gen.py +4 -0
  501. opencompass-0.3.0/opencompass/configs/datasets/gpqa/gpqa_gen_015262.py +46 -0
  502. opencompass-0.3.0/opencompass/configs/datasets/gpqa/gpqa_gen_4baadb.py +46 -0
  503. opencompass-0.3.0/opencompass/configs/datasets/gpqa/gpqa_openai_simple_evals_gen_5aeece.py +52 -0
  504. opencompass-0.3.0/opencompass/configs/datasets/gpqa/gpqa_ppl_6bf57a.py +40 -0
  505. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/README.md +69 -0
  506. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/deprecated_gsm8k_agent_gen_be1606.py +55 -0
  507. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_0shot_gen_a58960.py +36 -0
  508. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_0shot_v2_gen_a58960.py +37 -0
  509. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_agent_gen_c3dff3.py +55 -0
  510. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen.py +4 -0
  511. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen_17d0dc.py +38 -0
  512. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen_1d7fe4.py +40 -0
  513. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen_1dce88.py +85 -0
  514. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen_3309bd.py +38 -0
  515. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen_57b0b1.py +83 -0
  516. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen_701491.py +32 -0
  517. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen_a3e34a.py +88 -0
  518. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen_d6de81.py +36 -0
  519. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen_e9e91e.py +53 -0
  520. opencompass-0.3.0/opencompass/configs/datasets/gsm8k/gsm8k_gen_ee684f.py +87 -0
  521. opencompass-0.3.0/opencompass/configs/datasets/gsm8k_contamination/gsm8k_contamination_ppl_ecdd22.py +57 -0
  522. opencompass-0.3.0/opencompass/configs/datasets/gsm_hard/gsmhard_gen.py +4 -0
  523. opencompass-0.3.0/opencompass/configs/datasets/gsm_hard/gsmhard_gen_8a1400.py +36 -0
  524. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/README.md +69 -0
  525. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/hellaswag_10shot_gen_e42710.py +58 -0
  526. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/hellaswag_10shot_ppl_59c85e.py +45 -0
  527. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/hellaswag_clean_ppl.py +35 -0
  528. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/hellaswag_gen.py +4 -0
  529. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/hellaswag_gen_6faab5.py +44 -0
  530. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/hellaswag_ppl.py +4 -0
  531. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/hellaswag_ppl_47bff9.py +34 -0
  532. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/hellaswag_ppl_7d7f2d.py +33 -0
  533. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/hellaswag_ppl_9dbb12.py +34 -0
  534. opencompass-0.3.0/opencompass/configs/datasets/hellaswag/hellaswag_ppl_a6e128.py +41 -0
  535. opencompass-0.3.0/opencompass/configs/datasets/humaneval/README.md +69 -0
  536. opencompass-0.3.0/opencompass/configs/datasets/humaneval/deprecated_humaneval_gen_4a6eef.py +36 -0
  537. opencompass-0.3.0/opencompass/configs/datasets/humaneval/deprecated_humaneval_gen_6d1cc2.py +36 -0
  538. opencompass-0.3.0/opencompass/configs/datasets/humaneval/deprecated_humaneval_gen_a82cae.py +36 -0
  539. opencompass-0.3.0/opencompass/configs/datasets/humaneval/deprecated_humaneval_gen_d2537e.py +33 -0
  540. opencompass-0.3.0/opencompass/configs/datasets/humaneval/deprecated_humaneval_gen_fd5822.py +31 -0
  541. opencompass-0.3.0/opencompass/configs/datasets/humaneval/deprecated_humaneval_gen_ff7054.py +41 -0
  542. opencompass-0.3.0/opencompass/configs/datasets/humaneval/humaneval_gen.py +4 -0
  543. opencompass-0.3.0/opencompass/configs/datasets/humaneval/humaneval_gen_66a7f4.py +35 -0
  544. opencompass-0.3.0/opencompass/configs/datasets/humaneval/humaneval_gen_8e312c.py +37 -0
  545. opencompass-0.3.0/opencompass/configs/datasets/humaneval/humaneval_openai_sample_evals_gen_159614.py +36 -0
  546. opencompass-0.3.0/opencompass/configs/datasets/humaneval/humaneval_passk_gen_8e312c.py +36 -0
  547. opencompass-0.3.0/opencompass/configs/datasets/humaneval/humaneval_repeat10_gen_8e312c.py +37 -0
  548. opencompass-0.3.0/opencompass/configs/datasets/humaneval_cn/humaneval_cn_gen.py +4 -0
  549. opencompass-0.3.0/opencompass/configs/datasets/humaneval_cn/humaneval_cn_gen_6313aa.py +37 -0
  550. opencompass-0.3.0/opencompass/configs/datasets/humaneval_cn/humaneval_cn_passk_gen_6313aa.py +37 -0
  551. opencompass-0.3.0/opencompass/configs/datasets/humaneval_cn/humaneval_cn_repeat10_gen_6313aa.py +38 -0
  552. opencompass-0.3.0/opencompass/configs/datasets/humaneval_multi/humaneval_multi_gen.py +4 -0
  553. opencompass-0.3.0/opencompass/configs/datasets/humaneval_multi/humaneval_multi_gen_82cf85.py +46 -0
  554. opencompass-0.3.0/opencompass/configs/datasets/humaneval_plus/humaneval_plus_gen.py +4 -0
  555. opencompass-0.3.0/opencompass/configs/datasets/humaneval_plus/humaneval_plus_gen_66a7f4.py +35 -0
  556. opencompass-0.3.0/opencompass/configs/datasets/humaneval_plus/humaneval_plus_gen_8e312c.py +37 -0
  557. opencompass-0.3.0/opencompass/configs/datasets/humaneval_plus/humaneval_plus_passk_gen_8e312c.py +36 -0
  558. opencompass-0.3.0/opencompass/configs/datasets/humaneval_plus/humaneval_plus_repeat10_gen_8e312c.py +37 -0
  559. opencompass-0.3.0/opencompass/configs/datasets/humanevalx/humanevalx_gen.py +4 -0
  560. opencompass-0.3.0/opencompass/configs/datasets/humanevalx/humanevalx_gen_0af626.py +60 -0
  561. opencompass-0.3.0/opencompass/configs/datasets/humanevalx/humanevalx_gen_620cfa.py +41 -0
  562. opencompass-0.3.0/opencompass/configs/datasets/hungarian_exam/hungarian_exam_gen.py +4 -0
  563. opencompass-0.3.0/opencompass/configs/datasets/hungarian_exam/hungarian_exam_gen_8a1435.py +91 -0
  564. opencompass-0.3.0/opencompass/configs/datasets/inference_ppl/README.md +26 -0
  565. opencompass-0.3.0/opencompass/configs/datasets/inference_ppl/inference_ppl.py +38 -0
  566. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebench.py +17 -0
  567. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchcodedebug/infinitebench_codedebug_gen.py +4 -0
  568. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchcodedebug/infinitebench_codedebug_gen_276a42.py +43 -0
  569. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchcoderun/infinitebench_coderun_gen.py +4 -0
  570. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchcoderun/infinitebench_coderun_gen_1a76bd.py +43 -0
  571. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchendia/infinitebench_endia_gen.py +4 -0
  572. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchendia/infinitebench_endia_gen_c96eb5.py +40 -0
  573. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchenmc/infinitebench_enmc_gen.py +4 -0
  574. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchenmc/infinitebench_enmc_gen_3a4102.py +43 -0
  575. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchenqa/infinitebench_enqa_gen.py +4 -0
  576. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchenqa/infinitebench_enqa_gen_a1640c.py +40 -0
  577. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchensum/infinitebench_ensum_gen.py +4 -0
  578. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchensum/infinitebench_ensum_gen_cfbc08.py +41 -0
  579. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchmathcalc/infinitebench_mathcalc_gen.py +4 -0
  580. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchmathcalc/infinitebench_mathcalc_gen_78d17e.py +40 -0
  581. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchmathfind/infinitebench_mathfind_gen.py +4 -0
  582. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchmathfind/infinitebench_mathfind_gen_6d799e.py +43 -0
  583. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchretrievekv/infinitebench_retrievekv_gen.py +4 -0
  584. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchretrievekv/infinitebench_retrievekv_gen_06b3ac.py +40 -0
  585. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchretrievenumber/infinitebench_retrievenumber_gen.py +4 -0
  586. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchretrievenumber/infinitebench_retrievenumber_gen_047436.py +43 -0
  587. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchretrievepasskey/infinitebench_retrievepasskey_gen.py +4 -0
  588. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchretrievepasskey/infinitebench_retrievepasskey_gen_62ff68.py +43 -0
  589. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchzhqa/infinitebench_zhqa_gen.py +4 -0
  590. opencompass-0.3.0/opencompass/configs/datasets/infinitebench/infinitebenchzhqa/infinitebench_zhqa_gen_1e5293.py +41 -0
  591. opencompass-0.3.0/opencompass/configs/datasets/iwslt2017/iwslt2017_gen.py +4 -0
  592. opencompass-0.3.0/opencompass/configs/datasets/iwslt2017/iwslt2017_gen_69ce16.py +32 -0
  593. opencompass-0.3.0/opencompass/configs/datasets/iwslt2017/iwslt2017_gen_b4a814.py +41 -0
  594. opencompass-0.3.0/opencompass/configs/datasets/iwslt2017/iwslt2017_gen_d0ebd1.py +39 -0
  595. opencompass-0.3.0/opencompass/configs/datasets/jigsawmultilingual/jigsawmultilingual_clp.py +4 -0
  596. opencompass-0.3.0/opencompass/configs/datasets/jigsawmultilingual/jigsawmultilingual_clp_1af0ae.py +43 -0
  597. opencompass-0.3.0/opencompass/configs/datasets/jigsawmultilingual/jigsawmultilingual_clp_fe50d8.py +47 -0
  598. opencompass-0.3.0/opencompass/configs/datasets/kaoshi/kaoshi_gen.py +4 -0
  599. opencompass-0.3.0/opencompass/configs/datasets/kaoshi/kaoshi_gen_86aca2.py +76 -0
  600. opencompass-0.3.0/opencompass/configs/datasets/lambada/lambada_gen.py +4 -0
  601. opencompass-0.3.0/opencompass/configs/datasets/lambada/lambada_gen_217e11.py +33 -0
  602. opencompass-0.3.0/opencompass/configs/datasets/lambada/lambada_gen_8b48a5.py +29 -0
  603. opencompass-0.3.0/opencompass/configs/datasets/lawbench/lawbench_one_shot_gen_002588.py +62 -0
  604. opencompass-0.3.0/opencompass/configs/datasets/lawbench/lawbench_zero_shot_gen_002588.py +62 -0
  605. opencompass-0.3.0/opencompass/configs/datasets/lcsts/lcsts_gen.py +4 -0
  606. opencompass-0.3.0/opencompass/configs/datasets/lcsts/lcsts_gen_8ee1fe.py +32 -0
  607. opencompass-0.3.0/opencompass/configs/datasets/lcsts/lcsts_gen_9b0b89.py +28 -0
  608. opencompass-0.3.0/opencompass/configs/datasets/leval/leval.py +23 -0
  609. opencompass-0.3.0/opencompass/configs/datasets/leval/levalcoursera/leval_coursera_gen.py +4 -0
  610. opencompass-0.3.0/opencompass/configs/datasets/leval/levalcoursera/leval_coursera_gen_36a006.py +45 -0
  611. opencompass-0.3.0/opencompass/configs/datasets/leval/levalfinancialqa/leval_financialqa_gen.py +4 -0
  612. opencompass-0.3.0/opencompass/configs/datasets/leval/levalfinancialqa/leval_financialqa_gen_b03798.py +43 -0
  613. opencompass-0.3.0/opencompass/configs/datasets/leval/levalgovreportsumm/leval_gov_report_summ_gen.py +4 -0
  614. opencompass-0.3.0/opencompass/configs/datasets/leval/levalgovreportsumm/leval_gov_report_summ_gen_b03798.py +43 -0
  615. opencompass-0.3.0/opencompass/configs/datasets/leval/levalgsm100/leval_gsm100_gen.py +4 -0
  616. opencompass-0.3.0/opencompass/configs/datasets/leval/levalgsm100/leval_gsm100_gen_77dd94.py +46 -0
  617. opencompass-0.3.0/opencompass/configs/datasets/leval/levallegalcontractqa/leval_legalcontractqa_gen.py +4 -0
  618. opencompass-0.3.0/opencompass/configs/datasets/leval/levallegalcontractqa/leval_legalcontractqa_gen_68a2ac.py +43 -0
  619. opencompass-0.3.0/opencompass/configs/datasets/leval/levalmeetingsumm/leval_meetingsumm_gen.py +4 -0
  620. opencompass-0.3.0/opencompass/configs/datasets/leval/levalmeetingsumm/leval_meetingsumm_gen_b03798.py +43 -0
  621. opencompass-0.3.0/opencompass/configs/datasets/leval/levalmultidocqa/leval_multidocqa_gen.py +4 -0
  622. opencompass-0.3.0/opencompass/configs/datasets/leval/levalmultidocqa/leval_multidocqa_gen_96bf3f.py +43 -0
  623. opencompass-0.3.0/opencompass/configs/datasets/leval/levalnarrativeqa/leval_narrativeqa_gen.py +4 -0
  624. opencompass-0.3.0/opencompass/configs/datasets/leval/levalnarrativeqa/leval_narrativeqa_gen_766dd0.py +43 -0
  625. opencompass-0.3.0/opencompass/configs/datasets/leval/levalnaturalquestion/leval_naturalquestion_gen.py +4 -0
  626. opencompass-0.3.0/opencompass/configs/datasets/leval/levalnaturalquestion/leval_naturalquestion_gen_52c33f.py +43 -0
  627. opencompass-0.3.0/opencompass/configs/datasets/leval/levalnewssumm/leval_newssumm_gen.py +4 -0
  628. opencompass-0.3.0/opencompass/configs/datasets/leval/levalnewssumm/leval_newssumm_gen_b03798.py +43 -0
  629. opencompass-0.3.0/opencompass/configs/datasets/leval/levalpaperassistant/leval_paper_assistant_gen.py +4 -0
  630. opencompass-0.3.0/opencompass/configs/datasets/leval/levalpaperassistant/leval_paper_assistant_gen_b03798.py +43 -0
  631. opencompass-0.3.0/opencompass/configs/datasets/leval/levalpatentsumm/leval_patent_summ_gen.py +4 -0
  632. opencompass-0.3.0/opencompass/configs/datasets/leval/levalpatentsumm/leval_patent_summ_gen_b03798.py +43 -0
  633. opencompass-0.3.0/opencompass/configs/datasets/leval/levalquality/leval_quality_gen.py +4 -0
  634. opencompass-0.3.0/opencompass/configs/datasets/leval/levalquality/leval_quality_gen_36a006.py +45 -0
  635. opencompass-0.3.0/opencompass/configs/datasets/leval/levalreviewsumm/leval_review_summ_gen.py +4 -0
  636. opencompass-0.3.0/opencompass/configs/datasets/leval/levalreviewsumm/leval_review_summ_gen_b03798.py +43 -0
  637. opencompass-0.3.0/opencompass/configs/datasets/leval/levalscientificqa/leval_scientificqa_gen.py +4 -0
  638. opencompass-0.3.0/opencompass/configs/datasets/leval/levalscientificqa/leval_scientificqa_gen_96bf3f.py +43 -0
  639. opencompass-0.3.0/opencompass/configs/datasets/leval/levaltopicretrieval/leval_topic_retrieval_gen.py +4 -0
  640. opencompass-0.3.0/opencompass/configs/datasets/leval/levaltopicretrieval/leval_topic_retrieval_gen_bf433f.py +45 -0
  641. opencompass-0.3.0/opencompass/configs/datasets/leval/levaltpo/leval_tpo_gen.py +4 -0
  642. opencompass-0.3.0/opencompass/configs/datasets/leval/levaltpo/leval_tpo_gen_36a006.py +45 -0
  643. opencompass-0.3.0/opencompass/configs/datasets/leval/levaltvshowsumm/leval_tvshow_summ_gen.py +4 -0
  644. opencompass-0.3.0/opencompass/configs/datasets/leval/levaltvshowsumm/leval_tvshow_summ_gen_b03798.py +43 -0
  645. opencompass-0.3.0/opencompass/configs/datasets/llm_compression/README.md +105 -0
  646. opencompass-0.3.0/opencompass/configs/datasets/llm_compression/llm_compression.py +50 -0
  647. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbench.py +26 -0
  648. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbench2wikimqa/longbench_2wikimqa_gen.py +4 -0
  649. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbench2wikimqa/longbench_2wikimqa_gen_6b3efc.py +38 -0
  650. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchdureader/longbench_dureader_gen.py +4 -0
  651. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchdureader/longbench_dureader_gen_c6c7e4.py +38 -0
  652. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchgov_report/longbench_gov_report_gen.py +4 -0
  653. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchgov_report/longbench_gov_report_gen_54c5b0.py +38 -0
  654. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchhotpotqa/longbench_hotpotqa_gen.py +4 -0
  655. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchhotpotqa/longbench_hotpotqa_gen_6b3efc.py +38 -0
  656. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchlcc/longbench_lcc_gen.py +4 -0
  657. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchlcc/longbench_lcc_gen_6ba507.py +38 -0
  658. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchlsht/longbench_lsht_gen.py +4 -0
  659. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchlsht/longbench_lsht_gen_e8a339.py +39 -0
  660. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchmulti_news/longbench_multi_news_gen.py +4 -0
  661. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchmulti_news/longbench_multi_news_gen_6f9da9.py +38 -0
  662. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchmultifieldqa_en/longbench_multifieldqa_en_gen.py +4 -0
  663. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchmultifieldqa_en/longbench_multifieldqa_en_gen_d3838e.py +38 -0
  664. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchmultifieldqa_zh/longbench_multifieldqa_zh_gen.py +4 -0
  665. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchmultifieldqa_zh/longbench_multifieldqa_zh_gen_e9a7ef.py +38 -0
  666. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchmusique/longbench_musique_gen.py +4 -0
  667. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchmusique/longbench_musique_gen_6b3efc.py +38 -0
  668. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchnarrativeqa/longbench_narrativeqa_gen.py +4 -0
  669. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchnarrativeqa/longbench_narrativeqa_gen_a68305.py +38 -0
  670. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchpassage_count/longbench_passage_count_gen.py +4 -0
  671. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchpassage_count/longbench_passage_count_gen_dcdaab.py +38 -0
  672. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchpassage_retrieval_en/longbench_passage_retrieval_en_gen.py +4 -0
  673. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchpassage_retrieval_en/longbench_passage_retrieval_en_gen_734db5.py +38 -0
  674. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchpassage_retrieval_zh/longbench_passage_retrieval_zh_gen.py +4 -0
  675. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchpassage_retrieval_zh/longbench_passage_retrieval_zh_gen_01cca2.py +38 -0
  676. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchqasper/longbench_qasper_gen.py +4 -0
  677. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchqasper/longbench_qasper_gen_6b3efc.py +38 -0
  678. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchqmsum/longbench_qmsum_gen.py +4 -0
  679. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchqmsum/longbench_qmsum_gen_d33331.py +38 -0
  680. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchrepobench/longbench_repobench_gen.py +4 -0
  681. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchrepobench/longbench_repobench_gen_6df953.py +38 -0
  682. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchsamsum/longbench_samsum_gen.py +4 -0
  683. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchsamsum/longbench_samsum_gen_f4416d.py +39 -0
  684. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchtrec/longbench_trec_gen.py +4 -0
  685. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchtrec/longbench_trec_gen_824187.py +39 -0
  686. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchtriviaqa/longbench_triviaqa_gen.py +4 -0
  687. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchtriviaqa/longbench_triviaqa_gen_d30cb9.py +39 -0
  688. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchvcsum/longbench_vcsum_gen.py +4 -0
  689. opencompass-0.3.0/opencompass/configs/datasets/longbench/longbenchvcsum/longbench_vcsum_gen_f7a8ac.py +38 -0
  690. opencompass-0.3.0/opencompass/configs/datasets/lveval/lveval.md +165 -0
  691. opencompass-0.3.0/opencompass/configs/datasets/lveval/lveval.py +38 -0
  692. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalcmrc_mixup/lveval_cmrc_mixup_gen.py +6 -0
  693. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalcmrc_mixup/lveval_cmrc_mixup_gen_465823.py +54 -0
  694. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevaldureader_mixup/lveval_dureader_mixup_gen.py +6 -0
  695. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevaldureader_mixup/lveval_dureader_mixup_gen_465823.py +55 -0
  696. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalfactrecall_en/lveval_factrecall_en_gen.py +6 -0
  697. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalfactrecall_en/lveval_factrecall_en_gen_9a836f.py +54 -0
  698. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalfactrecall_zh/lveval_factrecall_zh_gen.py +6 -0
  699. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalfactrecall_zh/lveval_factrecall_zh_gen_dbee70.py +54 -0
  700. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalhotpotwikiqa_mixup/lveval_hotpotwikiqa_mixup_gen.py +6 -0
  701. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalhotpotwikiqa_mixup/lveval_hotpotwikiqa_mixup_gen_77ce82.py +59 -0
  702. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevallic_mixup/lveval_lic_mixup_gen.py +6 -0
  703. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevallic_mixup/lveval_lic_mixup_gen_01eb0c.py +54 -0
  704. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalloogle_CR_mixup/lveval_loogle_CR_mixup_gen.py +6 -0
  705. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalloogle_CR_mixup/lveval_loogle_CR_mixup_gen_d7ea36.py +54 -0
  706. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalloogle_MIR_mixup/lveval_loogle_MIR_mixup_gen.py +6 -0
  707. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalloogle_MIR_mixup/lveval_loogle_MIR_mixup_gen_d7ea36.py +54 -0
  708. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalloogle_SD_mixup/lveval_loogle_SD_mixup_gen.py +6 -0
  709. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalloogle_SD_mixup/lveval_loogle_SD_mixup_gen_d7ea36.py +54 -0
  710. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalmultifieldqa_en_mixup/lveval_multifieldqa_en_mixup_gen.py +6 -0
  711. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalmultifieldqa_en_mixup/lveval_multifieldqa_en_mixup_gen_d7ea36.py +59 -0
  712. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalmultifieldqa_zh_mixup/lveval_multifieldqa_zh_mixup_gen.py +6 -0
  713. opencompass-0.3.0/opencompass/configs/datasets/lveval/lvevalmultifieldqa_zh_mixup/lveval_multifieldqa_zh_mixup_gen_0fbdad.py +59 -0
  714. opencompass-0.3.0/opencompass/configs/datasets/mastermath2024v1/mastermath2024v1_gen.py +4 -0
  715. opencompass-0.3.0/opencompass/configs/datasets/mastermath2024v1/mastermath2024v1_gen_be6318.py +36 -0
  716. opencompass-0.3.0/opencompass/configs/datasets/math/README.md +69 -0
  717. opencompass-0.3.0/opencompass/configs/datasets/math/deprecated_math_agent_evaluatorv2_gen_861b4f.py +90 -0
  718. opencompass-0.3.0/opencompass/configs/datasets/math/deprecated_math_evaluatorv2_gen_265cce.py +38 -0
  719. opencompass-0.3.0/opencompass/configs/datasets/math/math_0shot_gen_393424.py +35 -0
  720. opencompass-0.3.0/opencompass/configs/datasets/math/math_4shot_base_gen_db136b.py +30 -0
  721. opencompass-0.3.0/opencompass/configs/datasets/math/math_4shot_example_from_google_research.py +40 -0
  722. opencompass-0.3.0/opencompass/configs/datasets/math/math_agent_evaluatorv2_gen_0c1b4e.py +100 -0
  723. opencompass-0.3.0/opencompass/configs/datasets/math/math_agent_gen_0c1b4e.py +99 -0
  724. opencompass-0.3.0/opencompass/configs/datasets/math/math_agent_gen_861b4f.py +90 -0
  725. opencompass-0.3.0/opencompass/configs/datasets/math/math_agent_gen_af2293.py +103 -0
  726. opencompass-0.3.0/opencompass/configs/datasets/math/math_evaluatorv2_gen_2f4a71.py +56 -0
  727. opencompass-0.3.0/opencompass/configs/datasets/math/math_evaluatorv2_gen_cecb31.py +38 -0
  728. opencompass-0.3.0/opencompass/configs/datasets/math/math_gen.py +4 -0
  729. opencompass-0.3.0/opencompass/configs/datasets/math/math_gen_0957ff.py +36 -0
  730. opencompass-0.3.0/opencompass/configs/datasets/math/math_gen_1ed9c2.py +36 -0
  731. opencompass-0.3.0/opencompass/configs/datasets/math/math_gen_265cce.py +36 -0
  732. opencompass-0.3.0/opencompass/configs/datasets/math/math_gen_559593.py +53 -0
  733. opencompass-0.3.0/opencompass/configs/datasets/math/math_gen_5e8458.py +53 -0
  734. opencompass-0.3.0/opencompass/configs/datasets/math/math_gen_736506.py +28 -0
  735. opencompass-0.3.0/opencompass/configs/datasets/math/math_gen_78ced2.py +37 -0
  736. opencompass-0.3.0/opencompass/configs/datasets/math/math_gen_943d32.py +64 -0
  737. opencompass-0.3.0/opencompass/configs/datasets/math/math_intern_evaluator_gen_265cce.py +37 -0
  738. opencompass-0.3.0/opencompass/configs/datasets/math/math_llm_judge.py +35 -0
  739. opencompass-0.3.0/opencompass/configs/datasets/math401/math401_gen.py +4 -0
  740. opencompass-0.3.0/opencompass/configs/datasets/math401/math401_gen_ab5f39.py +47 -0
  741. opencompass-0.3.0/opencompass/configs/datasets/mbpp/README.md +69 -0
  742. opencompass-0.3.0/opencompass/configs/datasets/mbpp/deprecated_mbpp_gen_1e1056.py +42 -0
  743. opencompass-0.3.0/opencompass/configs/datasets/mbpp/deprecated_mbpp_gen_6590b0.py +28 -0
  744. opencompass-0.3.0/opencompass/configs/datasets/mbpp/deprecated_mbpp_gen_caa7ab.py +42 -0
  745. opencompass-0.3.0/opencompass/configs/datasets/mbpp/deprecated_mbpp_passk_gen_1e1056.py +42 -0
  746. opencompass-0.3.0/opencompass/configs/datasets/mbpp/deprecated_mbpp_repeat10_gen_1e1056.py +45 -0
  747. opencompass-0.3.0/opencompass/configs/datasets/mbpp/deprecated_sanitized_mbpp_gen_1e1056.py +42 -0
  748. opencompass-0.3.0/opencompass/configs/datasets/mbpp/deprecated_sanitized_mbpp_gen_cb43ef.py +81 -0
  749. opencompass-0.3.0/opencompass/configs/datasets/mbpp/deprecated_sanitized_mbpp_passk_gen_1e1056.py +42 -0
  750. opencompass-0.3.0/opencompass/configs/datasets/mbpp/deprecated_sanitized_mbpp_repeat10_gen_1e1056.py +43 -0
  751. opencompass-0.3.0/opencompass/configs/datasets/mbpp/mbpp_gen.py +4 -0
  752. opencompass-0.3.0/opencompass/configs/datasets/mbpp/mbpp_gen_830460.py +42 -0
  753. opencompass-0.3.0/opencompass/configs/datasets/mbpp/mbpp_passk_gen_830460.py +42 -0
  754. opencompass-0.3.0/opencompass/configs/datasets/mbpp/mbpp_repeat10_gen_830460.py +45 -0
  755. opencompass-0.3.0/opencompass/configs/datasets/mbpp/sanitized_mbpp_gen_742f0c.py +82 -0
  756. opencompass-0.3.0/opencompass/configs/datasets/mbpp/sanitized_mbpp_gen_830460.py +42 -0
  757. opencompass-0.3.0/opencompass/configs/datasets/mbpp/sanitized_mbpp_gen_a0fc46.py +41 -0
  758. opencompass-0.3.0/opencompass/configs/datasets/mbpp/sanitized_mbpp_mdblock_gen_a447ff.py +41 -0
  759. opencompass-0.3.0/opencompass/configs/datasets/mbpp/sanitized_mbpp_passk_gen_830460.py +42 -0
  760. opencompass-0.3.0/opencompass/configs/datasets/mbpp/sanitized_mbpp_repeat10_gen_830460.py +43 -0
  761. opencompass-0.3.0/opencompass/configs/datasets/mbpp_cn/deprecated_mbpp_cn_gen_1d1481.py +64 -0
  762. opencompass-0.3.0/opencompass/configs/datasets/mbpp_cn/deprecated_mbpp_cn_passk_gen_1d1481.py +64 -0
  763. opencompass-0.3.0/opencompass/configs/datasets/mbpp_cn/deprecated_mbpp_cn_repeat10_gen_1d1481.py +65 -0
  764. opencompass-0.3.0/opencompass/configs/datasets/mbpp_cn/mbpp_cn_gen.py +4 -0
  765. opencompass-0.3.0/opencompass/configs/datasets/mbpp_cn/mbpp_cn_gen_9114d5.py +65 -0
  766. opencompass-0.3.0/opencompass/configs/datasets/mbpp_plus/deprecated_mbpp_plus_gen_94815c.py +64 -0
  767. opencompass-0.3.0/opencompass/configs/datasets/mbpp_plus/mbpp_plus_gen.py +4 -0
  768. opencompass-0.3.0/opencompass/configs/datasets/mbpp_plus/mbpp_plus_gen_0b836a.py +64 -0
  769. opencompass-0.3.0/opencompass/configs/datasets/mgsm/README.md +67 -0
  770. opencompass-0.3.0/opencompass/configs/datasets/mgsm/mgsm_gen.py +4 -0
  771. opencompass-0.3.0/opencompass/configs/datasets/mgsm/mgsm_gen_d967bc.py +56 -0
  772. opencompass-0.3.0/opencompass/configs/datasets/mmlu/README.md +368 -0
  773. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_all_sets.py +59 -0
  774. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_clean_ppl.py +114 -0
  775. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_gen.py +4 -0
  776. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_gen_23a9a9.py +124 -0
  777. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_gen_4d595a.py +123 -0
  778. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_gen_5d1409.py +124 -0
  779. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_gen_79e572.py +110 -0
  780. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_gen_a484b3.py +124 -0
  781. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_openai_simple_evals_gen_b618ea.py +59 -0
  782. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_ppl.py +4 -0
  783. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_ppl_ac766d.py +106 -0
  784. opencompass-0.3.0/opencompass/configs/datasets/mmlu/mmlu_zero_shot_gen_47e2c0.py +123 -0
  785. opencompass-0.3.0/opencompass/configs/datasets/mmlu_pro/mmlu_pro_0shot_cot_gen_08c1de.py +63 -0
  786. opencompass-0.3.0/opencompass/configs/datasets/mmlu_pro/mmlu_pro_categories.py +16 -0
  787. opencompass-0.3.0/opencompass/configs/datasets/mmlu_pro/mmlu_pro_gen_cdbebf.py +58 -0
  788. opencompass-0.3.0/opencompass/configs/datasets/narrativeqa/narrativeqa_gen.py +4 -0
  789. opencompass-0.3.0/opencompass/configs/datasets/narrativeqa/narrativeqa_gen_a2d88a.py +30 -0
  790. opencompass-0.3.0/opencompass/configs/datasets/narrativeqa/narrativeqa_gen_db6413.py +37 -0
  791. opencompass-0.3.0/opencompass/configs/datasets/needlebench/atc/atc.py +104 -0
  792. opencompass-0.3.0/opencompass/configs/datasets/needlebench/atc/atc_choice.py +134 -0
  793. opencompass-0.3.0/opencompass/configs/datasets/needlebench/atc/atc_choice_20.py +132 -0
  794. opencompass-0.3.0/opencompass/configs/datasets/needlebench/atc/atc_choice_50.py +42 -0
  795. opencompass-0.3.0/opencompass/configs/datasets/needlebench/atc/atc_choice_50_en_reasoning.py +96 -0
  796. opencompass-0.3.0/opencompass/configs/datasets/needlebench/atc/atc_choice_80.py +42 -0
  797. opencompass-0.3.0/opencompass/configs/datasets/needlebench/atc/atc_choice_80_en_reasoning.py +96 -0
  798. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_1000k/needlebench_1000k.py +18 -0
  799. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_1000k/needlebench_multi_reasoning_1000k.py +286 -0
  800. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_1000k/needlebench_multi_retrieval_1000k.py +108 -0
  801. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_1000k/needlebench_single_1000k.py +109 -0
  802. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_128k/needlebench_128k.py +18 -0
  803. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_128k/needlebench_multi_reasoning_128k.py +288 -0
  804. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_128k/needlebench_multi_retrieval_128k.py +108 -0
  805. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_128k/needlebench_single_128k.py +111 -0
  806. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_200k/needlebench_200k.py +18 -0
  807. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_200k/needlebench_multi_reasoning_200k.py +287 -0
  808. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_200k/needlebench_multi_retrieval_200k.py +109 -0
  809. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_200k/needlebench_single_200k.py +110 -0
  810. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_256k/needlebench_256k.py +18 -0
  811. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_256k/needlebench_multi_reasoning_256k.py +287 -0
  812. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_256k/needlebench_multi_retrieval_256k.py +109 -0
  813. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_256k/needlebench_single_256k.py +110 -0
  814. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_32k/needlebench_32k.py +18 -0
  815. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_32k/needlebench_multi_reasoning_32k.py +288 -0
  816. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_32k/needlebench_multi_retrieval_32k.py +108 -0
  817. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_32k/needlebench_single_32k.py +111 -0
  818. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_4k/needlebench_4k.py +18 -0
  819. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_4k/needlebench_multi_reasoning_4k.py +303 -0
  820. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_4k/needlebench_multi_retrieval_4k.py +111 -0
  821. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_4k/needlebench_single_4k.py +114 -0
  822. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_8k/needlebench_8k.py +18 -0
  823. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_8k/needlebench_multi_reasoning_8k.py +303 -0
  824. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_8k/needlebench_multi_retrieval_8k.py +111 -0
  825. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_8k/needlebench_multi_retrieval_compare_batch_8k.py +120 -0
  826. opencompass-0.3.0/opencompass/configs/datasets/needlebench/needlebench_8k/needlebench_single_8k.py +114 -0
  827. opencompass-0.3.0/opencompass/configs/datasets/needlebench/readme.md +53 -0
  828. opencompass-0.3.0/opencompass/configs/datasets/needlebench/readme_zh-CN.md +53 -0
  829. opencompass-0.3.0/opencompass/configs/datasets/nq/README.md +69 -0
  830. opencompass-0.3.0/opencompass/configs/datasets/nq/nq_gen.py +4 -0
  831. opencompass-0.3.0/opencompass/configs/datasets/nq/nq_gen_0356ec.py +61 -0
  832. opencompass-0.3.0/opencompass/configs/datasets/nq/nq_gen_2463e2.py +27 -0
  833. opencompass-0.3.0/opencompass/configs/datasets/nq/nq_gen_3dcea1.py +29 -0
  834. opencompass-0.3.0/opencompass/configs/datasets/nq/nq_gen_68c1c6.py +30 -0
  835. opencompass-0.3.0/opencompass/configs/datasets/nq/nq_gen_c788f6.py +30 -0
  836. opencompass-0.3.0/opencompass/configs/datasets/nq/nq_open_1shot_gen_01cf41.py +61 -0
  837. opencompass-0.3.0/opencompass/configs/datasets/nq/nq_open_1shot_gen_20a989.py +45 -0
  838. opencompass-0.3.0/opencompass/configs/datasets/nq/nq_open_1shot_gen_2e45e5.py +61 -0
  839. opencompass-0.3.0/opencompass/configs/datasets/nq/nq_open_gen_e93f8a.py +61 -0
  840. opencompass-0.3.0/opencompass/configs/datasets/nq_cn/nqcn_gen.py +4 -0
  841. opencompass-0.3.0/opencompass/configs/datasets/nq_cn/nqcn_gen_141737.py +34 -0
  842. opencompass-0.3.0/opencompass/configs/datasets/obqa/obqa_gen.py +4 -0
  843. opencompass-0.3.0/opencompass/configs/datasets/obqa/obqa_gen_9069e4.py +62 -0
  844. opencompass-0.3.0/opencompass/configs/datasets/obqa/obqa_ppl.py +4 -0
  845. opencompass-0.3.0/opencompass/configs/datasets/obqa/obqa_ppl_1defe8.py +51 -0
  846. opencompass-0.3.0/opencompass/configs/datasets/obqa/obqa_ppl_6aac9e.py +42 -0
  847. opencompass-0.3.0/opencompass/configs/datasets/obqa/obqa_ppl_c7c154.py +66 -0
  848. opencompass-0.3.0/opencompass/configs/datasets/piqa/piqa_gen.py +4 -0
  849. opencompass-0.3.0/opencompass/configs/datasets/piqa/piqa_gen_1194eb.py +41 -0
  850. opencompass-0.3.0/opencompass/configs/datasets/piqa/piqa_ppl.py +4 -0
  851. opencompass-0.3.0/opencompass/configs/datasets/piqa/piqa_ppl_0cfff2.py +37 -0
  852. opencompass-0.3.0/opencompass/configs/datasets/piqa/piqa_ppl_1cf9f0.py +32 -0
  853. opencompass-0.3.0/opencompass/configs/datasets/piqa/piqa_ppl_3431ea.py +42 -0
  854. opencompass-0.3.0/opencompass/configs/datasets/promptbench/promptbench_iwslt2017_gen_cbb8c8.py +57 -0
  855. opencompass-0.3.0/opencompass/configs/datasets/promptbench/promptbench_math_gen_abf776.py +44 -0
  856. opencompass-0.3.0/opencompass/configs/datasets/promptbench/promptbench_squad20_gen_b15d1c.py +48 -0
  857. opencompass-0.3.0/opencompass/configs/datasets/promptbench/promptbench_wnli_gen_50662f.py +61 -0
  858. opencompass-0.3.0/opencompass/configs/datasets/py150/py150_gen.py +4 -0
  859. opencompass-0.3.0/opencompass/configs/datasets/py150/py150_gen_38b13d.py +42 -0
  860. opencompass-0.3.0/opencompass/configs/datasets/qabench/qabench_gen.py +4 -0
  861. opencompass-0.3.0/opencompass/configs/datasets/qabench/qabench_gen_353ae7.py +29 -0
  862. opencompass-0.3.0/opencompass/configs/datasets/qasper/qasper_gen.py +4 -0
  863. opencompass-0.3.0/opencompass/configs/datasets/qasper/qasper_gen_a2d88a.py +30 -0
  864. opencompass-0.3.0/opencompass/configs/datasets/qasper/qasper_gen_db6413.py +36 -0
  865. opencompass-0.3.0/opencompass/configs/datasets/qaspercut/qaspercut_gen.py +4 -0
  866. opencompass-0.3.0/opencompass/configs/datasets/qaspercut/qaspercut_gen_a2d88a.py +30 -0
  867. opencompass-0.3.0/opencompass/configs/datasets/qaspercut/qaspercut_gen_db6413.py +37 -0
  868. opencompass-0.3.0/opencompass/configs/datasets/race/README.md +69 -0
  869. opencompass-0.3.0/opencompass/configs/datasets/race/race_gen.py +4 -0
  870. opencompass-0.3.0/opencompass/configs/datasets/race/race_gen_69ee4f.py +50 -0
  871. opencompass-0.3.0/opencompass/configs/datasets/race/race_gen_9302a5.py +44 -0
  872. opencompass-0.3.0/opencompass/configs/datasets/race/race_ppl.py +4 -0
  873. opencompass-0.3.0/opencompass/configs/datasets/race/race_ppl_5831a0.py +48 -0
  874. opencompass-0.3.0/opencompass/configs/datasets/race/race_ppl_a138cd.py +50 -0
  875. opencompass-0.3.0/opencompass/configs/datasets/race/race_ppl_abed12.py +42 -0
  876. opencompass-0.3.0/opencompass/configs/datasets/realtoxicprompts/realtoxicprompts_gen.py +4 -0
  877. opencompass-0.3.0/opencompass/configs/datasets/realtoxicprompts/realtoxicprompts_gen_7605e4.py +37 -0
  878. opencompass-0.3.0/opencompass/configs/datasets/realtoxicprompts/realtoxicprompts_gen_ac723c.py +35 -0
  879. opencompass-0.3.0/opencompass/configs/datasets/rolebench/instruction_generalization_eng.py +41 -0
  880. opencompass-0.3.0/opencompass/configs/datasets/rolebench/instruction_generalization_zh.py +41 -0
  881. opencompass-0.3.0/opencompass/configs/datasets/rolebench/role_generalization_eng.py +41 -0
  882. opencompass-0.3.0/opencompass/configs/datasets/s3eval/s3eval.md +139 -0
  883. opencompass-0.3.0/opencompass/configs/datasets/s3eval/s3eval_gen.py +4 -0
  884. opencompass-0.3.0/opencompass/configs/datasets/s3eval/s3eval_gen_b8ac80.py +17 -0
  885. opencompass-0.3.0/opencompass/configs/datasets/safety/safety_gen.py +4 -0
  886. opencompass-0.3.0/opencompass/configs/datasets/safety/safety_gen_7ce197.py +30 -0
  887. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/atkins_prompt.txt +18 -0
  888. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/atkins_sol.txt +101 -0
  889. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/calculus_prompt.txt +17 -0
  890. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/calculus_sol.txt +62 -0
  891. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/chemmc_prompt.txt +17 -0
  892. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/chemmc_sol.txt +108 -0
  893. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/class_prompt.txt +17 -0
  894. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/class_sol.txt +169 -0
  895. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/diff_prompt.txt +17 -0
  896. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/diff_sol.txt +112 -0
  897. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/fund_prompt.txt +20 -0
  898. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/fund_sol.txt +135 -0
  899. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/matter_prompt.txt +21 -0
  900. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/matter_sol.txt +120 -0
  901. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/quan_prompt.txt +17 -0
  902. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/quan_sol.txt +75 -0
  903. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/stat_prompt.txt +17 -0
  904. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/stat_sol.txt +48 -0
  905. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/thermo_prompt.txt +20 -0
  906. opencompass-0.3.0/opencompass/configs/datasets/scibench/lib_prompt/thermo_sol.txt +112 -0
  907. opencompass-0.3.0/opencompass/configs/datasets/scibench/scibench_gen.py +4 -0
  908. opencompass-0.3.0/opencompass/configs/datasets/scibench/scibench_gen_2b21f3.py +71 -0
  909. opencompass-0.3.0/opencompass/configs/datasets/siqa/siqa_gen.py +4 -0
  910. opencompass-0.3.0/opencompass/configs/datasets/siqa/siqa_gen_18632c.py +42 -0
  911. opencompass-0.3.0/opencompass/configs/datasets/siqa/siqa_gen_e78df3.py +41 -0
  912. opencompass-0.3.0/opencompass/configs/datasets/siqa/siqa_ppl.py +4 -0
  913. opencompass-0.3.0/opencompass/configs/datasets/siqa/siqa_ppl_42bc6e.py +33 -0
  914. opencompass-0.3.0/opencompass/configs/datasets/siqa/siqa_ppl_7845b0.py +33 -0
  915. opencompass-0.3.0/opencompass/configs/datasets/siqa/siqa_ppl_ced5f6.py +45 -0
  916. opencompass-0.3.0/opencompass/configs/datasets/siqa/siqa_ppl_e8d8c5.py +45 -0
  917. opencompass-0.3.0/opencompass/configs/datasets/squad20/squad20_gen.py +4 -0
  918. opencompass-0.3.0/opencompass/configs/datasets/squad20/squad20_gen_1710bc.py +32 -0
  919. opencompass-0.3.0/opencompass/configs/datasets/storycloze/storycloze_gen.py +4 -0
  920. opencompass-0.3.0/opencompass/configs/datasets/storycloze/storycloze_gen_7f656a.py +46 -0
  921. opencompass-0.3.0/opencompass/configs/datasets/storycloze/storycloze_ppl.py +4 -0
  922. opencompass-0.3.0/opencompass/configs/datasets/storycloze/storycloze_ppl_496661.py +39 -0
  923. opencompass-0.3.0/opencompass/configs/datasets/storycloze/storycloze_ppl_afd16f.py +36 -0
  924. opencompass-0.3.0/opencompass/configs/datasets/strategyqa/strategyqa_gen.py +4 -0
  925. opencompass-0.3.0/opencompass/configs/datasets/strategyqa/strategyqa_gen_1180a7.py +94 -0
  926. opencompass-0.3.0/opencompass/configs/datasets/strategyqa/strategyqa_gen_934441.py +58 -0
  927. opencompass-0.3.0/opencompass/configs/datasets/subjective/alignbench/alignbench_judgeby_autoj.py +74 -0
  928. opencompass-0.3.0/opencompass/configs/datasets/subjective/alignbench/alignbench_judgeby_critiquellm.py +67 -0
  929. opencompass-0.3.0/opencompass/configs/datasets/subjective/alignbench/alignbench_judgeby_judgelm.py +62 -0
  930. opencompass-0.3.0/opencompass/configs/datasets/subjective/alignbench/alignbench_v1_1_judgeby_critiquellm.py +67 -0
  931. opencompass-0.3.0/opencompass/configs/datasets/subjective/alpaca_eval/alpacav1_judgeby_gpt4.py +97 -0
  932. opencompass-0.3.0/opencompass/configs/datasets/subjective/alpaca_eval/alpacav2_judgeby_gpt4.py +116 -0
  933. opencompass-0.3.0/opencompass/configs/datasets/subjective/arena_hard/README.md +40 -0
  934. opencompass-0.3.0/opencompass/configs/datasets/subjective/arena_hard/arena_hard_compare.py +81 -0
  935. opencompass-0.3.0/opencompass/configs/datasets/subjective/compassarena/compassarena_compare.py +154 -0
  936. opencompass-0.3.0/opencompass/configs/datasets/subjective/compassarena/compassarena_compare_creationv3.py +145 -0
  937. opencompass-0.3.0/opencompass/configs/datasets/subjective/compassarena/compassarena_compare_moe.py +156 -0
  938. opencompass-0.3.0/opencompass/configs/datasets/subjective/compassbench/compassbench_checklist.py +237 -0
  939. opencompass-0.3.0/opencompass/configs/datasets/subjective/compassbench/compassbench_compare.py +68 -0
  940. opencompass-0.3.0/opencompass/configs/datasets/subjective/compassbench/compassbench_compare_v11.py +65 -0
  941. opencompass-0.3.0/opencompass/configs/datasets/subjective/compassbench/compassbench_compare_v11_patch.py +69 -0
  942. opencompass-0.3.0/opencompass/configs/datasets/subjective/compassbench/compassbench_compare_v12.py +65 -0
  943. opencompass-0.3.0/opencompass/configs/datasets/subjective/creationbench/creationbench_judgeby_gpt4.py +60 -0
  944. opencompass-0.3.0/opencompass/configs/datasets/subjective/creationbench/creationbench_judgeby_gpt4_withref.py +60 -0
  945. opencompass-0.3.0/opencompass/configs/datasets/subjective/fofo/README.md +30 -0
  946. opencompass-0.3.0/opencompass/configs/datasets/subjective/fofo/fofo_judge.py +99 -0
  947. opencompass-0.3.0/opencompass/configs/datasets/subjective/multiround/functionalmt_zh_judgeby_gpt4.py +56 -0
  948. opencompass-0.3.0/opencompass/configs/datasets/subjective/multiround/mtbench101_judge.py +64 -0
  949. opencompass-0.3.0/opencompass/configs/datasets/subjective/multiround/mtbench_pair_judge.py +64 -0
  950. opencompass-0.3.0/opencompass/configs/datasets/subjective/multiround/mtbench_single_judge.py +62 -0
  951. opencompass-0.3.0/opencompass/configs/datasets/subjective/multiround/mtbench_single_judge_diff_temp.py +66 -0
  952. opencompass-0.3.0/opencompass/configs/datasets/subjective/subjective_cmp/subjective_cmp.py +61 -0
  953. opencompass-0.3.0/opencompass/configs/datasets/subjective/subjective_cmp/subjective_corev2.py +62 -0
  954. opencompass-0.3.0/opencompass/configs/datasets/subjective/subjective_cmp/subjective_creation.py +59 -0
  955. opencompass-0.3.0/opencompass/configs/datasets/subjective/wildbench/wildbench.md +34 -0
  956. opencompass-0.3.0/opencompass/configs/datasets/subjective/wildbench/wildbench_pair_judge.py +65 -0
  957. opencompass-0.3.0/opencompass/configs/datasets/subjective/wildbench/wildbench_single_judge.py +47 -0
  958. opencompass-0.3.0/opencompass/configs/datasets/summedits/summedits_gen.py +4 -0
  959. opencompass-0.3.0/opencompass/configs/datasets/summedits/summedits_gen_315438.py +51 -0
  960. opencompass-0.3.0/opencompass/configs/datasets/summedits/summedits_gen_4fb38b.py +38 -0
  961. opencompass-0.3.0/opencompass/configs/datasets/summedits/summedits_ppl.py +4 -0
  962. opencompass-0.3.0/opencompass/configs/datasets/summedits/summedits_ppl_1fbeb6.py +50 -0
  963. opencompass-0.3.0/opencompass/configs/datasets/summedits/summedits_ppl_3c30d0.py +58 -0
  964. opencompass-0.3.0/opencompass/configs/datasets/summedits/summedits_ppl_fa58ba.py +42 -0
  965. opencompass-0.3.0/opencompass/configs/datasets/summscreen/summscreen_gen.py +4 -0
  966. opencompass-0.3.0/opencompass/configs/datasets/summscreen/summscreen_gen_653185.py +48 -0
  967. opencompass-0.3.0/opencompass/configs/datasets/summscreen/summscreen_gen_aa5eb3.py +36 -0
  968. opencompass-0.3.0/opencompass/configs/datasets/taco/README.md +50 -0
  969. opencompass-0.3.0/opencompass/configs/datasets/taco/taco_gen.py +4 -0
  970. opencompass-0.3.0/opencompass/configs/datasets/taco/taco_gen_c7893a.py +28 -0
  971. opencompass-0.3.0/opencompass/configs/datasets/taco/taco_levels_gen_411572.py +36 -0
  972. opencompass-0.3.0/opencompass/configs/datasets/teval/README.md +22 -0
  973. opencompass-0.3.0/opencompass/configs/datasets/teval/teval_en_gen.py +4 -0
  974. opencompass-0.3.0/opencompass/configs/datasets/teval/teval_en_gen_1ac254.py +52 -0
  975. opencompass-0.3.0/opencompass/configs/datasets/teval/teval_zh_gen.py +4 -0
  976. opencompass-0.3.0/opencompass/configs/datasets/teval/teval_zh_gen_1ac254.py +52 -0
  977. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/README.md +69 -0
  978. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/triviaqa_gen.py +4 -0
  979. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/triviaqa_gen_0356ec.py +61 -0
  980. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/triviaqa_gen_2121ce.py +34 -0
  981. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/triviaqa_gen_3e39a5.py +33 -0
  982. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/triviaqa_gen_429db5.py +30 -0
  983. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/triviaqa_gen_d297bb.py +34 -0
  984. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/triviaqa_wiki_1shot_gen_20a989.py +46 -0
  985. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/triviaqa_wiki_1shot_gen_bc5f21.py +62 -0
  986. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/triviaqa_wiki_1shot_gen_eaf81e.py +62 -0
  987. opencompass-0.3.0/opencompass/configs/datasets/triviaqa/triviaqa_wiki_gen_d18bf4.py +62 -0
  988. opencompass-0.3.0/opencompass/configs/datasets/triviaqarc/triviaqarc_gen.py +4 -0
  989. opencompass-0.3.0/opencompass/configs/datasets/triviaqarc/triviaqarc_gen_a2d88a.py +30 -0
  990. opencompass-0.3.0/opencompass/configs/datasets/triviaqarc/triviaqarc_gen_db6413.py +37 -0
  991. opencompass-0.3.0/opencompass/configs/datasets/truthfulqa/truthfulqa_gen.py +4 -0
  992. opencompass-0.3.0/opencompass/configs/datasets/truthfulqa/truthfulqa_gen_1e7d8d.py +42 -0
  993. opencompass-0.3.0/opencompass/configs/datasets/truthfulqa/truthfulqa_gen_5ddc62.py +44 -0
  994. opencompass-0.3.0/opencompass/configs/datasets/tydiqa/tydiqa_gen.py +4 -0
  995. opencompass-0.3.0/opencompass/configs/datasets/tydiqa/tydiqa_gen_978d2a.py +61 -0
  996. opencompass-0.3.0/opencompass/configs/datasets/wikibench/wikibench_gen.py +4 -0
  997. opencompass-0.3.0/opencompass/configs/datasets/wikibench/wikibench_gen_f96ece.py +56 -0
  998. opencompass-0.3.0/opencompass/configs/datasets/wikitext/wikitext_103_raw_ppl.py +4 -0
  999. opencompass-0.3.0/opencompass/configs/datasets/wikitext/wikitext_103_raw_ppl_752e2a.py +39 -0
  1000. opencompass-0.3.0/opencompass/configs/datasets/wikitext/wikitext_2_raw_ppl.py +4 -0
  1001. opencompass-0.3.0/opencompass/configs/datasets/wikitext/wikitext_2_raw_ppl_752e2a.py +39 -0
  1002. opencompass-0.3.0/opencompass/configs/datasets/winograd/winograd_ppl.py +4 -0
  1003. opencompass-0.3.0/opencompass/configs/datasets/winograd/winograd_ppl_8f3049.py +37 -0
  1004. opencompass-0.3.0/opencompass/configs/datasets/winograd/winograd_ppl_b6c7ed.py +41 -0
  1005. opencompass-0.3.0/opencompass/configs/datasets/winogrande/README.md +69 -0
  1006. opencompass-0.3.0/opencompass/configs/datasets/winogrande/deprecated_winogrande_gen_a9ede5.py +43 -0
  1007. opencompass-0.3.0/opencompass/configs/datasets/winogrande/winogrande_5shot_gen_6447e6.py +46 -0
  1008. opencompass-0.3.0/opencompass/configs/datasets/winogrande/winogrande_5shot_gen_b36770.py +46 -0
  1009. opencompass-0.3.0/opencompass/configs/datasets/winogrande/winogrande_5shot_ll_252f01.py +38 -0
  1010. opencompass-0.3.0/opencompass/configs/datasets/winogrande/winogrande_gen.py +4 -0
  1011. opencompass-0.3.0/opencompass/configs/datasets/winogrande/winogrande_gen_458220.py +41 -0
  1012. opencompass-0.3.0/opencompass/configs/datasets/winogrande/winogrande_gen_a027b6.py +49 -0
  1013. opencompass-0.3.0/opencompass/configs/datasets/winogrande/winogrande_ll.py +4 -0
  1014. opencompass-0.3.0/opencompass/configs/datasets/winogrande/winogrande_ll_c5cf57.py +33 -0
  1015. opencompass-0.3.0/opencompass/configs/datasets/winogrande/winogrande_ppl_55a66e.py +38 -0
  1016. opencompass-0.3.0/opencompass/configs/datasets/winogrande/winogrande_ppl_9307fd.py +36 -0
  1017. opencompass-0.3.0/opencompass/configs/datasets/xiezhi/xiezhi_gen.py +4 -0
  1018. opencompass-0.3.0/opencompass/configs/datasets/xiezhi/xiezhi_gen_b86cf5.py +50 -0
  1019. opencompass-0.3.0/opencompass/configs/datasets/xiezhi/xiezhi_ppl.py +4 -0
  1020. opencompass-0.3.0/opencompass/configs/datasets/xiezhi/xiezhi_ppl_ea6bd7.py +49 -0
  1021. opencompass-0.3.0/opencompass/configs/models/accessory/accessory_llama2_7b.py +34 -0
  1022. opencompass-0.3.0/opencompass/configs/models/accessory/accessory_mixtral_8x7b.py +31 -0
  1023. opencompass-0.3.0/opencompass/configs/models/accessory/accessory_sphinx_v2_1k.py +29 -0
  1024. opencompass-0.3.0/opencompass/configs/models/alaya/alaya.py +19 -0
  1025. opencompass-0.3.0/opencompass/configs/models/aquila/hf_aquila2_34b.py +12 -0
  1026. opencompass-0.3.0/opencompass/configs/models/aquila/hf_aquila2_7b.py +12 -0
  1027. opencompass-0.3.0/opencompass/configs/models/aquila/hf_aquilachat2_34b.py +32 -0
  1028. opencompass-0.3.0/opencompass/configs/models/aquila/hf_aquilachat2_34b_16k.py +33 -0
  1029. opencompass-0.3.0/opencompass/configs/models/aquila/hf_aquilachat2_7b.py +32 -0
  1030. opencompass-0.3.0/opencompass/configs/models/aquila/hf_aquilachat2_7b_16k.py +33 -0
  1031. opencompass-0.3.0/opencompass/configs/models/baichuan/hf_baichuan2_13b_base.py +12 -0
  1032. opencompass-0.3.0/opencompass/configs/models/baichuan/hf_baichuan2_13b_chat.py +29 -0
  1033. opencompass-0.3.0/opencompass/configs/models/baichuan/hf_baichuan2_7b_base.py +12 -0
  1034. opencompass-0.3.0/opencompass/configs/models/baichuan/hf_baichuan2_7b_chat.py +29 -0
  1035. opencompass-0.3.0/opencompass/configs/models/baichuan/hf_baichuan_13b_base.py +20 -0
  1036. opencompass-0.3.0/opencompass/configs/models/baichuan/hf_baichuan_13b_chat.py +20 -0
  1037. opencompass-0.3.0/opencompass/configs/models/baichuan/hf_baichuan_7b.py +20 -0
  1038. opencompass-0.3.0/opencompass/configs/models/bluelm/hf_bluelm_7b_base.py +12 -0
  1039. opencompass-0.3.0/opencompass/configs/models/bluelm/hf_bluelm_7b_base_32k.py +12 -0
  1040. opencompass-0.3.0/opencompass/configs/models/bluelm/hf_bluelm_7b_chat.py +32 -0
  1041. opencompass-0.3.0/opencompass/configs/models/bluelm/hf_bluelm_7b_chat_32k.py +32 -0
  1042. opencompass-0.3.0/opencompass/configs/models/chatglm/hf_chatglm2_6b.py +31 -0
  1043. opencompass-0.3.0/opencompass/configs/models/chatglm/hf_chatglm3_6b.py +12 -0
  1044. opencompass-0.3.0/opencompass/configs/models/chatglm/hf_chatglm3_6b_32k.py +12 -0
  1045. opencompass-0.3.0/opencompass/configs/models/chatglm/hf_chatglm3_6b_base.py +12 -0
  1046. opencompass-0.3.0/opencompass/configs/models/chatglm/hf_chatglm_6b.py +24 -0
  1047. opencompass-0.3.0/opencompass/configs/models/chatglm/hf_glm4_9b_chat.py +13 -0
  1048. opencompass-0.3.0/opencompass/configs/models/chatglm/vllm_chatglm3_6b.py +13 -0
  1049. opencompass-0.3.0/opencompass/configs/models/chatglm/vllm_chatglm3_6b_32k.py +14 -0
  1050. opencompass-0.3.0/opencompass/configs/models/chatglm/vllm_glm4_9b_chat.py +14 -0
  1051. opencompass-0.3.0/opencompass/configs/models/claude/claude.py +68 -0
  1052. opencompass-0.3.0/opencompass/configs/models/claude/claude2.py +63 -0
  1053. opencompass-0.3.0/opencompass/configs/models/codegeex2/hf_codegeex2_6b.py +25 -0
  1054. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_13b.py +12 -0
  1055. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_13b_instruct.py +12 -0
  1056. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_13b_python.py +12 -0
  1057. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_34b.py +12 -0
  1058. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_34b_instruct.py +12 -0
  1059. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_34b_python.py +12 -0
  1060. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_70b.py +12 -0
  1061. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_70b_instruct.py +12 -0
  1062. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_70b_python.py +12 -0
  1063. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_7b.py +12 -0
  1064. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_7b_instruct.py +12 -0
  1065. opencompass-0.3.0/opencompass/configs/models/codellama/hf_codellama_7b_python.py +12 -0
  1066. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_67b_base.py +12 -0
  1067. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_67b_chat.py +12 -0
  1068. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_7b_base.py +12 -0
  1069. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_7b_chat.py +12 -0
  1070. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_coder_1_3b_instruct.py +12 -0
  1071. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_coder_33b_instruct.py +12 -0
  1072. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_coder_6_7b_instruct.py +12 -0
  1073. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_moe_16b_base.py +12 -0
  1074. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_moe_16b_chat.py +12 -0
  1075. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_v2.py +18 -0
  1076. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_v2_chat.py +18 -0
  1077. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_v2_lite.py +17 -0
  1078. opencompass-0.3.0/opencompass/configs/models/deepseek/hf_deepseek_v2_lite_chat.py +17 -0
  1079. opencompass-0.3.0/opencompass/configs/models/deepseek/lmdeploy_deepseek_67b_base.py +15 -0
  1080. opencompass-0.3.0/opencompass/configs/models/deepseek/lmdeploy_deepseek_67b_chat.py +15 -0
  1081. opencompass-0.3.0/opencompass/configs/models/deepseek/lmdeploy_deepseek_7b_base.py +15 -0
  1082. opencompass-0.3.0/opencompass/configs/models/deepseek/lmdeploy_deepseek_7b_chat.py +15 -0
  1083. opencompass-0.3.0/opencompass/configs/models/deepseek/lmdeploy_deepseek_series.py +23 -0
  1084. opencompass-0.3.0/opencompass/configs/models/deepseek/vllm_deepseek_67b_chat.py +13 -0
  1085. opencompass-0.3.0/opencompass/configs/models/deepseek/vllm_deepseek_7b_chat.py +13 -0
  1086. opencompass-0.3.0/opencompass/configs/models/deepseek/vllm_deepseek_moe_16b_base.py +15 -0
  1087. opencompass-0.3.0/opencompass/configs/models/deepseek/vllm_deepseek_moe_16b_chat.py +13 -0
  1088. opencompass-0.3.0/opencompass/configs/models/falcon/hf_falcon_40b.py +12 -0
  1089. opencompass-0.3.0/opencompass/configs/models/falcon/hf_falcon_7b.py +12 -0
  1090. opencompass-0.3.0/opencompass/configs/models/gemini/gemini_1_5_flash.py +22 -0
  1091. opencompass-0.3.0/opencompass/configs/models/gemini/gemini_1_5_pro.py +22 -0
  1092. opencompass-0.3.0/opencompass/configs/models/gemini/gemini_pro.py +22 -0
  1093. opencompass-0.3.0/opencompass/configs/models/gemma/hf_gemma2_27b.py +15 -0
  1094. opencompass-0.3.0/opencompass/configs/models/gemma/hf_gemma2_27b_it.py +16 -0
  1095. opencompass-0.3.0/opencompass/configs/models/gemma/hf_gemma2_2b.py +15 -0
  1096. opencompass-0.3.0/opencompass/configs/models/gemma/hf_gemma2_2b_it.py +16 -0
  1097. opencompass-0.3.0/opencompass/configs/models/gemma/hf_gemma2_9b.py +15 -0
  1098. opencompass-0.3.0/opencompass/configs/models/gemma/hf_gemma2_9b_it.py +16 -0
  1099. opencompass-0.3.0/opencompass/configs/models/gemma/hf_gemma_2b.py +12 -0
  1100. opencompass-0.3.0/opencompass/configs/models/gemma/hf_gemma_2b_it.py +12 -0
  1101. opencompass-0.3.0/opencompass/configs/models/gemma/hf_gemma_7b.py +12 -0
  1102. opencompass-0.3.0/opencompass/configs/models/gemma/hf_gemma_7b_it.py +12 -0
  1103. opencompass-0.3.0/opencompass/configs/models/gemma/vllm_gemma_2b.py +15 -0
  1104. opencompass-0.3.0/opencompass/configs/models/gemma/vllm_gemma_2b_it.py +14 -0
  1105. opencompass-0.3.0/opencompass/configs/models/gemma/vllm_gemma_7b.py +15 -0
  1106. opencompass-0.3.0/opencompass/configs/models/gemma/vllm_gemma_7b_it.py +14 -0
  1107. opencompass-0.3.0/opencompass/configs/models/hf_internlm/README.md +124 -0
  1108. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_1_8b.py +12 -0
  1109. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_20b.py +13 -0
  1110. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_5_1_8b_chat.py +12 -0
  1111. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_5_20b_chat.py +12 -0
  1112. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_5_7b.py +13 -0
  1113. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_5_7b_chat.py +12 -0
  1114. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_7b.py +13 -0
  1115. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_base_20b.py +13 -0
  1116. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_base_7b.py +13 -0
  1117. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_1_8b.py +12 -0
  1118. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_1_8b_sft.py +12 -0
  1119. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_20b.py +12 -0
  1120. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_20b_sft.py +12 -0
  1121. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_20b_with_system.py +37 -0
  1122. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_7b.py +12 -0
  1123. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_7b_sft.py +12 -0
  1124. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_7b_with_system.py +37 -0
  1125. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_math_20b.py +13 -0
  1126. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_math_20b_with_system.py +35 -0
  1127. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_math_7b.py +13 -0
  1128. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_chat_math_7b_with_system.py +35 -0
  1129. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_math_20b.py +13 -0
  1130. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm2_math_7b.py +13 -0
  1131. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm_20b.py +13 -0
  1132. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm_7b.py +13 -0
  1133. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm_chat_20b.py +34 -0
  1134. opencompass-0.3.0/opencompass/configs/models/hf_internlm/hf_internlm_chat_7b.py +34 -0
  1135. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_1_8b.py +15 -0
  1136. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_20b.py +15 -0
  1137. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_5_1_8b_chat.py +15 -0
  1138. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_5_20b_chat.py +15 -0
  1139. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_5_7b.py +15 -0
  1140. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_5_7b_chat.py +15 -0
  1141. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_5_7b_chat_1m.py +15 -0
  1142. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_7b.py +15 -0
  1143. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_base_20b.py +15 -0
  1144. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_base_7b.py +15 -0
  1145. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_chat_1_8b.py +15 -0
  1146. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_chat_1_8b_sft.py +15 -0
  1147. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_chat_20b.py +15 -0
  1148. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_chat_20b_sft.py +15 -0
  1149. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_chat_7b.py +15 -0
  1150. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_chat_7b_sft.py +15 -0
  1151. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm2_series.py +26 -0
  1152. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm_20b.py +15 -0
  1153. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm_7b.py +15 -0
  1154. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm_chat_20b.py +15 -0
  1155. opencompass-0.3.0/opencompass/configs/models/hf_internlm/lmdeploy_internlm_chat_7b.py +15 -0
  1156. opencompass-0.3.0/opencompass/configs/models/hf_internlm/vllm_internlm2_chat_1_8b.py +13 -0
  1157. opencompass-0.3.0/opencompass/configs/models/hf_internlm/vllm_internlm2_chat_1_8b_sft.py +13 -0
  1158. opencompass-0.3.0/opencompass/configs/models/hf_internlm/vllm_internlm2_chat_20b.py +13 -0
  1159. opencompass-0.3.0/opencompass/configs/models/hf_internlm/vllm_internlm2_chat_20b_sft.py +13 -0
  1160. opencompass-0.3.0/opencompass/configs/models/hf_internlm/vllm_internlm2_chat_7b.py +13 -0
  1161. opencompass-0.3.0/opencompass/configs/models/hf_internlm/vllm_internlm2_chat_7b_sft.py +13 -0
  1162. opencompass-0.3.0/opencompass/configs/models/hf_internlm/vllm_internlm2_series.py +25 -0
  1163. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama2_13b.py +12 -0
  1164. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama2_13b_chat.py +12 -0
  1165. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama2_70b.py +12 -0
  1166. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama2_70b_chat.py +12 -0
  1167. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama2_7b.py +12 -0
  1168. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama2_7b_chat.py +12 -0
  1169. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama3_70b.py +12 -0
  1170. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama3_70b_instruct.py +13 -0
  1171. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama3_8b.py +12 -0
  1172. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama3_8b_instruct.py +13 -0
  1173. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama_13b.py +12 -0
  1174. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama_30b.py +12 -0
  1175. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama_65b.py +12 -0
  1176. opencompass-0.3.0/opencompass/configs/models/hf_llama/hf_llama_7b.py +12 -0
  1177. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama2_13b.py +15 -0
  1178. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama2_13b_chat.py +15 -0
  1179. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama2_70b.py +15 -0
  1180. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama2_70b_chat.py +15 -0
  1181. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama2_7b.py +15 -0
  1182. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama2_7b_chat.py +15 -0
  1183. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama3_70b.py +15 -0
  1184. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama3_70b_instruct.py +16 -0
  1185. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama3_8b.py +15 -0
  1186. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama3_8b_instruct.py +16 -0
  1187. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama_13b.py +15 -0
  1188. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama_30b.py +15 -0
  1189. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama_65b.py +15 -0
  1190. opencompass-0.3.0/opencompass/configs/models/hf_llama/lmdeploy_llama_7b.py +15 -0
  1191. opencompass-0.3.0/opencompass/configs/models/hf_llama/vllm_llama_series.py +29 -0
  1192. opencompass-0.3.0/opencompass/configs/models/internlm/internlm_7b.py +14 -0
  1193. opencompass-0.3.0/opencompass/configs/models/judge_llm/auto_j/hf_autoj_bilingual_6b.py +24 -0
  1194. opencompass-0.3.0/opencompass/configs/models/judge_llm/auto_j/hf_autoj_eng_13b.py +18 -0
  1195. opencompass-0.3.0/opencompass/configs/models/judge_llm/auto_j/hf_autoj_eng_13b_4bit.py +23 -0
  1196. opencompass-0.3.0/opencompass/configs/models/judge_llm/auto_j/hf_autoj_scen_classifier.py +18 -0
  1197. opencompass-0.3.0/opencompass/configs/models/judge_llm/judgelm/hf_judgelm_13b_v1.py +18 -0
  1198. opencompass-0.3.0/opencompass/configs/models/judge_llm/judgelm/hf_judgelm_33b_v1.py +18 -0
  1199. opencompass-0.3.0/opencompass/configs/models/judge_llm/judgelm/hf_judgelm_7b_v1.py +18 -0
  1200. opencompass-0.3.0/opencompass/configs/models/judge_llm/pandalm/hf_alpaca_pandalm_7b_v1.py +18 -0
  1201. opencompass-0.3.0/opencompass/configs/models/judge_llm/pandalm/hf_pandalm_7b_v1.py +18 -0
  1202. opencompass-0.3.0/opencompass/configs/models/lemur/lemur_70b_chat.py +30 -0
  1203. opencompass-0.3.0/opencompass/configs/models/lingowhale/hf_lingowhale_8b.py +25 -0
  1204. opencompass-0.3.0/opencompass/configs/models/mistral/hf_mistral_7b_instruct_v0_1.py +12 -0
  1205. opencompass-0.3.0/opencompass/configs/models/mistral/hf_mistral_7b_instruct_v0_2.py +12 -0
  1206. opencompass-0.3.0/opencompass/configs/models/mistral/hf_mistral_7b_instruct_v0_3.py +12 -0
  1207. opencompass-0.3.0/opencompass/configs/models/mistral/hf_mistral_7b_v0_1.py +13 -0
  1208. opencompass-0.3.0/opencompass/configs/models/mistral/hf_mistral_7b_v0_2.py +13 -0
  1209. opencompass-0.3.0/opencompass/configs/models/mistral/hf_mistral_7b_v0_3.py +13 -0
  1210. opencompass-0.3.0/opencompass/configs/models/mistral/hf_mixtral_8x22b_instruct_v0_1.py +12 -0
  1211. opencompass-0.3.0/opencompass/configs/models/mistral/hf_mixtral_8x22b_v0_1.py +12 -0
  1212. opencompass-0.3.0/opencompass/configs/models/mistral/hf_mixtral_8x7b_instruct_v0_1.py +12 -0
  1213. opencompass-0.3.0/opencompass/configs/models/mistral/hf_mixtral_8x7b_v0_1.py +12 -0
  1214. opencompass-0.3.0/opencompass/configs/models/mistral/mixtral_8x7b_32k.py +19 -0
  1215. opencompass-0.3.0/opencompass/configs/models/mistral/vllm_mistral_7b_instruct_v0_1.py +15 -0
  1216. opencompass-0.3.0/opencompass/configs/models/mistral/vllm_mistral_7b_instruct_v0_2.py +15 -0
  1217. opencompass-0.3.0/opencompass/configs/models/mistral/vllm_mistral_7b_v0_1.py +15 -0
  1218. opencompass-0.3.0/opencompass/configs/models/mistral/vllm_mistral_7b_v0_2.py +15 -0
  1219. opencompass-0.3.0/opencompass/configs/models/mistral/vllm_mixtral_8x22b_instruct_v0_1.py +15 -0
  1220. opencompass-0.3.0/opencompass/configs/models/mistral/vllm_mixtral_8x22b_v0_1.py +15 -0
  1221. opencompass-0.3.0/opencompass/configs/models/mistral/vllm_mixtral_8x7b_instruct_v0_1.py +15 -0
  1222. opencompass-0.3.0/opencompass/configs/models/mistral/vllm_mixtral_8x7b_v0_1.py +15 -0
  1223. opencompass-0.3.0/opencompass/configs/models/moss/hf_moss_moon_003_base.py +21 -0
  1224. opencompass-0.3.0/opencompass/configs/models/moss/hf_moss_moon_003_sft.py +28 -0
  1225. opencompass-0.3.0/opencompass/configs/models/mpt/hf_mpt_7b.py +27 -0
  1226. opencompass-0.3.0/opencompass/configs/models/mpt/hf_mpt_instruct_7b.py +27 -0
  1227. opencompass-0.3.0/opencompass/configs/models/ms_internlm/ms_internlm_chat_7b_8k.py +30 -0
  1228. opencompass-0.3.0/opencompass/configs/models/nanbeige/hf_nanbeige2_16b_chat.py +12 -0
  1229. opencompass-0.3.0/opencompass/configs/models/nanbeige/hf_nanbeige2_8b_chat.py +12 -0
  1230. opencompass-0.3.0/opencompass/configs/models/nanbeige/hf_nanbeige_16b_chat.py +35 -0
  1231. opencompass-0.3.0/opencompass/configs/models/openai/gpt_3_5_turbo.py +18 -0
  1232. opencompass-0.3.0/opencompass/configs/models/openai/gpt_3_5_turbo_0125.py +20 -0
  1233. opencompass-0.3.0/opencompass/configs/models/openai/gpt_4.py +18 -0
  1234. opencompass-0.3.0/opencompass/configs/models/openai/gpt_4o_2024_05_13.py +20 -0
  1235. opencompass-0.3.0/opencompass/configs/models/openbmb/hf_minicpm_2b_dpo_fp32.py +12 -0
  1236. opencompass-0.3.0/opencompass/configs/models/openbmb/hf_minicpm_2b_sft_bf16.py +12 -0
  1237. opencompass-0.3.0/opencompass/configs/models/openbmb/hf_minicpm_2b_sft_fp32.py +12 -0
  1238. opencompass-0.3.0/opencompass/configs/models/opt/hf_opt_125m.py +12 -0
  1239. opencompass-0.3.0/opencompass/configs/models/opt/hf_opt_350m.py +12 -0
  1240. opencompass-0.3.0/opencompass/configs/models/others/hf_abel_7b_001.py +31 -0
  1241. opencompass-0.3.0/opencompass/configs/models/others/hf_abel_7b_002.py +31 -0
  1242. opencompass-0.3.0/opencompass/configs/models/others/hf_arithmo_mistral_7b.py +33 -0
  1243. opencompass-0.3.0/opencompass/configs/models/others/hf_command_r_plus.py +12 -0
  1244. opencompass-0.3.0/opencompass/configs/models/others/hf_dbrx_base.py +12 -0
  1245. opencompass-0.3.0/opencompass/configs/models/others/hf_dbrx_instruct.py +12 -0
  1246. opencompass-0.3.0/opencompass/configs/models/others/hf_dolphin_21_mistral_7b.py +32 -0
  1247. opencompass-0.3.0/opencompass/configs/models/others/hf_fashiongpt_70b_v11.py +32 -0
  1248. opencompass-0.3.0/opencompass/configs/models/others/hf_gsm8k_rft_llama7b2_u13b.py +33 -0
  1249. opencompass-0.3.0/opencompass/configs/models/others/hf_metamath_7b_v1_0.py +33 -0
  1250. opencompass-0.3.0/opencompass/configs/models/others/hf_metamath_llemma_7b.py +33 -0
  1251. opencompass-0.3.0/opencompass/configs/models/others/hf_metamath_mistral_7b.py +33 -0
  1252. opencompass-0.3.0/opencompass/configs/models/others/hf_openchat_35_0106.py +33 -0
  1253. opencompass-0.3.0/opencompass/configs/models/others/hf_openchat_35_1210.py +33 -0
  1254. opencompass-0.3.0/opencompass/configs/models/others/hf_orionstar_14b_base.py +24 -0
  1255. opencompass-0.3.0/opencompass/configs/models/others/hf_orionstar_yi_34b_chat.py +34 -0
  1256. opencompass-0.3.0/opencompass/configs/models/others/hf_phi_2.py +24 -0
  1257. opencompass-0.3.0/opencompass/configs/models/others/hf_telechat_12b_v2.py +26 -0
  1258. opencompass-0.3.0/opencompass/configs/models/others/hf_telechat_52b.py +26 -0
  1259. opencompass-0.3.0/opencompass/configs/models/others/hf_telechat_7b.py +25 -0
  1260. opencompass-0.3.0/opencompass/configs/models/others/hf_yayi2_30b_base.py +25 -0
  1261. opencompass-0.3.0/opencompass/configs/models/others/vllm_dbrx_instruct.py +14 -0
  1262. opencompass-0.3.0/opencompass/configs/models/others/vllm_orionstar_14b_longchat.py +26 -0
  1263. opencompass-0.3.0/opencompass/configs/models/phi/hf_phi_3_medium_4k_instruct.py +12 -0
  1264. opencompass-0.3.0/opencompass/configs/models/phi/hf_phi_3_mini_4k_instruct.py +12 -0
  1265. opencompass-0.3.0/opencompass/configs/models/phi/hf_phi_3_small_8k_instruct.py +12 -0
  1266. opencompass-0.3.0/opencompass/configs/models/pulse/hf_pulse_7b.py +23 -0
  1267. opencompass-0.3.0/opencompass/configs/models/qwen/README.md +142 -0
  1268. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_0_5b.py +12 -0
  1269. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_0_5b_chat.py +13 -0
  1270. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_110b.py +12 -0
  1271. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_110b_chat.py +13 -0
  1272. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_14b.py +12 -0
  1273. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_14b_chat.py +13 -0
  1274. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_1_8b.py +12 -0
  1275. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_1_8b_chat.py +13 -0
  1276. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_32b.py +12 -0
  1277. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_32b_chat.py +13 -0
  1278. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_4b.py +12 -0
  1279. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_4b_chat.py +13 -0
  1280. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_72b.py +12 -0
  1281. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_72b_chat.py +13 -0
  1282. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_7b.py +12 -0
  1283. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_7b_chat.py +13 -0
  1284. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_moe_a2_7b.py +12 -0
  1285. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen1_5_moe_a2_7b_chat.py +13 -0
  1286. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen2_0_5b.py +12 -0
  1287. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen2_0_5b_instruct.py +12 -0
  1288. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen2_1_5b.py +12 -0
  1289. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen2_1_5b_instruct.py +12 -0
  1290. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen2_57b_a14b.py +12 -0
  1291. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen2_72b.py +12 -0
  1292. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen2_7b.py +12 -0
  1293. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen2_7b_instruct.py +12 -0
  1294. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen_14b.py +12 -0
  1295. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen_14b_chat.py +13 -0
  1296. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen_1_8b.py +12 -0
  1297. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen_1_8b_chat.py +13 -0
  1298. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen_72b.py +12 -0
  1299. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen_72b_chat.py +13 -0
  1300. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen_7b.py +12 -0
  1301. opencompass-0.3.0/opencompass/configs/models/qwen/hf_qwen_7b_chat.py +13 -0
  1302. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_110b.py +15 -0
  1303. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_110b_chat.py +16 -0
  1304. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_14b.py +15 -0
  1305. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_14b_chat.py +16 -0
  1306. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_1_8b.py +15 -0
  1307. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_1_8b_chat.py +16 -0
  1308. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_32b.py +15 -0
  1309. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_32b_chat.py +16 -0
  1310. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_4b.py +15 -0
  1311. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_4b_chat.py +16 -0
  1312. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_72b.py +15 -0
  1313. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_72b_chat.py +16 -0
  1314. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_7b.py +15 -0
  1315. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_7b_chat.py +16 -0
  1316. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen1_5_series.py +30 -0
  1317. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen2_1_5b.py +15 -0
  1318. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen2_1_5b_instruct.py +15 -0
  1319. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen2_72b.py +15 -0
  1320. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen2_72b_instruct.py +15 -0
  1321. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen2_7b.py +15 -0
  1322. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen2_7b_instruct.py +15 -0
  1323. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen2_series.py +26 -0
  1324. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen_14b.py +15 -0
  1325. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen_14b_chat.py +16 -0
  1326. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen_1_8b.py +15 -0
  1327. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen_1_8b_chat.py +16 -0
  1328. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen_72b.py +15 -0
  1329. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen_72b_chat.py +16 -0
  1330. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen_7b.py +15 -0
  1331. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen_7b_chat.py +16 -0
  1332. opencompass-0.3.0/opencompass/configs/models/qwen/lmdeploy_qwen_series.py +26 -0
  1333. opencompass-0.3.0/opencompass/configs/models/qwen/ms_qwen_7b_chat.py +30 -0
  1334. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_0_5b.py +15 -0
  1335. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_0_5b_chat.py +14 -0
  1336. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_110b.py +15 -0
  1337. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_110b_chat.py +14 -0
  1338. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_14b.py +15 -0
  1339. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_14b_chat.py +14 -0
  1340. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_1_8b.py +15 -0
  1341. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_1_8b_chat.py +14 -0
  1342. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_32b.py +15 -0
  1343. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_32b_chat.py +14 -0
  1344. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_4b.py +15 -0
  1345. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_4b_chat.py +14 -0
  1346. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_72b.py +15 -0
  1347. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_72b_chat.py +14 -0
  1348. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_7b.py +15 -0
  1349. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_7b_chat.py +14 -0
  1350. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_moe_a2_7b.py +15 -0
  1351. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_moe_a2_7b_chat.py +14 -0
  1352. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen1_5_series.py +29 -0
  1353. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen2_0_5b.py +15 -0
  1354. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen2_0_5b_instruct.py +14 -0
  1355. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen2_1_5b.py +15 -0
  1356. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen2_1_5b_instruct.py +14 -0
  1357. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen2_57b_a14b_instruct.py +14 -0
  1358. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen2_72b.py +15 -0
  1359. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen2_72b_instruct.py +14 -0
  1360. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen2_7b.py +15 -0
  1361. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen2_7b_instruct.py +14 -0
  1362. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen2_series.py +25 -0
  1363. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen_14b.py +15 -0
  1364. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen_14b_chat.py +14 -0
  1365. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen_1_8b.py +15 -0
  1366. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen_1_8b_chat.py +14 -0
  1367. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen_72b.py +15 -0
  1368. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen_72b_chat.py +14 -0
  1369. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen_7b.py +15 -0
  1370. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen_7b_chat.py +14 -0
  1371. opencompass-0.3.0/opencompass/configs/models/qwen/vllm_qwen_series.py +24 -0
  1372. opencompass-0.3.0/opencompass/configs/models/rwkv/rwkv5_3b.py +25 -0
  1373. opencompass-0.3.0/opencompass/configs/models/skywork/hf_skywork_13b.py +12 -0
  1374. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_13b_base_v1.py +21 -0
  1375. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_13b_base_v2.py +21 -0
  1376. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_13b_chat_v1.py +29 -0
  1377. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_13b_chat_v2.py +29 -0
  1378. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_70b_base.py +24 -0
  1379. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_70b_chat_v2.py +29 -0
  1380. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_70b_chat_v3.py +32 -0
  1381. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_7b_base.py +21 -0
  1382. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_7b_base_v3.py +21 -0
  1383. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_7b_chat_v3.py +29 -0
  1384. opencompass-0.3.0/opencompass/configs/models/tigerbot/hf_tigerbot_7b_sft.py +29 -0
  1385. opencompass-0.3.0/opencompass/configs/models/vicuna/hf_vicuna_13b_v13.py +13 -0
  1386. opencompass-0.3.0/opencompass/configs/models/vicuna/hf_vicuna_13b_v15.py +13 -0
  1387. opencompass-0.3.0/opencompass/configs/models/vicuna/hf_vicuna_13b_v15_16k.py +13 -0
  1388. opencompass-0.3.0/opencompass/configs/models/vicuna/hf_vicuna_33b_v13.py +13 -0
  1389. opencompass-0.3.0/opencompass/configs/models/vicuna/hf_vicuna_7b_v13.py +13 -0
  1390. opencompass-0.3.0/opencompass/configs/models/vicuna/hf_vicuna_7b_v15.py +13 -0
  1391. opencompass-0.3.0/opencompass/configs/models/vicuna/hf_vicuna_7b_v15_16k.py +13 -0
  1392. opencompass-0.3.0/opencompass/configs/models/vicuna/vllm_vicuna_13b_v15_16k.py +23 -0
  1393. opencompass-0.3.0/opencompass/configs/models/vicuna/vllm_vicuna_7b_v15_16k.py +23 -0
  1394. opencompass-0.3.0/opencompass/configs/models/wizardcoder/hf_wizardcoder_15b.py +21 -0
  1395. opencompass-0.3.0/opencompass/configs/models/wizardcoder/hf_wizardcoder_1b.py +21 -0
  1396. opencompass-0.3.0/opencompass/configs/models/wizardcoder/hf_wizardcoder_3b.py +21 -0
  1397. opencompass-0.3.0/opencompass/configs/models/wizardcoder/hf_wizardcoder_python_13b.py +21 -0
  1398. opencompass-0.3.0/opencompass/configs/models/wizardcoder/hf_wizardcoder_python_34b.py +21 -0
  1399. opencompass-0.3.0/opencompass/configs/models/wizardlm/hf_wizardlm_13b_v1_2.py +33 -0
  1400. opencompass-0.3.0/opencompass/configs/models/wizardlm/hf_wizardlm_70b_v1_0.py +33 -0
  1401. opencompass-0.3.0/opencompass/configs/models/wizardlm/hf_wizardlm_7b_v1_0.py +33 -0
  1402. opencompass-0.3.0/opencompass/configs/models/wizardlm/hf_wizardmath_7b_v1_0.py +33 -0
  1403. opencompass-0.3.0/opencompass/configs/models/wizardlm/hf_wizardmath_7b_v1_1.py +33 -0
  1404. opencompass-0.3.0/opencompass/configs/models/wizardlm/vllm_wizardlm_13b_v1_2.py +24 -0
  1405. opencompass-0.3.0/opencompass/configs/models/wizardlm/vllm_wizardlm_70b_v1_0.py +25 -0
  1406. opencompass-0.3.0/opencompass/configs/models/wizardlm/vllm_wizardlm_7b_v1_0.py +24 -0
  1407. opencompass-0.3.0/opencompass/configs/models/yi/hf_yi_1_5_34b.py +12 -0
  1408. opencompass-0.3.0/opencompass/configs/models/yi/hf_yi_1_5_34b_chat.py +12 -0
  1409. opencompass-0.3.0/opencompass/configs/models/yi/hf_yi_1_5_6b.py +12 -0
  1410. opencompass-0.3.0/opencompass/configs/models/yi/hf_yi_1_5_6b_chat.py +12 -0
  1411. opencompass-0.3.0/opencompass/configs/models/yi/hf_yi_1_5_9b.py +12 -0
  1412. opencompass-0.3.0/opencompass/configs/models/yi/hf_yi_1_5_9b_chat.py +12 -0
  1413. opencompass-0.3.0/opencompass/configs/models/yi/hf_yi_34b.py +12 -0
  1414. opencompass-0.3.0/opencompass/configs/models/yi/hf_yi_34b_chat.py +13 -0
  1415. opencompass-0.3.0/opencompass/configs/models/yi/hf_yi_6b.py +12 -0
  1416. opencompass-0.3.0/opencompass/configs/models/yi/hf_yi_6b_chat.py +13 -0
  1417. opencompass-0.3.0/opencompass/configs/models/yi/lmdeploy_yi_series.py +23 -0
  1418. opencompass-0.3.0/opencompass/configs/models/zephyr/hf_zephyr_7b_beta.py +12 -0
  1419. opencompass-0.3.0/opencompass/configs/models/zephyr/vllm_zephyr_7b_beta.py +23 -0
  1420. opencompass-0.3.0/opencompass/configs/summarizers/agent_bench.py +32 -0
  1421. opencompass-0.3.0/opencompass/configs/summarizers/charm_reason.py +98 -0
  1422. opencompass-0.3.0/opencompass/configs/summarizers/chat_OC15.py +81 -0
  1423. opencompass-0.3.0/opencompass/configs/summarizers/chat_OC15_multi_faceted.py +148 -0
  1424. opencompass-0.3.0/opencompass/configs/summarizers/cibench.py +62 -0
  1425. opencompass-0.3.0/opencompass/configs/summarizers/code_passk.py +43 -0
  1426. opencompass-0.3.0/opencompass/configs/summarizers/compassbench_v1_1_objective.py +244 -0
  1427. opencompass-0.3.0/opencompass/configs/summarizers/compassbench_v1_1_objective_public.py +22 -0
  1428. opencompass-0.3.0/opencompass/configs/summarizers/compassbench_v1_objective.py +227 -0
  1429. opencompass-0.3.0/opencompass/configs/summarizers/contamination.py +205 -0
  1430. opencompass-0.3.0/opencompass/configs/summarizers/example.py +19 -0
  1431. opencompass-0.3.0/opencompass/configs/summarizers/groups/GaokaoBench.py +6 -0
  1432. opencompass-0.3.0/opencompass/configs/summarizers/groups/MMLUArabic.py +50 -0
  1433. opencompass-0.3.0/opencompass/configs/summarizers/groups/agieval.py +17 -0
  1434. opencompass-0.3.0/opencompass/configs/summarizers/groups/bbh.py +6 -0
  1435. opencompass-0.3.0/opencompass/configs/summarizers/groups/calm.py +169 -0
  1436. opencompass-0.3.0/opencompass/configs/summarizers/groups/ceval.py +47 -0
  1437. opencompass-0.3.0/opencompass/configs/summarizers/groups/charm_reason.py +35 -0
  1438. opencompass-0.3.0/opencompass/configs/summarizers/groups/cibench.py +395 -0
  1439. opencompass-0.3.0/opencompass/configs/summarizers/groups/cmmlu.py +104 -0
  1440. opencompass-0.3.0/opencompass/configs/summarizers/groups/ds1000.py +5 -0
  1441. opencompass-0.3.0/opencompass/configs/summarizers/groups/flores.py +31 -0
  1442. opencompass-0.3.0/opencompass/configs/summarizers/groups/infinitebench.py +5 -0
  1443. opencompass-0.3.0/opencompass/configs/summarizers/groups/jigsaw_multilingual.py +6 -0
  1444. opencompass-0.3.0/opencompass/configs/summarizers/groups/lawbench.py +29 -0
  1445. opencompass-0.3.0/opencompass/configs/summarizers/groups/lcbench.py +3 -0
  1446. opencompass-0.3.0/opencompass/configs/summarizers/groups/legacy/cibench.py +109 -0
  1447. opencompass-0.3.0/opencompass/configs/summarizers/groups/leval.py +3 -0
  1448. opencompass-0.3.0/opencompass/configs/summarizers/groups/longbench.py +22 -0
  1449. opencompass-0.3.0/opencompass/configs/summarizers/groups/lveval.py +110 -0
  1450. opencompass-0.3.0/opencompass/configs/summarizers/groups/mathbench.py +88 -0
  1451. opencompass-0.3.0/opencompass/configs/summarizers/groups/mathbench_2024.py +26 -0
  1452. opencompass-0.3.0/opencompass/configs/summarizers/groups/mathbench_agent.py +75 -0
  1453. opencompass-0.3.0/opencompass/configs/summarizers/groups/mathbench_v1.py +13 -0
  1454. opencompass-0.3.0/opencompass/configs/summarizers/groups/mathbench_v1_2024.py +44 -0
  1455. opencompass-0.3.0/opencompass/configs/summarizers/groups/mathbench_v1_2024_lang.py +57 -0
  1456. opencompass-0.3.0/opencompass/configs/summarizers/groups/mgsm.py +9 -0
  1457. opencompass-0.3.0/opencompass/configs/summarizers/groups/mmlu.py +23 -0
  1458. opencompass-0.3.0/opencompass/configs/summarizers/groups/mmlu_pro.py +5 -0
  1459. opencompass-0.3.0/opencompass/configs/summarizers/groups/plugineval.py +150 -0
  1460. opencompass-0.3.0/opencompass/configs/summarizers/groups/scibench.py +6 -0
  1461. opencompass-0.3.0/opencompass/configs/summarizers/groups/teval.py +73 -0
  1462. opencompass-0.3.0/opencompass/configs/summarizers/groups/tydiqa.py +5 -0
  1463. opencompass-0.3.0/opencompass/configs/summarizers/groups/xiezhi.py +4 -0
  1464. opencompass-0.3.0/opencompass/configs/summarizers/infinitebench.py +8 -0
  1465. opencompass-0.3.0/opencompass/configs/summarizers/internlm2_keyset.py +20 -0
  1466. opencompass-0.3.0/opencompass/configs/summarizers/lawbench.py +58 -0
  1467. opencompass-0.3.0/opencompass/configs/summarizers/leaderboard.py +99 -0
  1468. opencompass-0.3.0/opencompass/configs/summarizers/leval.py +25 -0
  1469. opencompass-0.3.0/opencompass/configs/summarizers/longbench.py +32 -0
  1470. opencompass-0.3.0/opencompass/configs/summarizers/longeval_v2.py +61 -0
  1471. opencompass-0.3.0/opencompass/configs/summarizers/lveval.py +114 -0
  1472. opencompass-0.3.0/opencompass/configs/summarizers/math_agent.py +25 -0
  1473. opencompass-0.3.0/opencompass/configs/summarizers/math_baseline.py +19 -0
  1474. opencompass-0.3.0/opencompass/configs/summarizers/mathbench.py +18 -0
  1475. opencompass-0.3.0/opencompass/configs/summarizers/mathbench_v1.py +41 -0
  1476. opencompass-0.3.0/opencompass/configs/summarizers/medium.py +93 -0
  1477. opencompass-0.3.0/opencompass/configs/summarizers/mmlu_pro.py +25 -0
  1478. opencompass-0.3.0/opencompass/configs/summarizers/needlebench.py +315 -0
  1479. opencompass-0.3.0/opencompass/configs/summarizers/plugineval.py +36 -0
  1480. opencompass-0.3.0/opencompass/configs/summarizers/small.py +61 -0
  1481. opencompass-0.3.0/opencompass/configs/summarizers/subjective.py +5 -0
  1482. opencompass-0.3.0/opencompass/configs/summarizers/teval.py +36 -0
  1483. opencompass-0.3.0/opencompass/configs/summarizers/tiny.py +30 -0
  1484. opencompass-0.3.0/opencompass/datasets/FinanceIQ.py +41 -0
  1485. opencompass-0.3.0/opencompass/datasets/GaokaoBench.py +156 -0
  1486. opencompass-0.3.0/opencompass/datasets/IFEval/ifeval.py +97 -0
  1487. opencompass-0.3.0/opencompass/datasets/LCBench.py +331 -0
  1488. opencompass-0.3.0/opencompass/datasets/MMLUArabic.py +35 -0
  1489. opencompass-0.3.0/opencompass/datasets/NPHardEval/cmp_GCP_D.py +167 -0
  1490. opencompass-0.3.0/opencompass/datasets/NPHardEval/cmp_KSP.py +185 -0
  1491. opencompass-0.3.0/opencompass/datasets/NPHardEval/cmp_TSP_D.py +156 -0
  1492. opencompass-0.3.0/opencompass/datasets/NPHardEval/hard_GCP.py +191 -0
  1493. opencompass-0.3.0/opencompass/datasets/NPHardEval/hard_MSP.py +205 -0
  1494. opencompass-0.3.0/opencompass/datasets/NPHardEval/hard_TSP.py +213 -0
  1495. opencompass-0.3.0/opencompass/datasets/NPHardEval/p_BSP.py +126 -0
  1496. opencompass-0.3.0/opencompass/datasets/NPHardEval/p_EDP.py +147 -0
  1497. opencompass-0.3.0/opencompass/datasets/NPHardEval/p_SPP.py +202 -0
  1498. opencompass-0.3.0/opencompass/datasets/OpenFinData.py +49 -0
  1499. opencompass-0.3.0/opencompass/datasets/QuALITY.py +61 -0
  1500. opencompass-0.3.0/opencompass/datasets/TheoremQA/legacy.py +40 -0
  1501. opencompass-0.3.0/opencompass/datasets/TheoremQA/main.py +68 -0
  1502. opencompass-0.3.0/opencompass/datasets/__init__.py +127 -0
  1503. opencompass-0.3.0/opencompass/datasets/advglue.py +176 -0
  1504. opencompass-0.3.0/opencompass/datasets/afqmcd.py +33 -0
  1505. opencompass-0.3.0/opencompass/datasets/agieval/agieval.py +125 -0
  1506. opencompass-0.3.0/opencompass/datasets/agieval/dataset_loader.py +407 -0
  1507. opencompass-0.3.0/opencompass/datasets/arc.py +149 -0
  1508. opencompass-0.3.0/opencompass/datasets/ax.py +37 -0
  1509. opencompass-0.3.0/opencompass/datasets/bbh.py +111 -0
  1510. opencompass-0.3.0/opencompass/datasets/boolq.py +59 -0
  1511. opencompass-0.3.0/opencompass/datasets/calm/__init__.py +1 -0
  1512. opencompass-0.3.0/opencompass/datasets/calm/calm.py +60 -0
  1513. opencompass-0.3.0/opencompass/datasets/calm/data_processing/generate_questions.py +192 -0
  1514. opencompass-0.3.0/opencompass/datasets/calm/data_processing/task_hiearchy.py +125 -0
  1515. opencompass-0.3.0/opencompass/datasets/calm/evaluation/core_metrics.py +320 -0
  1516. opencompass-0.3.0/opencompass/datasets/calm/evaluation/errors.py +253 -0
  1517. opencompass-0.3.0/opencompass/datasets/calm/utils/__init__.py +0 -0
  1518. opencompass-0.3.0/opencompass/datasets/calm/utils/load_items.py +18 -0
  1519. opencompass-0.3.0/opencompass/datasets/cb.py +27 -0
  1520. opencompass-0.3.0/opencompass/datasets/ceval.py +117 -0
  1521. opencompass-0.3.0/opencompass/datasets/charm.py +154 -0
  1522. opencompass-0.3.0/opencompass/datasets/chid.py +48 -0
  1523. opencompass-0.3.0/opencompass/datasets/circular.py +373 -0
  1524. opencompass-0.3.0/opencompass/datasets/clozeTest_maxmin.py +38 -0
  1525. opencompass-0.3.0/opencompass/datasets/cluewsc.py +62 -0
  1526. opencompass-0.3.0/opencompass/datasets/cmb.py +33 -0
  1527. opencompass-0.3.0/opencompass/datasets/cmmlu.py +57 -0
  1528. opencompass-0.3.0/opencompass/datasets/cmnli.py +71 -0
  1529. opencompass-0.3.0/opencompass/datasets/cmrc.py +49 -0
  1530. opencompass-0.3.0/opencompass/datasets/commonsenseqa.py +65 -0
  1531. opencompass-0.3.0/opencompass/datasets/commonsenseqa_cn.py +33 -0
  1532. opencompass-0.3.0/opencompass/datasets/compassbench_obj.py +71 -0
  1533. opencompass-0.3.0/opencompass/datasets/copa.py +23 -0
  1534. opencompass-0.3.0/opencompass/datasets/crowspairs.py +98 -0
  1535. opencompass-0.3.0/opencompass/datasets/crowspairs_cn.py +27 -0
  1536. opencompass-0.3.0/opencompass/datasets/csl.py +48 -0
  1537. opencompass-0.3.0/opencompass/datasets/cvalues.py +26 -0
  1538. opencompass-0.3.0/opencompass/datasets/drcd.py +49 -0
  1539. opencompass-0.3.0/opencompass/datasets/drop_simple_eval.py +82 -0
  1540. opencompass-0.3.0/opencompass/datasets/ds1000.py +444 -0
  1541. opencompass-0.3.0/opencompass/datasets/eprstmt.py +29 -0
  1542. opencompass-0.3.0/opencompass/datasets/flames.py +59 -0
  1543. opencompass-0.3.0/opencompass/datasets/flores.py +81 -0
  1544. opencompass-0.3.0/opencompass/datasets/game24.py +259 -0
  1545. opencompass-0.3.0/opencompass/datasets/govrepcrs.py +41 -0
  1546. opencompass-0.3.0/opencompass/datasets/gpqa.py +115 -0
  1547. opencompass-0.3.0/opencompass/datasets/gsm8k.py +151 -0
  1548. opencompass-0.3.0/opencompass/datasets/gsm_hard.py +26 -0
  1549. opencompass-0.3.0/opencompass/datasets/hellaswag.py +234 -0
  1550. opencompass-0.3.0/opencompass/datasets/huggingface.py +17 -0
  1551. opencompass-0.3.0/opencompass/datasets/humaneval.py +186 -0
  1552. opencompass-0.3.0/opencompass/datasets/humaneval_multi.py +220 -0
  1553. opencompass-0.3.0/opencompass/datasets/humanevalx.py +244 -0
  1554. opencompass-0.3.0/opencompass/datasets/hungarian_math.py +22 -0
  1555. opencompass-0.3.0/opencompass/datasets/inference_ppl.py +39 -0
  1556. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_codedebug.py +38 -0
  1557. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_coderun.py +36 -0
  1558. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_endia.py +52 -0
  1559. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_enmc.py +38 -0
  1560. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_enqa.py +30 -0
  1561. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_ensum.py +25 -0
  1562. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_mathcalc.py +56 -0
  1563. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_mathfind.py +34 -0
  1564. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_retrievekv.py +51 -0
  1565. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_retrievenumber.py +30 -0
  1566. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_retrievepasskey.py +30 -0
  1567. opencompass-0.3.0/opencompass/datasets/infinitebench/infinitebench_zhqa.py +30 -0
  1568. opencompass-0.3.0/opencompass/datasets/jigsawmultilingual.py +39 -0
  1569. opencompass-0.3.0/opencompass/datasets/jsonl.py +22 -0
  1570. opencompass-0.3.0/opencompass/datasets/kaoshi.py +140 -0
  1571. opencompass-0.3.0/opencompass/datasets/lambada.py +53 -0
  1572. opencompass-0.3.0/opencompass/datasets/lawbench/lawbench.py +85 -0
  1573. opencompass-0.3.0/opencompass/datasets/lcsts.py +55 -0
  1574. opencompass-0.3.0/opencompass/datasets/leval/leval_coursera.py +31 -0
  1575. opencompass-0.3.0/opencompass/datasets/leval/leval_financial_qa.py +32 -0
  1576. opencompass-0.3.0/opencompass/datasets/leval/leval_gov_report_summ.py +32 -0
  1577. opencompass-0.3.0/opencompass/datasets/leval/leval_gsm100.py +62 -0
  1578. opencompass-0.3.0/opencompass/datasets/leval/leval_legal_contract_qa.py +32 -0
  1579. opencompass-0.3.0/opencompass/datasets/leval/leval_meeting_summ.py +32 -0
  1580. opencompass-0.3.0/opencompass/datasets/leval/leval_multidoc_qa.py +32 -0
  1581. opencompass-0.3.0/opencompass/datasets/leval/leval_narrattive_qa.py +32 -0
  1582. opencompass-0.3.0/opencompass/datasets/leval/leval_natural_question.py +32 -0
  1583. opencompass-0.3.0/opencompass/datasets/leval/leval_news_summ.py +32 -0
  1584. opencompass-0.3.0/opencompass/datasets/leval/leval_paper_assistant.py +32 -0
  1585. opencompass-0.3.0/opencompass/datasets/leval/leval_patent_summ.py +32 -0
  1586. opencompass-0.3.0/opencompass/datasets/leval/leval_quality.py +31 -0
  1587. opencompass-0.3.0/opencompass/datasets/leval/leval_review_summ.py +32 -0
  1588. opencompass-0.3.0/opencompass/datasets/leval/leval_scientific_qa.py +32 -0
  1589. opencompass-0.3.0/opencompass/datasets/leval/leval_topic_retrieval.py +31 -0
  1590. opencompass-0.3.0/opencompass/datasets/leval/leval_tpo.py +31 -0
  1591. opencompass-0.3.0/opencompass/datasets/leval/leval_tvshow_summ.py +32 -0
  1592. opencompass-0.3.0/opencompass/datasets/llm_compression.py +38 -0
  1593. opencompass-0.3.0/opencompass/datasets/longbench/longbench_2wikim_qa.py +30 -0
  1594. opencompass-0.3.0/opencompass/datasets/longbench/longbench_dureader.py +30 -0
  1595. opencompass-0.3.0/opencompass/datasets/longbench/longbench_gov_report.py +25 -0
  1596. opencompass-0.3.0/opencompass/datasets/longbench/longbench_hotpot_qa.py +30 -0
  1597. opencompass-0.3.0/opencompass/datasets/longbench/longbench_lcc.py +25 -0
  1598. opencompass-0.3.0/opencompass/datasets/longbench/longbench_lsht.py +40 -0
  1599. opencompass-0.3.0/opencompass/datasets/longbench/longbench_multi_news.py +25 -0
  1600. opencompass-0.3.0/opencompass/datasets/longbench/longbench_multifieldqa_en.py +30 -0
  1601. opencompass-0.3.0/opencompass/datasets/longbench/longbench_multifieldqa_zh.py +30 -0
  1602. opencompass-0.3.0/opencompass/datasets/longbench/longbench_musique.py +30 -0
  1603. opencompass-0.3.0/opencompass/datasets/longbench/longbench_narrative_qa.py +30 -0
  1604. opencompass-0.3.0/opencompass/datasets/longbench/longbench_passage_count.py +25 -0
  1605. opencompass-0.3.0/opencompass/datasets/longbench/longbench_passage_retrieval_en.py +30 -0
  1606. opencompass-0.3.0/opencompass/datasets/longbench/longbench_passage_retrieval_zh.py +30 -0
  1607. opencompass-0.3.0/opencompass/datasets/longbench/longbench_qasper.py +30 -0
  1608. opencompass-0.3.0/opencompass/datasets/longbench/longbench_qmsum.py +30 -0
  1609. opencompass-0.3.0/opencompass/datasets/longbench/longbench_repobench.py +30 -0
  1610. opencompass-0.3.0/opencompass/datasets/longbench/longbench_samsum.py +36 -0
  1611. opencompass-0.3.0/opencompass/datasets/longbench/longbench_trec.py +40 -0
  1612. opencompass-0.3.0/opencompass/datasets/longbench/longbench_trivia_qa.py +36 -0
  1613. opencompass-0.3.0/opencompass/datasets/longbench/longbench_vcsum.py +24 -0
  1614. opencompass-0.3.0/opencompass/datasets/lveval/lveval_cmrc_mixup.py +32 -0
  1615. opencompass-0.3.0/opencompass/datasets/lveval/lveval_dureader_mixup.py +30 -0
  1616. opencompass-0.3.0/opencompass/datasets/lveval/lveval_factrecall_en.py +32 -0
  1617. opencompass-0.3.0/opencompass/datasets/lveval/lveval_factrecall_zh.py +32 -0
  1618. opencompass-0.3.0/opencompass/datasets/lveval/lveval_hotpotwikiqa_mixup.py +35 -0
  1619. opencompass-0.3.0/opencompass/datasets/lveval/lveval_lic_mixup.py +35 -0
  1620. opencompass-0.3.0/opencompass/datasets/lveval/lveval_loogle_CR_mixup.py +33 -0
  1621. opencompass-0.3.0/opencompass/datasets/lveval/lveval_loogle_MIR_mixup.py +33 -0
  1622. opencompass-0.3.0/opencompass/datasets/lveval/lveval_loogle_SD_mixup.py +33 -0
  1623. opencompass-0.3.0/opencompass/datasets/lveval/lveval_multifieldqa_en_mixup.py +35 -0
  1624. opencompass-0.3.0/opencompass/datasets/lveval/lveval_multifieldqa_zh_mixup.py +35 -0
  1625. opencompass-0.3.0/opencompass/datasets/math.py +570 -0
  1626. opencompass-0.3.0/opencompass/datasets/mathbench.py +383 -0
  1627. opencompass-0.3.0/opencompass/datasets/mbpp.py +521 -0
  1628. opencompass-0.3.0/opencompass/datasets/medbench/medbench.py +589 -0
  1629. opencompass-0.3.0/opencompass/datasets/mgsm.py +80 -0
  1630. opencompass-0.3.0/opencompass/datasets/mmlu.py +153 -0
  1631. opencompass-0.3.0/opencompass/datasets/multirc.py +66 -0
  1632. opencompass-0.3.0/opencompass/datasets/narrativeqa.py +45 -0
  1633. opencompass-0.3.0/opencompass/datasets/natural_question.py +104 -0
  1634. opencompass-0.3.0/opencompass/datasets/natural_question_cn.py +56 -0
  1635. opencompass-0.3.0/opencompass/datasets/needlebench/__init__.py +0 -0
  1636. opencompass-0.3.0/opencompass/datasets/needlebench/multi.py +272 -0
  1637. opencompass-0.3.0/opencompass/datasets/needlebench/origin.py +292 -0
  1638. opencompass-0.3.0/opencompass/datasets/needlebench/parallel.py +323 -0
  1639. opencompass-0.3.0/opencompass/datasets/obqa.py +95 -0
  1640. opencompass-0.3.0/opencompass/datasets/piqa.py +178 -0
  1641. opencompass-0.3.0/opencompass/datasets/py150.py +40 -0
  1642. opencompass-0.3.0/opencompass/datasets/qasper.py +45 -0
  1643. opencompass-0.3.0/opencompass/datasets/qaspercut.py +55 -0
  1644. opencompass-0.3.0/opencompass/datasets/race.py +57 -0
  1645. opencompass-0.3.0/opencompass/datasets/realtoxicprompts.py +43 -0
  1646. opencompass-0.3.0/opencompass/datasets/record.py +79 -0
  1647. opencompass-0.3.0/opencompass/datasets/rolebench.py +88 -0
  1648. opencompass-0.3.0/opencompass/datasets/safety.py +25 -0
  1649. opencompass-0.3.0/opencompass/datasets/scibench.py +52 -0
  1650. opencompass-0.3.0/opencompass/datasets/siqa.py +185 -0
  1651. opencompass-0.3.0/opencompass/datasets/squad20.py +68 -0
  1652. opencompass-0.3.0/opencompass/datasets/storycloze.py +80 -0
  1653. opencompass-0.3.0/opencompass/datasets/strategyqa.py +44 -0
  1654. opencompass-0.3.0/opencompass/datasets/subjective/__init__.py +17 -0
  1655. opencompass-0.3.0/opencompass/datasets/subjective/alignbench.py +113 -0
  1656. opencompass-0.3.0/opencompass/datasets/subjective/arena_hard.py +37 -0
  1657. opencompass-0.3.0/opencompass/datasets/subjective/compassbench_checklist.py +37 -0
  1658. opencompass-0.3.0/opencompass/datasets/subjective/mtbench.py +208 -0
  1659. opencompass-0.3.0/opencompass/datasets/subjective/subjective_cmp.py +36 -0
  1660. opencompass-0.3.0/opencompass/datasets/subjective/wildbench.py +249 -0
  1661. opencompass-0.3.0/opencompass/datasets/summedits.py +32 -0
  1662. opencompass-0.3.0/opencompass/datasets/summscreen.py +46 -0
  1663. opencompass-0.3.0/opencompass/datasets/svamp.py +25 -0
  1664. opencompass-0.3.0/opencompass/datasets/tabmwp.py +247 -0
  1665. opencompass-0.3.0/opencompass/datasets/taco.py +827 -0
  1666. opencompass-0.3.0/opencompass/datasets/teval/__init__.py +60 -0
  1667. opencompass-0.3.0/opencompass/datasets/teval/utils/__init__.py +0 -0
  1668. opencompass-0.3.0/opencompass/datasets/tnews.py +81 -0
  1669. opencompass-0.3.0/opencompass/datasets/triviaqa.py +136 -0
  1670. opencompass-0.3.0/opencompass/datasets/triviaqarc.py +60 -0
  1671. opencompass-0.3.0/opencompass/datasets/tydiqa.py +87 -0
  1672. opencompass-0.3.0/opencompass/datasets/wic.py +46 -0
  1673. opencompass-0.3.0/opencompass/datasets/wikibench.py +64 -0
  1674. opencompass-0.3.0/opencompass/datasets/winograd.py +23 -0
  1675. opencompass-0.3.0/opencompass/datasets/winogrande.py +174 -0
  1676. opencompass-0.3.0/opencompass/datasets/wsc.py +107 -0
  1677. opencompass-0.3.0/opencompass/datasets/xiezhi.py +90 -0
  1678. opencompass-0.3.0/opencompass/datasets/xsum.py +54 -0
  1679. opencompass-0.3.0/opencompass/models/__init__.py +50 -0
  1680. opencompass-0.3.0/opencompass/models/base.py +514 -0
  1681. opencompass-0.3.0/opencompass/models/base_api.py +457 -0
  1682. opencompass-0.3.0/opencompass/models/claude_allesapin.py +150 -0
  1683. opencompass-0.3.0/opencompass/models/claude_sdk_api.py +121 -0
  1684. opencompass-0.3.0/opencompass/models/doubao_api.py +167 -0
  1685. opencompass-0.3.0/opencompass/models/gemini_api.py +203 -0
  1686. opencompass-0.3.0/opencompass/models/huggingface_above_v4_33.py +606 -0
  1687. opencompass-0.3.0/opencompass/models/openai_api.py +455 -0
  1688. opencompass-0.3.0/opencompass/models/turbomind.py +239 -0
  1689. opencompass-0.3.0/opencompass/openicl/icl_evaluator/__init__.py +14 -0
  1690. opencompass-0.3.0/opencompass/openicl/icl_evaluator/hf_metrics/accuracy.py +106 -0
  1691. opencompass-0.3.0/opencompass/openicl/icl_evaluator/hf_metrics/rouge.py +158 -0
  1692. opencompass-0.3.0/opencompass/openicl/icl_evaluator/hf_metrics/sacrebleu.py +178 -0
  1693. opencompass-0.3.0/opencompass/openicl/icl_evaluator/hf_metrics/squad.py +111 -0
  1694. opencompass-0.3.0/opencompass/openicl/icl_evaluator/icl_misc_evaluator.py +27 -0
  1695. opencompass-0.3.0/opencompass/openicl/icl_inferencer/__init__.py +15 -0
  1696. opencompass-0.3.0/opencompass/openicl/icl_inferencer/icl_inference_ppl_only_inferencer.py +239 -0
  1697. opencompass-0.3.0/opencompass/openicl/icl_retriever/icl_base_retriever.py +324 -0
  1698. opencompass-0.3.0/opencompass/partitioners/base.py +172 -0
  1699. opencompass-0.3.0/opencompass/partitioners/num_worker.py +152 -0
  1700. opencompass-0.3.0/opencompass/runners/__init__.py +5 -0
  1701. opencompass-0.3.0/opencompass/runners/local.py +232 -0
  1702. opencompass-0.3.0/opencompass/runners/volc.py +260 -0
  1703. opencompass-0.3.0/opencompass/summarizers/default.py +362 -0
  1704. opencompass-0.3.0/opencompass/summarizers/needlebench.py +739 -0
  1705. opencompass-0.3.0/opencompass/summarizers/subjective/__init__.py +17 -0
  1706. opencompass-0.3.0/opencompass/summarizers/subjective/alignmentbench.py +390 -0
  1707. opencompass-0.3.0/opencompass/summarizers/subjective/charm.py +208 -0
  1708. opencompass-0.3.0/opencompass/summarizers/subjective/compassbench_v13.py +169 -0
  1709. opencompass-0.3.0/opencompass/summarizers/subjective/subjective.py +105 -0
  1710. opencompass-0.3.0/opencompass/tasks/openicl_attack.py +204 -0
  1711. opencompass-0.3.0/opencompass/tasks/openicl_eval.py +399 -0
  1712. opencompass-0.3.0/opencompass/tasks/openicl_infer.py +163 -0
  1713. opencompass-0.3.0/opencompass/tasks/subjective_eval.py +453 -0
  1714. opencompass-0.3.0/opencompass/utils/__init__.py +13 -0
  1715. opencompass-0.3.0/opencompass/utils/collect_env.py +26 -0
  1716. opencompass-0.3.0/opencompass/utils/datasets.py +103 -0
  1717. opencompass-0.3.0/opencompass/utils/datasets_info.py +345 -0
  1718. opencompass-0.3.0/opencompass/utils/fileio.py +378 -0
  1719. opencompass-0.3.0/opencompass.egg-info/PKG-INFO +613 -0
  1720. opencompass-0.3.0/opencompass.egg-info/SOURCES.txt +1973 -0
  1721. opencompass-0.3.0/opencompass.egg-info/requires.txt +47 -0
  1722. opencompass-0.3.0/setup.py +143 -0
  1723. opencompass-0.2.6/PKG-INFO +0 -591
  1724. opencompass-0.2.6/README.md +0 -572
  1725. opencompass-0.2.6/opencompass/__init__.py +0 -1
  1726. opencompass-0.2.6/opencompass/cli/main.py +0 -383
  1727. opencompass-0.2.6/opencompass/datasets/FinanceIQ.py +0 -39
  1728. opencompass-0.2.6/opencompass/datasets/GaokaoBench.py +0 -149
  1729. opencompass-0.2.6/opencompass/datasets/IFEval/ifeval.py +0 -95
  1730. opencompass-0.2.6/opencompass/datasets/MMLUArabic.py +0 -33
  1731. opencompass-0.2.6/opencompass/datasets/NPHardEval/cmp_GCP_D.py +0 -165
  1732. opencompass-0.2.6/opencompass/datasets/NPHardEval/cmp_KSP.py +0 -183
  1733. opencompass-0.2.6/opencompass/datasets/NPHardEval/cmp_TSP_D.py +0 -154
  1734. opencompass-0.2.6/opencompass/datasets/NPHardEval/hard_GCP.py +0 -189
  1735. opencompass-0.2.6/opencompass/datasets/NPHardEval/hard_MSP.py +0 -203
  1736. opencompass-0.2.6/opencompass/datasets/NPHardEval/hard_TSP.py +0 -211
  1737. opencompass-0.2.6/opencompass/datasets/NPHardEval/p_BSP.py +0 -124
  1738. opencompass-0.2.6/opencompass/datasets/NPHardEval/p_EDP.py +0 -145
  1739. opencompass-0.2.6/opencompass/datasets/NPHardEval/p_SPP.py +0 -200
  1740. opencompass-0.2.6/opencompass/datasets/OpenFinData.py +0 -47
  1741. opencompass-0.2.6/opencompass/datasets/QuALITY.py +0 -59
  1742. opencompass-0.2.6/opencompass/datasets/TheoremQA/legacy.py +0 -38
  1743. opencompass-0.2.6/opencompass/datasets/TheoremQA/main.py +0 -66
  1744. opencompass-0.2.6/opencompass/datasets/__init__.py +0 -123
  1745. opencompass-0.2.6/opencompass/datasets/advglue.py +0 -174
  1746. opencompass-0.2.6/opencompass/datasets/afqmcd.py +0 -21
  1747. opencompass-0.2.6/opencompass/datasets/agieval/agieval.py +0 -99
  1748. opencompass-0.2.6/opencompass/datasets/agieval/dataset_loader.py +0 -392
  1749. opencompass-0.2.6/opencompass/datasets/arc.py +0 -84
  1750. opencompass-0.2.6/opencompass/datasets/ax.py +0 -24
  1751. opencompass-0.2.6/opencompass/datasets/bbh.py +0 -98
  1752. opencompass-0.2.6/opencompass/datasets/boolq.py +0 -56
  1753. opencompass-0.2.6/opencompass/datasets/cb.py +0 -25
  1754. opencompass-0.2.6/opencompass/datasets/ceval.py +0 -76
  1755. opencompass-0.2.6/opencompass/datasets/charm.py +0 -55
  1756. opencompass-0.2.6/opencompass/datasets/chid.py +0 -43
  1757. opencompass-0.2.6/opencompass/datasets/circular.py +0 -373
  1758. opencompass-0.2.6/opencompass/datasets/clozeTest_maxmin.py +0 -35
  1759. opencompass-0.2.6/opencompass/datasets/cluewsc.py +0 -57
  1760. opencompass-0.2.6/opencompass/datasets/cmb.py +0 -31
  1761. opencompass-0.2.6/opencompass/datasets/cmmlu.py +0 -34
  1762. opencompass-0.2.6/opencompass/datasets/cmnli.py +0 -42
  1763. opencompass-0.2.6/opencompass/datasets/cmrc.py +0 -47
  1764. opencompass-0.2.6/opencompass/datasets/commonsenseqa.py +0 -44
  1765. opencompass-0.2.6/opencompass/datasets/commonsenseqa_cn.py +0 -30
  1766. opencompass-0.2.6/opencompass/datasets/copa.py +0 -21
  1767. opencompass-0.2.6/opencompass/datasets/crowspairs.py +0 -98
  1768. opencompass-0.2.6/opencompass/datasets/crowspairs_cn.py +0 -23
  1769. opencompass-0.2.6/opencompass/datasets/csl.py +0 -43
  1770. opencompass-0.2.6/opencompass/datasets/cvalues.py +0 -25
  1771. opencompass-0.2.6/opencompass/datasets/drcd.py +0 -47
  1772. opencompass-0.2.6/opencompass/datasets/drop_simple_eval.py +0 -80
  1773. opencompass-0.2.6/opencompass/datasets/ds1000.py +0 -442
  1774. opencompass-0.2.6/opencompass/datasets/eprstmt.py +0 -27
  1775. opencompass-0.2.6/opencompass/datasets/flames.py +0 -57
  1776. opencompass-0.2.6/opencompass/datasets/flores.py +0 -53
  1777. opencompass-0.2.6/opencompass/datasets/game24.py +0 -257
  1778. opencompass-0.2.6/opencompass/datasets/govrepcrs.py +0 -37
  1779. opencompass-0.2.6/opencompass/datasets/gpqa.py +0 -111
  1780. opencompass-0.2.6/opencompass/datasets/gsm8k.py +0 -143
  1781. opencompass-0.2.6/opencompass/datasets/gsm_hard.py +0 -24
  1782. opencompass-0.2.6/opencompass/datasets/hellaswag.py +0 -144
  1783. opencompass-0.2.6/opencompass/datasets/huggingface.py +0 -13
  1784. opencompass-0.2.6/opencompass/datasets/humaneval.py +0 -173
  1785. opencompass-0.2.6/opencompass/datasets/humaneval_multi.py +0 -218
  1786. opencompass-0.2.6/opencompass/datasets/humanevalx.py +0 -242
  1787. opencompass-0.2.6/opencompass/datasets/hungarian_math.py +0 -20
  1788. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_codedebug.py +0 -36
  1789. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_coderun.py +0 -34
  1790. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_endia.py +0 -50
  1791. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_enmc.py +0 -36
  1792. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_enqa.py +0 -28
  1793. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_ensum.py +0 -23
  1794. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_mathcalc.py +0 -54
  1795. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_mathfind.py +0 -32
  1796. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_retrievekv.py +0 -49
  1797. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_retrievenumber.py +0 -28
  1798. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_retrievepasskey.py +0 -28
  1799. opencompass-0.2.6/opencompass/datasets/infinitebench/infinitebench_zhqa.py +0 -28
  1800. opencompass-0.2.6/opencompass/datasets/jigsawmultilingual.py +0 -35
  1801. opencompass-0.2.6/opencompass/datasets/jsonl.py +0 -20
  1802. opencompass-0.2.6/opencompass/datasets/kaoshi.py +0 -138
  1803. opencompass-0.2.6/opencompass/datasets/lambada.py +0 -45
  1804. opencompass-0.2.6/opencompass/datasets/lawbench/lawbench.py +0 -83
  1805. opencompass-0.2.6/opencompass/datasets/lcsts.py +0 -40
  1806. opencompass-0.2.6/opencompass/datasets/leval/leval_coursera.py +0 -27
  1807. opencompass-0.2.6/opencompass/datasets/leval/leval_financial_qa.py +0 -28
  1808. opencompass-0.2.6/opencompass/datasets/leval/leval_gov_report_summ.py +0 -28
  1809. opencompass-0.2.6/opencompass/datasets/leval/leval_gsm100.py +0 -58
  1810. opencompass-0.2.6/opencompass/datasets/leval/leval_legal_contract_qa.py +0 -28
  1811. opencompass-0.2.6/opencompass/datasets/leval/leval_meeting_summ.py +0 -28
  1812. opencompass-0.2.6/opencompass/datasets/leval/leval_multidoc_qa.py +0 -28
  1813. opencompass-0.2.6/opencompass/datasets/leval/leval_narrattive_qa.py +0 -28
  1814. opencompass-0.2.6/opencompass/datasets/leval/leval_natural_question.py +0 -28
  1815. opencompass-0.2.6/opencompass/datasets/leval/leval_news_summ.py +0 -28
  1816. opencompass-0.2.6/opencompass/datasets/leval/leval_paper_assistant.py +0 -28
  1817. opencompass-0.2.6/opencompass/datasets/leval/leval_patent_summ.py +0 -28
  1818. opencompass-0.2.6/opencompass/datasets/leval/leval_quality.py +0 -27
  1819. opencompass-0.2.6/opencompass/datasets/leval/leval_review_summ.py +0 -28
  1820. opencompass-0.2.6/opencompass/datasets/leval/leval_scientific_qa.py +0 -28
  1821. opencompass-0.2.6/opencompass/datasets/leval/leval_topic_retrieval.py +0 -27
  1822. opencompass-0.2.6/opencompass/datasets/leval/leval_tpo.py +0 -27
  1823. opencompass-0.2.6/opencompass/datasets/leval/leval_tvshow_summ.py +0 -28
  1824. opencompass-0.2.6/opencompass/datasets/llm_compression.py +0 -36
  1825. opencompass-0.2.6/opencompass/datasets/longbench/longbench_2wikim_qa.py +0 -26
  1826. opencompass-0.2.6/opencompass/datasets/longbench/longbench_dureader.py +0 -26
  1827. opencompass-0.2.6/opencompass/datasets/longbench/longbench_gov_report.py +0 -21
  1828. opencompass-0.2.6/opencompass/datasets/longbench/longbench_hotpot_qa.py +0 -26
  1829. opencompass-0.2.6/opencompass/datasets/longbench/longbench_lcc.py +0 -21
  1830. opencompass-0.2.6/opencompass/datasets/longbench/longbench_lsht.py +0 -36
  1831. opencompass-0.2.6/opencompass/datasets/longbench/longbench_multi_news.py +0 -21
  1832. opencompass-0.2.6/opencompass/datasets/longbench/longbench_multifieldqa_en.py +0 -26
  1833. opencompass-0.2.6/opencompass/datasets/longbench/longbench_multifieldqa_zh.py +0 -26
  1834. opencompass-0.2.6/opencompass/datasets/longbench/longbench_musique.py +0 -26
  1835. opencompass-0.2.6/opencompass/datasets/longbench/longbench_narrative_qa.py +0 -26
  1836. opencompass-0.2.6/opencompass/datasets/longbench/longbench_passage_count.py +0 -21
  1837. opencompass-0.2.6/opencompass/datasets/longbench/longbench_passage_retrieval_en.py +0 -26
  1838. opencompass-0.2.6/opencompass/datasets/longbench/longbench_passage_retrieval_zh.py +0 -26
  1839. opencompass-0.2.6/opencompass/datasets/longbench/longbench_qasper.py +0 -26
  1840. opencompass-0.2.6/opencompass/datasets/longbench/longbench_qmsum.py +0 -26
  1841. opencompass-0.2.6/opencompass/datasets/longbench/longbench_repobench.py +0 -26
  1842. opencompass-0.2.6/opencompass/datasets/longbench/longbench_samsum.py +0 -32
  1843. opencompass-0.2.6/opencompass/datasets/longbench/longbench_trec.py +0 -36
  1844. opencompass-0.2.6/opencompass/datasets/longbench/longbench_trivia_qa.py +0 -32
  1845. opencompass-0.2.6/opencompass/datasets/longbench/longbench_vcsum.py +0 -21
  1846. opencompass-0.2.6/opencompass/datasets/lveval/lveval_cmrc_mixup.py +0 -28
  1847. opencompass-0.2.6/opencompass/datasets/lveval/lveval_dureader_mixup.py +0 -26
  1848. opencompass-0.2.6/opencompass/datasets/lveval/lveval_factrecall_en.py +0 -28
  1849. opencompass-0.2.6/opencompass/datasets/lveval/lveval_factrecall_zh.py +0 -28
  1850. opencompass-0.2.6/opencompass/datasets/lveval/lveval_hotpotwikiqa_mixup.py +0 -31
  1851. opencompass-0.2.6/opencompass/datasets/lveval/lveval_lic_mixup.py +0 -31
  1852. opencompass-0.2.6/opencompass/datasets/lveval/lveval_loogle_CR_mixup.py +0 -29
  1853. opencompass-0.2.6/opencompass/datasets/lveval/lveval_loogle_MIR_mixup.py +0 -29
  1854. opencompass-0.2.6/opencompass/datasets/lveval/lveval_loogle_SD_mixup.py +0 -29
  1855. opencompass-0.2.6/opencompass/datasets/lveval/lveval_multifieldqa_en_mixup.py +0 -31
  1856. opencompass-0.2.6/opencompass/datasets/lveval/lveval_multifieldqa_zh_mixup.py +0 -31
  1857. opencompass-0.2.6/opencompass/datasets/math.py +0 -556
  1858. opencompass-0.2.6/opencompass/datasets/mathbench.py +0 -381
  1859. opencompass-0.2.6/opencompass/datasets/mbpp.py +0 -486
  1860. opencompass-0.2.6/opencompass/datasets/medbench/medbench.py +0 -587
  1861. opencompass-0.2.6/opencompass/datasets/mgsm.py +0 -78
  1862. opencompass-0.2.6/opencompass/datasets/mmlu.py +0 -88
  1863. opencompass-0.2.6/opencompass/datasets/multirc.py +0 -63
  1864. opencompass-0.2.6/opencompass/datasets/narrativeqa.py +0 -43
  1865. opencompass-0.2.6/opencompass/datasets/natural_question.py +0 -87
  1866. opencompass-0.2.6/opencompass/datasets/natural_question_cn.py +0 -54
  1867. opencompass-0.2.6/opencompass/datasets/needlebench/multi.py +0 -257
  1868. opencompass-0.2.6/opencompass/datasets/needlebench/origin.py +0 -277
  1869. opencompass-0.2.6/opencompass/datasets/needlebench/parallel.py +0 -311
  1870. opencompass-0.2.6/opencompass/datasets/obqa.py +0 -56
  1871. opencompass-0.2.6/opencompass/datasets/piqa.py +0 -108
  1872. opencompass-0.2.6/opencompass/datasets/py150.py +0 -38
  1873. opencompass-0.2.6/opencompass/datasets/qasper.py +0 -43
  1874. opencompass-0.2.6/opencompass/datasets/qaspercut.py +0 -53
  1875. opencompass-0.2.6/opencompass/datasets/race.py +0 -33
  1876. opencompass-0.2.6/opencompass/datasets/realtoxicprompts.py +0 -40
  1877. opencompass-0.2.6/opencompass/datasets/record.py +0 -76
  1878. opencompass-0.2.6/opencompass/datasets/rolebench.py +0 -84
  1879. opencompass-0.2.6/opencompass/datasets/safety.py +0 -23
  1880. opencompass-0.2.6/opencompass/datasets/scibench.py +0 -50
  1881. opencompass-0.2.6/opencompass/datasets/siqa.py +0 -114
  1882. opencompass-0.2.6/opencompass/datasets/squad20.py +0 -66
  1883. opencompass-0.2.6/opencompass/datasets/storycloze.py +0 -50
  1884. opencompass-0.2.6/opencompass/datasets/strategyqa.py +0 -33
  1885. opencompass-0.2.6/opencompass/datasets/subjective/__init__.py +0 -15
  1886. opencompass-0.2.6/opencompass/datasets/subjective/alignbench.py +0 -110
  1887. opencompass-0.2.6/opencompass/datasets/subjective/arena_hard.py +0 -35
  1888. opencompass-0.2.6/opencompass/datasets/subjective/mtbench.py +0 -206
  1889. opencompass-0.2.6/opencompass/datasets/subjective/subjective_cmp.py +0 -34
  1890. opencompass-0.2.6/opencompass/datasets/subjective/wildbench.py +0 -249
  1891. opencompass-0.2.6/opencompass/datasets/summedits.py +0 -21
  1892. opencompass-0.2.6/opencompass/datasets/summscreen.py +0 -44
  1893. opencompass-0.2.6/opencompass/datasets/svamp.py +0 -23
  1894. opencompass-0.2.6/opencompass/datasets/tabmwp.py +0 -245
  1895. opencompass-0.2.6/opencompass/datasets/taco.py +0 -825
  1896. opencompass-0.2.6/opencompass/datasets/teval/__init__.py +0 -58
  1897. opencompass-0.2.6/opencompass/datasets/tnews.py +0 -78
  1898. opencompass-0.2.6/opencompass/datasets/triviaqa.py +0 -95
  1899. opencompass-0.2.6/opencompass/datasets/triviaqarc.py +0 -58
  1900. opencompass-0.2.6/opencompass/datasets/tydiqa.py +0 -74
  1901. opencompass-0.2.6/opencompass/datasets/wic.py +0 -41
  1902. opencompass-0.2.6/opencompass/datasets/wikibench.py +0 -62
  1903. opencompass-0.2.6/opencompass/datasets/winograd.py +0 -23
  1904. opencompass-0.2.6/opencompass/datasets/winogrande.py +0 -95
  1905. opencompass-0.2.6/opencompass/datasets/wsc.py +0 -102
  1906. opencompass-0.2.6/opencompass/datasets/xiezhi.py +0 -88
  1907. opencompass-0.2.6/opencompass/datasets/xsum.py +0 -36
  1908. opencompass-0.2.6/opencompass/models/__init__.py +0 -48
  1909. opencompass-0.2.6/opencompass/models/base.py +0 -478
  1910. opencompass-0.2.6/opencompass/models/base_api.py +0 -457
  1911. opencompass-0.2.6/opencompass/models/gemini_api.py +0 -188
  1912. opencompass-0.2.6/opencompass/models/huggingface_above_v4_33.py +0 -447
  1913. opencompass-0.2.6/opencompass/models/openai_api.py +0 -349
  1914. opencompass-0.2.6/opencompass/models/turbomind.py +0 -236
  1915. opencompass-0.2.6/opencompass/openicl/icl_evaluator/__init__.py +0 -13
  1916. opencompass-0.2.6/opencompass/openicl/icl_evaluator/icl_misc_evaluator.py +0 -19
  1917. opencompass-0.2.6/opencompass/openicl/icl_inferencer/__init__.py +0 -13
  1918. opencompass-0.2.6/opencompass/openicl/icl_retriever/icl_base_retriever.py +0 -271
  1919. opencompass-0.2.6/opencompass/partitioners/base.py +0 -170
  1920. opencompass-0.2.6/opencompass/partitioners/num_worker.py +0 -150
  1921. opencompass-0.2.6/opencompass/runners/__init__.py +0 -4
  1922. opencompass-0.2.6/opencompass/runners/local.py +0 -224
  1923. opencompass-0.2.6/opencompass/summarizers/default.py +0 -362
  1924. opencompass-0.2.6/opencompass/summarizers/needlebench.py +0 -737
  1925. opencompass-0.2.6/opencompass/summarizers/subjective/__init__.py +0 -16
  1926. opencompass-0.2.6/opencompass/summarizers/subjective/alignmentbench.py +0 -387
  1927. opencompass-0.2.6/opencompass/summarizers/subjective/subjective.py +0 -105
  1928. opencompass-0.2.6/opencompass/tasks/openicl_attack.py +0 -204
  1929. opencompass-0.2.6/opencompass/tasks/openicl_eval.py +0 -370
  1930. opencompass-0.2.6/opencompass/tasks/openicl_infer.py +0 -163
  1931. opencompass-0.2.6/opencompass/tasks/subjective_eval.py +0 -452
  1932. opencompass-0.2.6/opencompass/utils/__init__.py +0 -12
  1933. opencompass-0.2.6/opencompass/utils/collect_env.py +0 -12
  1934. opencompass-0.2.6/opencompass/utils/fileio.py +0 -168
  1935. opencompass-0.2.6/opencompass.egg-info/PKG-INFO +0 -591
  1936. opencompass-0.2.6/opencompass.egg-info/SOURCES.txt +0 -467
  1937. opencompass-0.2.6/opencompass.egg-info/requires.txt +0 -47
  1938. opencompass-0.2.6/setup.py +0 -149
  1939. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/cli/__init__.py +0 -0
  1940. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/IFEval/__init__.py +0 -0
  1941. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/IFEval/evaluation_main.py +0 -0
  1942. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/IFEval/instructions.py +0 -0
  1943. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/IFEval/instructions_registry.py +0 -0
  1944. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/IFEval/instructions_util.py +0 -0
  1945. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/NPHardEval/__init__.py +0 -0
  1946. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/NPHardEval/prompts.py +0 -0
  1947. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/NPHardEval/utils.py +0 -0
  1948. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/TheoremQA/__init__.py +0 -0
  1949. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/TheoremQA/number_utils.py +0 -0
  1950. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/TheoremQA/utils.py +0 -0
  1951. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/agieval/__init__.py +0 -0
  1952. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/agieval/constructions.py +0 -0
  1953. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/agieval/evaluation.py +0 -0
  1954. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/agieval/math_equivalence.py +0 -0
  1955. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/agieval/post_process.py +0 -0
  1956. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/agieval/utils.py +0 -0
  1957. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/anli.py +0 -0
  1958. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/anthropics_evals.py +0 -0
  1959. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/apps.py +0 -0
  1960. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/base.py +0 -0
  1961. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/benbench.py +0 -0
  1962. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/bustum.py +0 -0
  1963. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/c3.py +0 -0
  1964. {opencompass-0.2.6/opencompass/datasets/needlebench → opencompass-0.3.0/opencompass/datasets/calm/data_processing}/__init__.py +0 -0
  1965. {opencompass-0.2.6/opencompass/datasets/teval/utils → opencompass-0.3.0/opencompass/datasets/calm/evaluation}/__init__.py +0 -0
  1966. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/chembench.py +0 -0
  1967. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/cibench.py +0 -0
  1968. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/civilcomments.py +0 -0
  1969. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/custom.py +0 -0
  1970. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/drop.py +0 -0
  1971. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/ds1000_interpreter.py +0 -0
  1972. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/infinitebench/__init__.py +0 -0
  1973. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/infinitebench/utils.py +0 -0
  1974. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/iwslt2017.py +0 -0
  1975. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/__init__.py +0 -0
  1976. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/__init__.py +0 -0
  1977. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/cjft.py +0 -0
  1978. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/flzx.py +0 -0
  1979. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/ftcs.py +0 -0
  1980. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/jdzy.py +0 -0
  1981. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/jec_ac.py +0 -0
  1982. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/jec_kd.py +0 -0
  1983. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/jetq.py +0 -0
  1984. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/lblj.py +0 -0
  1985. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/ljp_accusation.py +0 -0
  1986. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/ljp_article.py +0 -0
  1987. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/ljp_imprison.py +0 -0
  1988. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/sjjc.py +0 -0
  1989. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/wbfl.py +0 -0
  1990. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/wsjd.py +0 -0
  1991. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/xxcq.py +0 -0
  1992. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/ydlj.py +0 -0
  1993. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/yqzy.py +0 -0
  1994. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/evaluation_functions/zxfl.py +0 -0
  1995. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/__init__.py +0 -0
  1996. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/char_smi.py +0 -0
  1997. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/compare_m2_for_evaluation.py +0 -0
  1998. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/comprehension_scores.py +0 -0
  1999. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/function_utils.py +0 -0
  2000. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/modules/__init__.py +0 -0
  2001. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/modules/alignment.py +0 -0
  2002. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/modules/annotator.py +0 -0
  2003. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/modules/classifier.py +0 -0
  2004. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/modules/merger.py +0 -0
  2005. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/modules/tokenization.py +0 -0
  2006. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/modules/tokenizer.py +0 -0
  2007. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/parallel_to_m2.py +0 -0
  2008. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lawbench/utils/rc_f1.py +0 -0
  2009. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/leval/__init__.py +0 -0
  2010. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/leval/evaluators.py +0 -0
  2011. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lmeval.py +0 -0
  2012. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/longbench/__init__.py +0 -0
  2013. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/longbench/evaluators.py +0 -0
  2014. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lveval/__init__.py +0 -0
  2015. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/lveval/evaluators.py +0 -0
  2016. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/mastermath2024v1.py +0 -0
  2017. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/math401.py +0 -0
  2018. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/math_intern.py +0 -0
  2019. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/medbench/__init__.py +0 -0
  2020. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/medbench/constructions.py +0 -0
  2021. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/medbench/dataset_loader.py +0 -0
  2022. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/medbench/evaluation.py +0 -0
  2023. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/medbench/math_equivalence.py +0 -0
  2024. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/medbench/post_process.py +0 -0
  2025. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/medbench/utils.py +0 -0
  2026. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/mmlu_pro.py +0 -0
  2027. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/needlebench/atc.py +0 -0
  2028. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/needlebench/atc_choice.py +0 -0
  2029. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/reasonbench/ReasonBenchDataset.py +0 -0
  2030. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/reasonbench/__init__.py +0 -0
  2031. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/s3eval.py +0 -0
  2032. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/subjective/compass_arena.py +0 -0
  2033. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/subjective/compassbench.py +0 -0
  2034. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/subjective/compassbench_control_length_bias.py +0 -0
  2035. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/subjective/corev2.py +0 -0
  2036. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/subjective/creationbench.py +0 -0
  2037. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/subjective/fofo.py +0 -0
  2038. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/subjective/information_retrival.py +0 -0
  2039. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/subjective/mtbench101.py +0 -0
  2040. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/subjective/multiround.py +0 -0
  2041. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/teval/evaluators/__init__.py +0 -0
  2042. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/teval/evaluators/instruct_evaluator.py +0 -0
  2043. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/teval/evaluators/planning_evaluator.py +0 -0
  2044. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/teval/evaluators/reason_retrieve_understand_evaluator.py +0 -0
  2045. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/teval/evaluators/review_evaluator.py +0 -0
  2046. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/teval/schema.py +0 -0
  2047. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/teval/utils/convert_results.py +0 -0
  2048. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/teval/utils/format_load.py +0 -0
  2049. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/teval/utils/meta_template.py +0 -0
  2050. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/teval/utils/template.py +0 -0
  2051. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/truthfulqa.py +0 -0
  2052. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/wnli.py +0 -0
  2053. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/xcopa.py +0 -0
  2054. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/datasets/xlsum.py +0 -0
  2055. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/metrics/__init__.py +0 -0
  2056. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/metrics/dump_results.py +0 -0
  2057. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/metrics/mme_score.py +0 -0
  2058. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/metrics/seedbench.py +0 -0
  2059. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/accessory.py +0 -0
  2060. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/ai360_api.py +0 -0
  2061. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/alaya.py +0 -0
  2062. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/baichuan_api.py +0 -0
  2063. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/baidu_api.py +0 -0
  2064. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/bytedance_api.py +0 -0
  2065. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/claude_api/__init__.py +0 -0
  2066. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/claude_api/claude_api.py +0 -0
  2067. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/claude_api/postprocessors.py +0 -0
  2068. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/deepseek_api.py +0 -0
  2069. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/doubao.py +0 -0
  2070. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/glm.py +0 -0
  2071. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/huggingface.py +0 -0
  2072. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/hunyuan_api.py +0 -0
  2073. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/intern_model.py +0 -0
  2074. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/krgpt_api.py +0 -0
  2075. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/lagent.py +0 -0
  2076. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/langchain.py +0 -0
  2077. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/lightllm_api.py +0 -0
  2078. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/llama2.py +0 -0
  2079. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/lmdeploy_pytorch.py +0 -0
  2080. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/lmdeploy_tis.py +0 -0
  2081. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/minimax_api.py +0 -0
  2082. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/mistral_api.py +0 -0
  2083. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/mixtral.py +0 -0
  2084. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/modelscope.py +0 -0
  2085. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/moonshot_api.py +0 -0
  2086. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/nanbeige_api.py +0 -0
  2087. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/pangu_api.py +0 -0
  2088. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/qwen_api.py +0 -0
  2089. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/sensetime_api.py +0 -0
  2090. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/stepfun_api.py +0 -0
  2091. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/turbomind_api.py +0 -0
  2092. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/turbomind_tis.py +0 -0
  2093. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/turbomind_with_tf_above_v4_33.py +0 -0
  2094. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/unigpt_api.py +0 -0
  2095. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/vllm.py +0 -0
  2096. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/vllm_with_tf_above_v4_33.py +0 -0
  2097. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/xunfei_api.py +0 -0
  2098. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/yayi_api.py +0 -0
  2099. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/yi_api.py +0 -0
  2100. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/zhipuai_api.py +0 -0
  2101. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/models/zhipuai_v2_api.py +0 -0
  2102. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/__init__.py +0 -0
  2103. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_dataset_reader.py +0 -0
  2104. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/icl_agent_evaluator.py +0 -0
  2105. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/icl_aucroc_evaluator.py +0 -0
  2106. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/icl_base_evaluator.py +0 -0
  2107. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/icl_bpc_evaluator.py +0 -0
  2108. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/icl_circular_evaluator.py +0 -0
  2109. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/icl_em_evaluator.py +0 -0
  2110. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/icl_hf_evaluator.py +0 -0
  2111. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/icl_jieba_rouge_evaluator.py +0 -0
  2112. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/icl_plugin_evaluator.py +0 -0
  2113. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/icl_toxic_evaluator.py +0 -0
  2114. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_evaluator/lm_evaluator.py +0 -0
  2115. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_agent_inferencer.py +0 -0
  2116. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_attack_inferencer.py +0 -0
  2117. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_base_inferencer.py +0 -0
  2118. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_chat_inferencer.py +0 -0
  2119. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_clp_inferencer.py +0 -0
  2120. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_gen_inferencer.py +0 -0
  2121. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_ll_inferencer.py +0 -0
  2122. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_mink_percent_inferencer.py +0 -0
  2123. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_ppl_inferencer.py +0 -0
  2124. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_ppl_only_inferencer.py +0 -0
  2125. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_sc_inferencer.py +0 -0
  2126. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_sw_ce_loss_inferencer.py +0 -0
  2127. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_inferencer/icl_tot_inferencer.py +0 -0
  2128. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_prompt_template.py +0 -0
  2129. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_retriever/__init__.py +0 -0
  2130. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_retriever/icl_bm25_retriever.py +0 -0
  2131. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_retriever/icl_dpp_retriever.py +0 -0
  2132. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_retriever/icl_fix_k_retriever.py +0 -0
  2133. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_retriever/icl_mdl_retriever.py +0 -0
  2134. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_retriever/icl_random_retriever.py +0 -0
  2135. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_retriever/icl_topk_retriever.py +0 -0
  2136. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_retriever/icl_votek_retriever.py +0 -0
  2137. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/icl_retriever/icl_zero_retriever.py +0 -0
  2138. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/utils/__init__.py +0 -0
  2139. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/openicl/utils/logging.py +0 -0
  2140. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/partitioners/__init__.py +0 -0
  2141. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/partitioners/naive.py +0 -0
  2142. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/partitioners/size.py +0 -0
  2143. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/partitioners/sub_naive.py +0 -0
  2144. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/partitioners/sub_num_worker.py +0 -0
  2145. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/partitioners/sub_size.py +0 -0
  2146. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/registry.py +0 -0
  2147. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/runners/base.py +0 -0
  2148. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/runners/dlc.py +0 -0
  2149. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/runners/local_api.py +0 -0
  2150. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/runners/slurm.py +0 -0
  2151. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/runners/slurm_sequential.py +0 -0
  2152. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/__init__.py +0 -0
  2153. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/circular.py +0 -0
  2154. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/llm_compression.py +0 -0
  2155. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/multi_faceted.py +0 -0
  2156. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/multi_model.py +0 -0
  2157. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/all_obj.py +0 -0
  2158. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/alpacaeval.py +0 -0
  2159. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/arenahard.py +0 -0
  2160. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/compass_arena.py +0 -0
  2161. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/compassbench.py +0 -0
  2162. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/corev2.py +0 -0
  2163. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/creationbench.py +0 -0
  2164. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/flames.py +0 -0
  2165. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/fofo.py +0 -0
  2166. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/mtbench.py +0 -0
  2167. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/mtbench101.py +0 -0
  2168. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/multiround.py +0 -0
  2169. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/subjective_post_process.py +0 -0
  2170. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/utils.py +0 -0
  2171. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/subjective/wildbench.py +0 -0
  2172. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/summarizers/summarizer_pretrain.py +0 -0
  2173. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/tasks/__init__.py +0 -0
  2174. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/tasks/base.py +0 -0
  2175. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/tasks/llm_eval.py +0 -0
  2176. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/abbr.py +0 -0
  2177. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/auxiliary.py +0 -0
  2178. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/build.py +0 -0
  2179. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/dependency.py +0 -0
  2180. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/file.py +0 -0
  2181. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/lark.py +0 -0
  2182. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/logging.py +0 -0
  2183. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/menu.py +0 -0
  2184. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/prompt.py +0 -0
  2185. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/run.py +0 -0
  2186. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/text_postprocessors.py +0 -0
  2187. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass/utils/types.py +0 -0
  2188. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass.egg-info/dependency_links.txt +0 -0
  2189. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass.egg-info/entry_points.txt +0 -0
  2190. {opencompass-0.2.6 → opencompass-0.3.0}/opencompass.egg-info/top_level.txt +0 -0
  2191. {opencompass-0.2.6 → opencompass-0.3.0}/setup.cfg +0 -0
@@ -0,0 +1,2 @@
1
+ recursive-include opencompass/configs *.py *.yml *.json *.txt *.md
2
+ recursive-include opencompass/openicl/icl_evaluator/hf_metrics *.py
@@ -0,0 +1,613 @@
1
+ Metadata-Version: 2.1
2
+ Name: opencompass
3
+ Version: 0.3.0
4
+ Summary: A comprehensive toolkit for large model evaluation
5
+ Home-page: https://github.com/open-compass/opencompass
6
+ Author: OpenCompass Contributors
7
+ Maintainer: OpenCompass Authors
8
+ License: Apache License 2.0
9
+ Description: <div align="center">
10
+ <img src="docs/en/_static/image/logo.svg" width="500px"/>
11
+ <br />
12
+ <br />
13
+
14
+ [![][github-release-shield]][github-release-link]
15
+ [![][github-releasedate-shield]][github-releasedate-link]
16
+ [![][github-contributors-shield]][github-contributors-link]<br>
17
+ [![][github-forks-shield]][github-forks-link]
18
+ [![][github-stars-shield]][github-stars-link]
19
+ [![][github-issues-shield]][github-issues-link]
20
+ [![][github-license-shield]][github-license-link]
21
+
22
+ <!-- [![PyPI](https://badge.fury.io/py/opencompass.svg)](https://pypi.org/project/opencompass/) -->
23
+
24
+ [🌐Website](https://opencompass.org.cn/) |
25
+ [📖CompassHub](https://hub.opencompass.org.cn/home) |
26
+ [📊CompassRank](https://rank.opencompass.org.cn/home) |
27
+ [📘Documentation](https://opencompass.readthedocs.io/en/latest/) |
28
+ [🛠️Installation](https://opencompass.readthedocs.io/en/latest/get_started/installation.html) |
29
+ [🤔Reporting Issues](https://github.com/open-compass/opencompass/issues/new/choose)
30
+
31
+ English | [简体中文](README_zh-CN.md)
32
+
33
+ [![][github-trending-shield]][github-trending-url]
34
+
35
+ </div>
36
+
37
+ <p align="center">
38
+ 👋 join us on <a href="https://discord.gg/KKwfEbFj7U" target="_blank">Discord</a> and <a href="https://r.vansin.top/?r=opencompass" target="_blank">WeChat</a>
39
+ </p>
40
+
41
+ > \[!IMPORTANT\]
42
+ >
43
+ > **Star Us**, You will receive all release notifications from GitHub without any delay ~ ⭐️
44
+
45
+ ## 📣 OpenCompass 2.0
46
+
47
+ We are thrilled to introduce OpenCompass 2.0, an advanced suite featuring three key components: [CompassKit](https://github.com/open-compass), [CompassHub](https://hub.opencompass.org.cn/home), and [CompassRank](https://rank.opencompass.org.cn/home).
48
+ ![oc20](https://github.com/tonysy/opencompass/assets/7881589/90dbe1c0-c323-470a-991e-2b37ab5350b2)
49
+
50
+ **CompassRank** has been significantly enhanced into the leaderboards that now incorporates both open-source benchmarks and proprietary benchmarks. This upgrade allows for a more comprehensive evaluation of models across the industry.
51
+
52
+ **CompassHub** presents a pioneering benchmark browser interface, designed to simplify and expedite the exploration and utilization of an extensive array of benchmarks for researchers and practitioners alike. To enhance the visibility of your own benchmark within the community, we warmly invite you to contribute it to CompassHub. You may initiate the submission process by clicking [here](https://hub.opencompass.org.cn/dataset-submit).
53
+
54
+ **CompassKit** is a powerful collection of evaluation toolkits specifically tailored for Large Language Models and Large Vision-language Models. It provides an extensive set of tools to assess and measure the performance of these complex models effectively. Welcome to try our toolkits for in your research and products.
55
+
56
+ <details>
57
+ <summary><kbd>Star History</kbd></summary>
58
+ <picture>
59
+ <source media="(prefers-color-scheme: dark)" srcset="https://api.star-history.com/svg?repos=open-compass%2Fopencompass&theme=dark&type=Date">
60
+ <img width="100%" src="https://api.star-history.com/svg?repos=open-compass%2Fopencompass&type=Date">
61
+ </picture>
62
+ </details>
63
+
64
+ ## 🧭 Welcome
65
+
66
+ to **OpenCompass**!
67
+
68
+ Just like a compass guides us on our journey, OpenCompass will guide you through the complex landscape of evaluating large language models. With its powerful algorithms and intuitive interface, OpenCompass makes it easy to assess the quality and effectiveness of your NLP models.
69
+
70
+ 🚩🚩🚩 Explore opportunities at OpenCompass! We're currently **hiring full-time researchers/engineers and interns**. If you're passionate about LLM and OpenCompass, don't hesitate to reach out to us via [email](mailto:zhangsongyang@pjlab.org.cn). We'd love to hear from you!
71
+
72
+ 🔥🔥🔥 We are delighted to announce that **the OpenCompass has been recommended by the Meta AI**, click [Get Started](https://ai.meta.com/llama/get-started/#validation) of Llama for more information.
73
+
74
+ > **Attention**<br />
75
+ > We launch the OpenCompass Collaboration project, welcome to support diverse evaluation benchmarks into OpenCompass!
76
+ > Clike [Issue](https://github.com/open-compass/opencompass/issues/248) for more information.
77
+ > Let's work together to build a more powerful OpenCompass toolkit!
78
+
79
+ ## 🚀 What's New <a><img width="35" height="20" src="https://user-images.githubusercontent.com/12782558/212848161-5e783dd6-11e8-4fe0-bbba-39ffb77730be.png"></a>
80
+
81
+ - **\[2024.08.01\]** We supported the [Gemma2](https://huggingface.co/collections/google/gemma-2-release-667d6600fd5220e7b967f315) models. Welcome to try! 🔥🔥🔥
82
+ - **\[2024.07.23\]** We supported the [ModelScope](www.modelscope.cn) datasets, you can load them on demand without downloading all the data to your local disk. Welcome to try! 🔥🔥🔥
83
+ - **\[2024.07.17\]** We have released the example data and configuration for the CompassBench-202408, welcome to [CompassBench](https://opencompass.readthedocs.io/zh-cn/latest/advanced_guides/compassbench_intro.html) for more details. 🔥🔥🔥
84
+ - **\[2024.07.17\]** We are excited to announce the release of NeedleBench's [technical report](http://arxiv.org/abs/2407.11963). We invite you to visit our [support documentation](https://opencompass.readthedocs.io/en/latest/advanced_guides/needleinahaystack_eval.html) for detailed evaluation guidelines. 🔥🔥🔥
85
+ - **\[2024.07.04\]** OpenCompass now supports InternLM2.5, which has **outstanding reasoning capability**, **1M Context window and** and **stronger tool use**, you can try the models in [OpenCompass Config](https://github.com/open-compass/opencompass/tree/main/configs/models/hf_internlm) and [InternLM](https://github.com/InternLM/InternLM) .🔥🔥🔥.
86
+ - **\[2024.06.20\]** OpenCompass now supports one-click switching between inference acceleration backends, enhancing the efficiency of the evaluation process. In addition to the default HuggingFace inference backend, it now also supports popular backends [LMDeploy](https://github.com/InternLM/lmdeploy) and [vLLM](https://github.com/vllm-project/vllm). This feature is available via a simple command-line switch and through deployment APIs. For detailed usage, see the [documentation](docs/en/advanced_guides/accelerator_intro.md).🔥🔥🔥.
87
+ - **\[2024.05.08\]** We supported the evaluation of 4 MoE models: [Mixtral-8x22B-v0.1](configs/models/mixtral/hf_mixtral_8x22b_v0_1.py), [Mixtral-8x22B-Instruct-v0.1](configs/models/mixtral/hf_mixtral_8x22b_instruct_v0_1.py), [Qwen1.5-MoE-A2.7B](configs/models/qwen/hf_qwen1_5_moe_a2_7b.py), [Qwen1.5-MoE-A2.7B-Chat](configs/models/qwen/hf_qwen1_5_moe_a2_7b_chat.py). Try them out now!
88
+ - **\[2024.04.30\]** We supported evaluating a model's compression efficiency by calculating its Bits per Character (BPC) metric on an [external corpora](configs/datasets/llm_compression/README.md) ([official paper](https://github.com/hkust-nlp/llm-compression-intelligence)). Check out the [llm-compression](configs/eval_llm_compression.py) evaluation config now! 🔥🔥🔥
89
+ - **\[2024.04.29\]** We report the performance of several famous LLMs on the common benchmarks, welcome to [documentation](https://opencompass.readthedocs.io/en/latest/user_guides/corebench.html) for more information! 🔥🔥🔥.
90
+ - **\[2024.04.26\]** We deprecated the multi-madality evaluating function from OpenCompass, related implement has moved to [VLMEvalKit](https://github.com/open-compass/VLMEvalKit), welcome to use! 🔥🔥🔥.
91
+ - **\[2024.04.26\]** We supported the evaluation of [ArenaHard](configs/eval_subjective_arena_hard.py) welcome to try!🔥🔥🔥.
92
+ - **\[2024.04.22\]** We supported the evaluation of [LLaMA3](configs/models/hf_llama/hf_llama3_8b.py) 和 [LLaMA3-Instruct](configs/models/hf_llama/hf_llama3_8b_instruct.py), welcome to try! 🔥🔥🔥
93
+ - **\[2024.02.29\]** We supported the MT-Bench, AlpacalEval and AlignBench, more information can be found [here](https://opencompass.readthedocs.io/en/latest/advanced_guides/subjective_evaluation.html)
94
+ - **\[2024.01.30\]** We release OpenCompass 2.0. Click [CompassKit](https://github.com/open-compass), [CompassHub](https://hub.opencompass.org.cn/home), and [CompassRank](https://rank.opencompass.org.cn/home) for more information !
95
+
96
+ > [More](docs/en/notes/news.md)
97
+
98
+ ## ✨ Introduction
99
+
100
+ ![image](https://github.com/open-compass/opencompass/assets/22607038/f45fe125-4aed-4f8c-8fe8-df4efb41a8ea)
101
+
102
+ OpenCompass is a one-stop platform for large model evaluation, aiming to provide a fair, open, and reproducible benchmark for large model evaluation. Its main features include:
103
+
104
+ - **Comprehensive support for models and datasets**: Pre-support for 20+ HuggingFace and API models, a model evaluation scheme of 70+ datasets with about 400,000 questions, comprehensively evaluating the capabilities of the models in five dimensions.
105
+
106
+ - **Efficient distributed evaluation**: One line command to implement task division and distributed evaluation, completing the full evaluation of billion-scale models in just a few hours.
107
+
108
+ - **Diversified evaluation paradigms**: Support for zero-shot, few-shot, and chain-of-thought evaluations, combined with standard or dialogue-type prompt templates, to easily stimulate the maximum performance of various models.
109
+
110
+ - **Modular design with high extensibility**: Want to add new models or datasets, customize an advanced task division strategy, or even support a new cluster management system? Everything about OpenCompass can be easily expanded!
111
+
112
+ - **Experiment management and reporting mechanism**: Use config files to fully record each experiment, and support real-time reporting of results.
113
+
114
+ ## 📊 Leaderboard
115
+
116
+ We provide [OpenCompass Leaderboard](https://rank.opencompass.org.cn/home) for the community to rank all public models and API models. If you would like to join the evaluation, please provide the model repository URL or a standard API interface to the email address `opencompass@pjlab.org.cn`.
117
+
118
+ <p align="right"><a href="#top">🔝Back to top</a></p>
119
+
120
+ ## 🛠️ Installation
121
+
122
+ Below are the steps for quick installation and datasets preparation.
123
+
124
+ ### 💻 Environment Setup
125
+
126
+ #### Open-source Models with GPU
127
+
128
+ ```bash
129
+ conda create --name opencompass python=3.10 pytorch torchvision pytorch-cuda -c nvidia -c pytorch -y
130
+ conda activate opencompass
131
+ git clone https://github.com/open-compass/opencompass opencompass
132
+ cd opencompass
133
+ pip install -e .
134
+ ```
135
+
136
+ #### API Models with CPU-only
137
+
138
+ ```bash
139
+ conda create -n opencompass python=3.10 pytorch torchvision torchaudio cpuonly -c pytorch -y
140
+ conda activate opencompass
141
+ git clone https://github.com/open-compass/opencompass opencompass
142
+ cd opencompass
143
+ pip install -e .
144
+ # also please install requirements packages via `pip install -r requirements/api.txt` for API models if needed.
145
+ ```
146
+
147
+ ### 📂 Data Preparation
148
+
149
+ You can download and extract the datasets with the following commands:
150
+
151
+ ```bash
152
+ # Download dataset to data/ folder
153
+ wget https://github.com/open-compass/opencompass/releases/download/0.2.2.rc1/OpenCompassData-core-20240207.zip
154
+ unzip OpenCompassData-core-20240207.zip
155
+ ```
156
+
157
+ Also, use the [ModelScope](www.modelscope.cn) to load the datasets on demand.
158
+
159
+ Installation:
160
+
161
+ ```bash
162
+ pip install modelscope
163
+ export DATASET_SOURCE=ModelScope
164
+ ```
165
+
166
+ Then submit the evaluation task without downloading all the data to your local disk. Available datasets include:
167
+
168
+ ```bash
169
+ humaneval, triviaqa, commonsenseqa, tydiqa, strategyqa, cmmlu, lambada, piqa, ceval, math, LCSTS, Xsum, winogrande, openbookqa, AGIEval, gsm8k, nq, race, siqa, mbpp, mmlu, hellaswag, ARC, BBH, xstory_cloze, summedits, GAOKAO-BENCH, OCNLI, cmnli
170
+ ```
171
+
172
+ Some third-party features, like Humaneval and Llama, may require additional steps to work properly, for detailed steps please refer to the [Installation Guide](https://opencompass.readthedocs.io/en/latest/get_started/installation.html).
173
+
174
+ <p align="right"><a href="#top">🔝Back to top</a></p>
175
+
176
+ ## 🏗️ ️Evaluation
177
+
178
+ After ensuring that OpenCompass is installed correctly according to the above steps and the datasets are prepared, you can evaluate the performance of the LLaMA-7b model on the MMLU and C-Eval datasets using the following command:
179
+
180
+ ```bash
181
+ python run.py --models hf_llama_7b --datasets mmlu_ppl ceval_ppl
182
+ ```
183
+
184
+ Additionally, if you want to use an inference backend other than HuggingFace for accelerated evaluation, such as LMDeploy or vLLM, you can do so with the command below. Please ensure that you have installed the necessary packages for the chosen backend and that your model supports accelerated inference with it. For more information, see the documentation on inference acceleration backends [here](docs/en/advanced_guides/accelerator_intro.md). Below is an example using LMDeploy:
185
+
186
+ ```bash
187
+ python run.py --models hf_llama_7b --datasets mmlu_ppl ceval_ppl -a lmdeploy
188
+ ```
189
+
190
+ OpenCompass has predefined configurations for many models and datasets. You can list all available model and dataset configurations using the [tools](./docs/en/tools.md#list-configs).
191
+
192
+ ```bash
193
+ # List all configurations
194
+ python tools/list_configs.py
195
+ # List all configurations related to llama and mmlu
196
+ python tools/list_configs.py llama mmlu
197
+ ```
198
+
199
+ You can also evaluate other HuggingFace models via command line. Taking LLaMA-7b as an example:
200
+
201
+ ```bash
202
+ python run.py --datasets ceval_ppl mmlu_ppl --hf-type base --hf-path huggyllama/llama-7b
203
+ ```
204
+
205
+ > \[!TIP\]
206
+ >
207
+ > configuration with `_ppl` is designed for base model typically.
208
+ > configuration with `_gen` can be used for both base model and chat model.
209
+
210
+ Through the command line or configuration files, OpenCompass also supports evaluating APIs or custom models, as well as more diversified evaluation strategies. Please read the [Quick Start](https://opencompass.readthedocs.io/en/latest/get_started/quick_start.html) to learn how to run an evaluation task.
211
+
212
+ <p align="right"><a href="#top">🔝Back to top</a></p>
213
+
214
+ ## 📖 Dataset Support
215
+
216
+ <table align="center">
217
+ <tbody>
218
+ <tr align="center" valign="bottom">
219
+ <td>
220
+ <b>Language</b>
221
+ </td>
222
+ <td>
223
+ <b>Knowledge</b>
224
+ </td>
225
+ <td>
226
+ <b>Reasoning</b>
227
+ </td>
228
+ <td>
229
+ <b>Examination</b>
230
+ </td>
231
+ </tr>
232
+ <tr valign="top">
233
+ <td>
234
+ <details open>
235
+ <summary><b>Word Definition</b></summary>
236
+
237
+ - WiC
238
+ - SummEdits
239
+
240
+ </details>
241
+
242
+ <details open>
243
+ <summary><b>Idiom Learning</b></summary>
244
+
245
+ - CHID
246
+
247
+ </details>
248
+
249
+ <details open>
250
+ <summary><b>Semantic Similarity</b></summary>
251
+
252
+ - AFQMC
253
+ - BUSTM
254
+
255
+ </details>
256
+
257
+ <details open>
258
+ <summary><b>Coreference Resolution</b></summary>
259
+
260
+ - CLUEWSC
261
+ - WSC
262
+ - WinoGrande
263
+
264
+ </details>
265
+
266
+ <details open>
267
+ <summary><b>Translation</b></summary>
268
+
269
+ - Flores
270
+ - IWSLT2017
271
+
272
+ </details>
273
+
274
+ <details open>
275
+ <summary><b>Multi-language Question Answering</b></summary>
276
+
277
+ - TyDi-QA
278
+ - XCOPA
279
+
280
+ </details>
281
+
282
+ <details open>
283
+ <summary><b>Multi-language Summary</b></summary>
284
+
285
+ - XLSum
286
+
287
+ </details>
288
+ </td>
289
+ <td>
290
+ <details open>
291
+ <summary><b>Knowledge Question Answering</b></summary>
292
+
293
+ - BoolQ
294
+ - CommonSenseQA
295
+ - NaturalQuestions
296
+ - TriviaQA
297
+
298
+ </details>
299
+ </td>
300
+ <td>
301
+ <details open>
302
+ <summary><b>Textual Entailment</b></summary>
303
+
304
+ - CMNLI
305
+ - OCNLI
306
+ - OCNLI_FC
307
+ - AX-b
308
+ - AX-g
309
+ - CB
310
+ - RTE
311
+ - ANLI
312
+
313
+ </details>
314
+
315
+ <details open>
316
+ <summary><b>Commonsense Reasoning</b></summary>
317
+
318
+ - StoryCloze
319
+ - COPA
320
+ - ReCoRD
321
+ - HellaSwag
322
+ - PIQA
323
+ - SIQA
324
+
325
+ </details>
326
+
327
+ <details open>
328
+ <summary><b>Mathematical Reasoning</b></summary>
329
+
330
+ - MATH
331
+ - GSM8K
332
+
333
+ </details>
334
+
335
+ <details open>
336
+ <summary><b>Theorem Application</b></summary>
337
+
338
+ - TheoremQA
339
+ - StrategyQA
340
+ - SciBench
341
+
342
+ </details>
343
+
344
+ <details open>
345
+ <summary><b>Comprehensive Reasoning</b></summary>
346
+
347
+ - BBH
348
+
349
+ </details>
350
+ </td>
351
+ <td>
352
+ <details open>
353
+ <summary><b>Junior High, High School, University, Professional Examinations</b></summary>
354
+
355
+ - C-Eval
356
+ - AGIEval
357
+ - MMLU
358
+ - GAOKAO-Bench
359
+ - CMMLU
360
+ - ARC
361
+ - Xiezhi
362
+
363
+ </details>
364
+
365
+ <details open>
366
+ <summary><b>Medical Examinations</b></summary>
367
+
368
+ - CMB
369
+
370
+ </details>
371
+ </td>
372
+ </tr>
373
+ </td>
374
+ </tr>
375
+ </tbody>
376
+ <tbody>
377
+ <tr align="center" valign="bottom">
378
+ <td>
379
+ <b>Understanding</b>
380
+ </td>
381
+ <td>
382
+ <b>Long Context</b>
383
+ </td>
384
+ <td>
385
+ <b>Safety</b>
386
+ </td>
387
+ <td>
388
+ <b>Code</b>
389
+ </td>
390
+ </tr>
391
+ <tr valign="top">
392
+ <td>
393
+ <details open>
394
+ <summary><b>Reading Comprehension</b></summary>
395
+
396
+ - C3
397
+ - CMRC
398
+ - DRCD
399
+ - MultiRC
400
+ - RACE
401
+ - DROP
402
+ - OpenBookQA
403
+ - SQuAD2.0
404
+
405
+ </details>
406
+
407
+ <details open>
408
+ <summary><b>Content Summary</b></summary>
409
+
410
+ - CSL
411
+ - LCSTS
412
+ - XSum
413
+ - SummScreen
414
+
415
+ </details>
416
+
417
+ <details open>
418
+ <summary><b>Content Analysis</b></summary>
419
+
420
+ - EPRSTMT
421
+ - LAMBADA
422
+ - TNEWS
423
+
424
+ </details>
425
+ </td>
426
+ <td>
427
+ <details open>
428
+ <summary><b>Long Context Understanding</b></summary>
429
+
430
+ - LEval
431
+ - LongBench
432
+ - GovReports
433
+ - NarrativeQA
434
+ - Qasper
435
+
436
+ </details>
437
+ </td>
438
+ <td>
439
+ <details open>
440
+ <summary><b>Safety</b></summary>
441
+
442
+ - CivilComments
443
+ - CrowsPairs
444
+ - CValues
445
+ - JigsawMultilingual
446
+ - TruthfulQA
447
+
448
+ </details>
449
+ <details open>
450
+ <summary><b>Robustness</b></summary>
451
+
452
+ - AdvGLUE
453
+
454
+ </details>
455
+ </td>
456
+ <td>
457
+ <details open>
458
+ <summary><b>Code</b></summary>
459
+
460
+ - HumanEval
461
+ - HumanEvalX
462
+ - MBPP
463
+ - APPs
464
+ - DS1000
465
+
466
+ </details>
467
+ </td>
468
+ </tr>
469
+ </td>
470
+ </tr>
471
+ </tbody>
472
+ </table>
473
+
474
+ ## 📖 Model Support
475
+
476
+ <table align="center">
477
+ <tbody>
478
+ <tr align="center" valign="bottom">
479
+ <td>
480
+ <b>Open-source Models</b>
481
+ </td>
482
+ <td>
483
+ <b>API Models</b>
484
+ </td>
485
+ <!-- <td>
486
+ <b>Custom Models</b>
487
+ </td> -->
488
+ </tr>
489
+ <tr valign="top">
490
+ <td>
491
+
492
+ - [Alpaca](https://github.com/tatsu-lab/stanford_alpaca)
493
+ - [Baichuan](https://github.com/baichuan-inc)
494
+ - [BlueLM](https://github.com/vivo-ai-lab/BlueLM)
495
+ - [ChatGLM2](https://github.com/THUDM/ChatGLM2-6B)
496
+ - [ChatGLM3](https://github.com/THUDM/ChatGLM3-6B)
497
+ - [Gemma](https://huggingface.co/google/gemma-7b)
498
+ - [InternLM](https://github.com/InternLM/InternLM)
499
+ - [LLaMA](https://github.com/facebookresearch/llama)
500
+ - [LLaMA3](https://github.com/meta-llama/llama3)
501
+ - [Qwen](https://github.com/QwenLM/Qwen)
502
+ - [TigerBot](https://github.com/TigerResearch/TigerBot)
503
+ - [Vicuna](https://github.com/lm-sys/FastChat)
504
+ - [WizardLM](https://github.com/nlpxucan/WizardLM)
505
+ - [Yi](https://github.com/01-ai/Yi)
506
+ - ……
507
+
508
+ </td>
509
+ <td>
510
+
511
+ - OpenAI
512
+ - Gemini
513
+ - Claude
514
+ - ZhipuAI(ChatGLM)
515
+ - Baichuan
516
+ - ByteDance(YunQue)
517
+ - Huawei(PanGu)
518
+ - 360
519
+ - Baidu(ERNIEBot)
520
+ - MiniMax(ABAB-Chat)
521
+ - SenseTime(nova)
522
+ - Xunfei(Spark)
523
+ - ……
524
+
525
+ </td>
526
+
527
+ </tr>
528
+ </tbody>
529
+ </table>
530
+
531
+ <p align="right"><a href="#top">🔝Back to top</a></p>
532
+
533
+ ## 🔜 Roadmap
534
+
535
+ - [x] Subjective Evaluation
536
+ - [ ] Release CompassAreana
537
+ - [x] Subjective evaluation.
538
+ - [x] Long-context
539
+ - [x] Long-context evaluation with extensive datasets.
540
+ - [ ] Long-context leaderboard.
541
+ - [x] Coding
542
+ - [ ] Coding evaluation leaderboard.
543
+ - [x] Non-python language evaluation service.
544
+ - [x] Agent
545
+ - [ ] Support various agenet framework.
546
+ - [x] Evaluation of tool use of the LLMs.
547
+ - [x] Robustness
548
+ - [x] Support various attack method
549
+
550
+ ## 👷‍♂️ Contributing
551
+
552
+ We appreciate all contributions to improving OpenCompass. Please refer to the [contributing guideline](https://opencompass.readthedocs.io/en/latest/notes/contribution_guide.html) for the best practice.
553
+
554
+ <!-- Copy-paste in your Readme.md file -->
555
+
556
+ <!-- Made with [OSS Insight](https://ossinsight.io/) -->
557
+
558
+ <a href="https://github.com/open-compass/opencompass/graphs/contributors" target="_blank">
559
+ <table>
560
+ <tr>
561
+ <th colspan="2">
562
+ <br><img src="https://contrib.rocks/image?repo=open-compass/opencompass"><br><br>
563
+ </th>
564
+ </tr>
565
+ </table>
566
+ </a>
567
+
568
+ ## 🤝 Acknowledgements
569
+
570
+ Some code in this project is cited and modified from [OpenICL](https://github.com/Shark-NLP/OpenICL).
571
+
572
+ Some datasets and prompt implementations are modified from [chain-of-thought-hub](https://github.com/FranxYao/chain-of-thought-hub) and [instruct-eval](https://github.com/declare-lab/instruct-eval).
573
+
574
+ ## 🖊️ Citation
575
+
576
+ ```bibtex
577
+ @misc{2023opencompass,
578
+ title={OpenCompass: A Universal Evaluation Platform for Foundation Models},
579
+ author={OpenCompass Contributors},
580
+ howpublished = {\url{https://github.com/open-compass/opencompass}},
581
+ year={2023}
582
+ }
583
+ ```
584
+
585
+ <p align="right"><a href="#top">🔝Back to top</a></p>
586
+
587
+ [github-contributors-link]: https://github.com/open-compass/opencompass/graphs/contributors
588
+ [github-contributors-shield]: https://img.shields.io/github/contributors/open-compass/opencompass?color=c4f042&labelColor=black&style=flat-square
589
+ [github-forks-link]: https://github.com/open-compass/opencompass/network/members
590
+ [github-forks-shield]: https://img.shields.io/github/forks/open-compass/opencompass?color=8ae8ff&labelColor=black&style=flat-square
591
+ [github-issues-link]: https://github.com/open-compass/opencompass/issues
592
+ [github-issues-shield]: https://img.shields.io/github/issues/open-compass/opencompass?color=ff80eb&labelColor=black&style=flat-square
593
+ [github-license-link]: https://github.com/open-compass/opencompass/blob/main/LICENSE
594
+ [github-license-shield]: https://img.shields.io/github/license/open-compass/opencompass?color=white&labelColor=black&style=flat-square
595
+ [github-release-link]: https://github.com/open-compass/opencompass/releases
596
+ [github-release-shield]: https://img.shields.io/github/v/release/open-compass/opencompass?color=369eff&labelColor=black&logo=github&style=flat-square
597
+ [github-releasedate-link]: https://github.com/open-compass/opencompass/releases
598
+ [github-releasedate-shield]: https://img.shields.io/github/release-date/open-compass/opencompass?labelColor=black&style=flat-square
599
+ [github-stars-link]: https://github.com/open-compass/opencompass/stargazers
600
+ [github-stars-shield]: https://img.shields.io/github/stars/open-compass/opencompass?color=ffcb47&labelColor=black&style=flat-square
601
+ [github-trending-shield]: https://trendshift.io/api/badge/repositories/6630
602
+ [github-trending-url]: https://trendshift.io/repositories/6630
603
+
604
+ Keywords: AI,NLP,in-context learning,large language model,evaluation,benchmark,llm
605
+ Platform: UNKNOWN
606
+ Classifier: Programming Language :: Python :: 3.8
607
+ Classifier: Programming Language :: Python :: 3.9
608
+ Classifier: Programming Language :: Python :: 3.10
609
+ Classifier: Intended Audience :: Developers
610
+ Classifier: Intended Audience :: Education
611
+ Classifier: Intended Audience :: Science/Research
612
+ Requires-Python: >=3.8.0
613
+ Description-Content-Type: text/markdown