@researai/deepscientist 1.5.17 → 1.6.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/AGENTS.md +309 -130
- package/AISB/catalog/aisb.b1.agentic_coding.yaml +244 -0
- package/AISB/catalog/aisb.b10.climate_earth.yaml +235 -0
- package/AISB/catalog/aisb.b11.model_efficiency.yaml +231 -0
- package/AISB/catalog/aisb.b12.embodied_ai.yaml +238 -0
- package/AISB/catalog/aisb.b2.agent_systems.yaml +229 -0
- package/AISB/catalog/aisb.b3.self_evolving_rl.yaml +237 -0
- package/AISB/catalog/aisb.b4.lm_reasoning.yaml +240 -0
- package/AISB/catalog/aisb.b5.math_proof.yaml +235 -0
- package/AISB/catalog/aisb.b6.research_process.yaml +243 -0
- package/AISB/catalog/aisb.b7.multimodal_fusion.yaml +232 -0
- package/AISB/catalog/aisb.b8.lifesci_drug.yaml +275 -0
- package/AISB/catalog/aisb.b9.material_science.yaml +237 -0
- package/AISB/catalog/aisb.t3.001_savvy.yaml +159 -0
- package/AISB/catalog/aisb.t3.001_savvy.zh.yaml +121 -0
- package/AISB/catalog/aisb.t3.002_pinet.yaml +189 -0
- package/AISB/catalog/aisb.t3.002_pinet.zh.yaml +130 -0
- package/AISB/catalog/aisb.t3.004_decentralattn.yaml +184 -0
- package/AISB/catalog/aisb.t3.004_decentralattn.zh.yaml +153 -0
- package/AISB/catalog/aisb.t3.005_tsae.yaml +193 -0
- package/AISB/catalog/aisb.t3.005_tsae.zh.yaml +139 -0
- package/AISB/catalog/aisb.t3.006_physense.yaml +194 -0
- package/AISB/catalog/aisb.t3.006_physense.zh.yaml +118 -0
- package/AISB/catalog/aisb.t3.007_reasoningiqa.yaml +169 -0
- package/AISB/catalog/aisb.t3.007_reasoningiqa.zh.yaml +133 -0
- package/AISB/catalog/aisb.t3.008_meanflows.yaml +188 -0
- package/AISB/catalog/aisb.t3.008_meanflows.zh.yaml +140 -0
- package/AISB/catalog/aisb.t3.009_scoremissing.yaml +179 -0
- package/AISB/catalog/aisb.t3.009_scoremissing.zh.yaml +119 -0
- package/AISB/catalog/aisb.t3.010_suitabilityfilter.yaml +221 -0
- package/AISB/catalog/aisb.t3.010_suitabilityfilter.zh.yaml +141 -0
- package/AISB/catalog/aisb.t3.011_osd.yaml +206 -0
- package/AISB/catalog/aisb.t3.011_osd.zh.yaml +163 -0
- package/AISB/catalog/aisb.t3.012_efficientqat.yaml +206 -0
- package/AISB/catalog/aisb.t3.012_efficientqat.zh.yaml +159 -0
- package/AISB/catalog/aisb.t3.013_appl.yaml +152 -0
- package/AISB/catalog/aisb.t3.013_appl.zh.yaml +126 -0
- package/AISB/catalog/aisb.t3.014_piguard.yaml +207 -0
- package/AISB/catalog/aisb.t3.014_piguard.zh.yaml +164 -0
- package/AISB/catalog/aisb.t3.015_frspec.yaml +209 -0
- package/AISB/catalog/aisb.t3.015_frspec.zh.yaml +163 -0
- package/AISB/catalog/aisb.t3.016_mathfusion.yaml +166 -0
- package/AISB/catalog/aisb.t3.016_mathfusion.zh.yaml +145 -0
- package/AISB/catalog/aisb.t3.017_multimodalglp.yaml +171 -0
- package/AISB/catalog/aisb.t3.017_multimodalglp.zh.yaml +122 -0
- package/AISB/catalog/aisb.t3.018_cotsynth.yaml +206 -0
- package/AISB/catalog/aisb.t3.018_cotsynth.zh.yaml +162 -0
- package/AISB/catalog/aisb.t3.019_dyscaleut.yaml +211 -0
- package/AISB/catalog/aisb.t3.019_dyscaleut.zh.yaml +148 -0
- package/AISB/catalog/aisb.t3.020_aristotle.yaml +173 -0
- package/AISB/catalog/aisb.t3.020_aristotle.zh.yaml +119 -0
- package/AISB/catalog/aisb.t3.021_tokenrecycling.yaml +160 -0
- package/AISB/catalog/aisb.t3.021_tokenrecycling.zh.yaml +129 -0
- package/AISB/catalog/aisb.t3.022_chainofreasoning.yaml +204 -0
- package/AISB/catalog/aisb.t3.022_chainofreasoning.zh.yaml +161 -0
- package/AISB/catalog/aisb.t3.023_guidedembed.yaml +211 -0
- package/AISB/catalog/aisb.t3.023_guidedembed.zh.yaml +189 -0
- package/AISB/catalog/aisb.t3.024_outputcentric.yaml +148 -0
- package/AISB/catalog/aisb.t3.024_outputcentric.zh.yaml +131 -0
- package/AISB/catalog/aisb.t3.025_deeper.yaml +143 -0
- package/AISB/catalog/aisb.t3.025_deeper.zh.yaml +116 -0
- package/AISB/catalog/aisb.t3.026_gartkg.yaml +195 -0
- package/AISB/catalog/aisb.t3.026_gartkg.zh.yaml +127 -0
- package/AISB/catalog/aisb.t3.027_citeeval.yaml +182 -0
- package/AISB/catalog/aisb.t3.027_citeeval.zh.yaml +135 -0
- package/AISB/catalog/aisb.t3.028_sbam.yaml +206 -0
- package/AISB/catalog/aisb.t3.028_sbam.zh.yaml +166 -0
- package/AISB/catalog/aisb.t3.029_cdqgeoembed.yaml +224 -0
- package/AISB/catalog/aisb.t3.029_cdqgeoembed.zh.yaml +142 -0
- package/AISB/catalog/aisb.t3.030_processrm.yaml +211 -0
- package/AISB/catalog/aisb.t3.030_processrm.zh.yaml +166 -0
- package/AISB/catalog/aisb.t3.031_circuitstability.yaml +172 -0
- package/AISB/catalog/aisb.t3.031_circuitstability.zh.yaml +134 -0
- package/AISB/catalog/aisb.t3.032_ptsolver.yaml +169 -0
- package/AISB/catalog/aisb.t3.032_ptsolver.zh.yaml +135 -0
- package/AISB/catalog/aisb.t3.033_gcse.yaml +144 -0
- package/AISB/catalog/aisb.t3.033_gcse.zh.yaml +126 -0
- package/AISB/catalog/aisb.t3.034_ensemblewm.yaml +183 -0
- package/AISB/catalog/aisb.t3.034_ensemblewm.zh.yaml +146 -0
- package/AISB/catalog/aisb.t3.035_moralvalueswa.yaml +207 -0
- package/AISB/catalog/aisb.t3.035_moralvalueswa.zh.yaml +165 -0
- package/AISB/catalog/aisb.t3.036_weakstrongpref.yaml +210 -0
- package/AISB/catalog/aisb.t3.036_weakstrongpref.zh.yaml +194 -0
- package/AISB/catalog/aisb.t3.037_dementiamask.yaml +172 -0
- package/AISB/catalog/aisb.t3.037_dementiamask.zh.yaml +132 -0
- package/AISB/catalog/aisb.t3.038_tinysam.yaml +284 -0
- package/AISB/catalog/aisb.t3.038_tinysam.zh.yaml +240 -0
- package/AISB/catalog/aisb.t3.039_calf.yaml +224 -0
- package/AISB/catalog/aisb.t3.039_calf.zh.yaml +194 -0
- package/AISB/catalog/aisb.t3.040_graniteguardian.yaml +199 -0
- package/AISB/catalog/aisb.t3.040_graniteguardian.zh.yaml +174 -0
- package/AISB/catalog/aisb.t3.041_amdm.yaml +149 -0
- package/AISB/catalog/aisb.t3.041_amdm.zh.yaml +137 -0
- package/AISB/catalog/aisb.t3.042_xpatch.yaml +216 -0
- package/AISB/catalog/aisb.t3.042_xpatch.zh.yaml +182 -0
- package/AISB/catalog/aisb.t3.043_vhm.yaml +268 -0
- package/AISB/catalog/aisb.t3.043_vhm.zh.yaml +193 -0
- package/AISB/catalog/aisb.t3.044_rgvi.yaml +224 -0
- package/AISB/catalog/aisb.t3.044_rgvi.zh.yaml +176 -0
- package/AISB/catalog/aisb.t3.045_pslstm.yaml +203 -0
- package/AISB/catalog/aisb.t3.045_pslstm.zh.yaml +179 -0
- package/AISB/catalog/aisb.t3.046_nonstatts.yaml +208 -0
- package/AISB/catalog/aisb.t3.046_nonstatts.zh.yaml +194 -0
- package/AISB/catalog/aisb.t3.047_timepfn.yaml +156 -0
- package/AISB/catalog/aisb.t3.047_timepfn.zh.yaml +124 -0
- package/AISB/catalog/aisb.t3.048_proxyspex.yaml +148 -0
- package/AISB/catalog/aisb.t3.048_proxyspex.zh.yaml +125 -0
- package/AISB/catalog/aisb.t3.049_hogwildinference.yaml +183 -0
- package/AISB/catalog/aisb.t3.049_hogwildinference.zh.yaml +138 -0
- package/AISB/catalog/aisb.t3.050_causalpfn.yaml +214 -0
- package/AISB/catalog/aisb.t3.050_causalpfn.zh.yaml +190 -0
- package/AISB/catalog/aisb.t3.051_flashtp.yaml +169 -0
- package/AISB/catalog/aisb.t3.051_flashtp.zh.yaml +124 -0
- package/AISB/catalog/aisb.t3.052_nsdiff.yaml +155 -0
- package/AISB/catalog/aisb.t3.052_nsdiff.zh.yaml +138 -0
- package/AISB/catalog/aisb.t3.053_k2vae.yaml +158 -0
- package/AISB/catalog/aisb.t3.053_k2vae.zh.yaml +132 -0
- package/AISB/catalog/aisb.t3.054_timebase.yaml +178 -0
- package/AISB/catalog/aisb.t3.054_timebase.zh.yaml +158 -0
- package/AISB/catalog/aisb.t3.055_csbrain.yaml +238 -0
- package/AISB/catalog/aisb.t3.055_csbrain.zh.yaml +184 -0
- package/AISB/catalog/aisb.t3.056_infosam.yaml +224 -0
- package/AISB/catalog/aisb.t3.056_infosam.zh.yaml +189 -0
- package/AISB/catalog/aisb.t3.057_mdreid.yaml +129 -0
- package/AISB/catalog/aisb.t3.057_mdreid.zh.yaml +117 -0
- package/AISB/catalog/aisb.t3.058_mindglitch.yaml +171 -0
- package/AISB/catalog/aisb.t3.058_mindglitch.zh.yaml +145 -0
- package/AISB/catalog/aisb.t3.059_selfsupervised.yaml +154 -0
- package/AISB/catalog/aisb.t3.059_selfsupervised.zh.yaml +125 -0
- package/AISB/catalog/aisb.t3.060_iaggad.yaml +121 -0
- package/AISB/catalog/aisb.t3.060_iaggad.zh.yaml +100 -0
- package/AISB/catalog/aisb.t3.061_hsgkn.yaml +136 -0
- package/AISB/catalog/aisb.t3.061_hsgkn.zh.yaml +113 -0
- package/AISB/catalog/aisb.t3.062_visionts.yaml +237 -0
- package/AISB/catalog/aisb.t3.062_visionts.zh.yaml +216 -0
- package/AISB/catalog/aisb.t3.063_tsrag.yaml +162 -0
- package/AISB/catalog/aisb.t3.063_tsrag.zh.yaml +138 -0
- package/AISB/catalog/aisb.t3.064_pir.yaml +221 -0
- package/AISB/catalog/aisb.t3.064_pir.zh.yaml +197 -0
- package/AISB/catalog/aisb.t3.065_proteinbinding.yaml +234 -0
- package/AISB/catalog/aisb.t3.065_proteinbinding.zh.yaml +167 -0
- package/AISB/catalog/aisb.t3.066_tropicalattention.yaml +267 -0
- package/AISB/catalog/aisb.t3.066_tropicalattention.zh.yaml +229 -0
- package/AISB/catalog/aisb.t3.067_kanad.yaml +193 -0
- package/AISB/catalog/aisb.t3.067_kanad.zh.yaml +167 -0
- package/AISB/catalog/aisb.t3.068_sempo.yaml +187 -0
- package/AISB/catalog/aisb.t3.068_sempo.zh.yaml +148 -0
- package/AISB/catalog/aisb.t3.069_treehfd.yaml +129 -0
- package/AISB/catalog/aisb.t3.069_treehfd.zh.yaml +111 -0
- package/AISB/catalog/aisb.t3.070_certifiedunlearning.yaml +224 -0
- package/AISB/catalog/aisb.t3.070_certifiedunlearning.zh.yaml +171 -0
- package/AISB/catalog/aisb.t3.071_neuralmjd.yaml +142 -0
- package/AISB/catalog/aisb.t3.071_neuralmjd.zh.yaml +120 -0
- package/AISB/catalog/aisb.t3.072_fedgmt.yaml +181 -0
- package/AISB/catalog/aisb.t3.072_fedgmt.zh.yaml +158 -0
- package/AISB/catalog/aisb.t3.073_rld.yaml +161 -0
- package/AISB/catalog/aisb.t3.073_rld.zh.yaml +129 -0
- package/AISB/catalog/aisb.t3.074_lsvi.yaml +163 -0
- package/AISB/catalog/aisb.t3.074_lsvi.zh.yaml +129 -0
- package/AISB/catalog/aisb.t3.075_treeslicedentropy.yaml +201 -0
- package/AISB/catalog/aisb.t3.075_treeslicedentropy.zh.yaml +148 -0
- package/AISB/catalog/aisb.t3.076_aanet.yaml +169 -0
- package/AISB/catalog/aisb.t3.076_aanet.zh.yaml +129 -0
- package/AISB/catalog/aisb.t3.077_cmnn.yaml +199 -0
- package/AISB/catalog/aisb.t3.077_cmnn.zh.yaml +165 -0
- package/AISB/catalog/aisb.t3.078_conformalanomaly.yaml +146 -0
- package/AISB/catalog/aisb.t3.078_conformalanomaly.zh.yaml +117 -0
- package/AISB/catalog/aisb.t3.079_dpfkmeans.yaml +131 -0
- package/AISB/catalog/aisb.t3.079_dpfkmeans.zh.yaml +104 -0
- package/AISB/catalog/aisb.t3.080_latentscorereweight.yaml +169 -0
- package/AISB/catalog/aisb.t3.080_latentscorereweight.zh.yaml +123 -0
- package/AISB/catalog/aisb.t3.081_qmamba.yaml +150 -0
- package/AISB/catalog/aisb.t3.081_qmamba.zh.yaml +117 -0
- package/AISB/catalog/aisb.t3.082_onlinellmrouting.yaml +160 -0
- package/AISB/catalog/aisb.t3.082_onlinellmrouting.zh.yaml +133 -0
- package/AISB/catalog/aisb.t3.083_starformer.yaml +178 -0
- package/AISB/catalog/aisb.t3.083_starformer.zh.yaml +140 -0
- package/AISB/catalog/aisb.t3.084_ift.yaml +139 -0
- package/AISB/catalog/aisb.t3.084_ift.zh.yaml +111 -0
- package/AISB/catalog/aisb.t3.085_neuralsurv.yaml +183 -0
- package/AISB/catalog/aisb.t3.085_neuralsurv.zh.yaml +143 -0
- package/AISB/catalog/aisb.t3.086_stella.yaml +197 -0
- package/AISB/catalog/aisb.t3.086_stella.zh.yaml +142 -0
- package/AISB/catalog/aisb.t3.087_moses.yaml +167 -0
- package/AISB/catalog/aisb.t3.087_moses.zh.yaml +132 -0
- package/AISB/catalog/aisb.t3.088_channelnorm.yaml +140 -0
- package/AISB/catalog/aisb.t3.088_channelnorm.zh.yaml +109 -0
- package/AISB/catalog/aisb.t3.089_causalvelocity.yaml +730 -0
- package/AISB/catalog/aisb.t3.089_causalvelocity.zh.yaml +668 -0
- package/AISB/catalog/aisb.t3.090_rstib.yaml +144 -0
- package/AISB/catalog/aisb.t3.090_rstib.zh.yaml +109 -0
- package/AISB/catalog/aisb.t3.091_timeawarecausal.yaml +132 -0
- package/AISB/catalog/aisb.t3.091_timeawarecausal.zh.yaml +107 -0
- package/AISB/catalog/aisb.t3.092_kmeanslocalopt.yaml +138 -0
- package/AISB/catalog/aisb.t3.092_kmeanslocalopt.zh.yaml +110 -0
- package/AISB/catalog/aisb.t3.093_fedwmsam.yaml +134 -0
- package/AISB/catalog/aisb.t3.093_fedwmsam.zh.yaml +106 -0
- package/AISB/catalog/aisb.t3.094_boundre.yaml +147 -0
- package/AISB/catalog/aisb.t3.094_boundre.zh.yaml +114 -0
- package/AISB/catalog/aisb.t3.095_fastfeaturecp.yaml +153 -0
- package/AISB/catalog/aisb.t3.095_fastfeaturecp.zh.yaml +118 -0
- package/AISB/catalog/aisb.t3.096_m3svm.yaml +189 -0
- package/AISB/catalog/aisb.t3.096_m3svm.zh.yaml +149 -0
- package/AISB/catalog/aisb.t3.097_wassersteintl.yaml +212 -0
- package/AISB/catalog/aisb.t3.097_wassersteintl.zh.yaml +169 -0
- package/AISB/catalog/aisb.t3.098_xmahalanobis.yaml +171 -0
- package/AISB/catalog/aisb.t3.098_xmahalanobis.zh.yaml +127 -0
- package/AISB/catalog/aisb.t3.099_ollalanding.yaml +248 -0
- package/AISB/catalog/aisb.t3.099_ollalanding.zh.yaml +182 -0
- package/AISB/catalog/aisb.t3.100_invmissingdata.yaml +179 -0
- package/AISB/catalog/aisb.t3.100_invmissingdata.zh.yaml +150 -0
- package/AISB/catalog/aisb.t3.101_acia.yaml +164 -0
- package/AISB/catalog/aisb.t3.101_acia.zh.yaml +109 -0
- package/AISB/catalog/aisb.t3.102_stochasticff.yaml +178 -0
- package/AISB/catalog/aisb.t3.102_stochasticff.zh.yaml +130 -0
- package/AISB/catalog/aisb.t3.103_qdcp.yaml +150 -0
- package/AISB/catalog/aisb.t3.103_qdcp.zh.yaml +116 -0
- package/AISB/catalog/aisb.t3.104_balancedactiveinf.yaml +137 -0
- package/AISB/catalog/aisb.t3.104_balancedactiveinf.zh.yaml +104 -0
- package/AISB/catalog/aisb.t3.105_binaryclasseval.yaml +161 -0
- package/AISB/catalog/aisb.t3.105_binaryclasseval.zh.yaml +130 -0
- package/AISB/image/001_aisb.t3.001_savvy.jpg +0 -0
- package/AISB/image/002_aisb.t3.002_pinet.jpg +0 -0
- package/AISB/image/003_aisb.t3.003_dmsqd.jpg +0 -0
- package/AISB/image/004_aisb.t3.004_decentralattn.jpg +0 -0
- package/AISB/image/005_aisb.t3.005_tsae.jpg +0 -0
- package/AISB/image/006_aisb.t3.006_physense.jpg +0 -0
- package/AISB/image/007_aisb.t3.007_reasoningiqa.jpg +0 -0
- package/AISB/image/008_aisb.t3.008_meanflows.jpg +0 -0
- package/AISB/image/009_aisb.t3.009_scoremissing.jpg +0 -0
- package/AISB/image/010_aisb.t3.010_suitabilityfilter.jpg +0 -0
- package/AISB/image/011_aisb.t3.011_osd.jpg +0 -0
- package/AISB/image/012_aisb.t3.012_efficientqat.jpg +0 -0
- package/AISB/image/013_aisb.t3.013_appl.jpg +0 -0
- package/AISB/image/014_aisb.t3.014_piguard.jpg +0 -0
- package/AISB/image/015_aisb.t3.015_frspec.jpg +0 -0
- package/AISB/image/016_aisb.t3.016_mathfusion.jpg +0 -0
- package/AISB/image/017_aisb.t3.017_multimodalglp.jpg +0 -0
- package/AISB/image/018_aisb.t3.018_cotsynth.jpg +0 -0
- package/AISB/image/019_aisb.t3.019_dyscaleut.jpg +0 -0
- package/AISB/image/020_aisb.t3.020_aristotle.jpg +0 -0
- package/AISB/image/021_aisb.t3.021_tokenrecycling.jpg +0 -0
- package/AISB/image/022_aisb.t3.022_chainofreasoning.jpg +0 -0
- package/AISB/image/023_aisb.t3.023_guidedembed.jpg +0 -0
- package/AISB/image/024_aisb.t3.024_outputcentric.jpg +0 -0
- package/AISB/image/025_aisb.t3.025_deeper.jpg +0 -0
- package/AISB/image/026_aisb.t3.026_gartkg.jpg +0 -0
- package/AISB/image/027_aisb.t3.027_citeeval.jpg +0 -0
- package/AISB/image/028_aisb.t3.028_sbam.jpg +0 -0
- package/AISB/image/029_aisb.t3.029_cdqgeoembed.jpg +0 -0
- package/AISB/image/030_aisb.t3.030_processrm.jpg +0 -0
- package/AISB/image/031_aisb.t3.031_circuitstability.jpg +0 -0
- package/AISB/image/032_aisb.t3.032_ptsolver.jpg +0 -0
- package/AISB/image/033_aisb.t3.033_gcse.jpg +0 -0
- package/AISB/image/034_aisb.t3.034_ensemblewm.jpg +0 -0
- package/AISB/image/035_aisb.t3.035_moralvalueswa.jpg +0 -0
- package/AISB/image/036_aisb.t3.036_weakstrongpref.jpg +0 -0
- package/AISB/image/037_aisb.t3.037_dementiamask.jpg +0 -0
- package/AISB/image/038_aisb.t3.038_tinysam.jpg +0 -0
- package/AISB/image/039_aisb.t3.039_calf.jpg +0 -0
- package/AISB/image/040_aisb.t3.040_graniteguardian.jpg +0 -0
- package/AISB/image/041_aisb.t3.041_amdm.jpg +0 -0
- package/AISB/image/042_aisb.t3.042_xpatch.jpg +0 -0
- package/AISB/image/043_aisb.t3.043_vhm.jpg +0 -0
- package/AISB/image/044_aisb.t3.044_rgvi.jpg +0 -0
- package/AISB/image/045_aisb.t3.045_pslstm.jpg +0 -0
- package/AISB/image/046_aisb.t3.046_nonstatts.jpg +0 -0
- package/AISB/image/047_aisb.t3.047_timepfn.jpg +0 -0
- package/AISB/image/048_aisb.t3.048_proxyspex.jpg +0 -0
- package/AISB/image/049_aisb.t3.049_hogwildinference.jpg +0 -0
- package/AISB/image/050_aisb.t3.050_causalpfn.jpg +0 -0
- package/AISB/image/051_aisb.t3.051_flashtp.jpg +0 -0
- package/AISB/image/052_aisb.t3.052_nsdiff.jpg +0 -0
- package/AISB/image/053_aisb.t3.053_k2vae.jpg +0 -0
- package/AISB/image/054_aisb.t3.054_timebase.jpg +0 -0
- package/AISB/image/055_aisb.t3.055_csbrain.jpg +0 -0
- package/AISB/image/056_aisb.t3.056_infosam.jpg +0 -0
- package/AISB/image/057_aisb.t3.057_mdreid.jpg +0 -0
- package/AISB/image/058_aisb.t3.058_mindglitch.jpg +0 -0
- package/AISB/image/059_aisb.t3.059_selfsupervised.jpg +0 -0
- package/AISB/image/060_aisb.t3.060_iaggad.jpg +0 -0
- package/AISB/image/061_aisb.t3.061_hsgkn.jpg +0 -0
- package/AISB/image/062_aisb.t3.062_visionts.jpg +0 -0
- package/AISB/image/063_aisb.t3.063_tsrag.jpg +0 -0
- package/AISB/image/064_aisb.t3.064_pir.jpg +0 -0
- package/AISB/image/065_aisb.t3.065_proteinbinding.jpg +0 -0
- package/AISB/image/066_aisb.t3.066_tropicalattention.jpg +0 -0
- package/AISB/image/067_aisb.t3.067_kanad.jpg +0 -0
- package/AISB/image/068_aisb.t3.068_sempo.jpg +0 -0
- package/AISB/image/069_aisb.t3.069_treehfd.jpg +0 -0
- package/AISB/image/070_aisb.t3.070_certifiedunlearning.jpg +0 -0
- package/AISB/image/071_aisb.t3.071_neuralmjd.jpg +0 -0
- package/AISB/image/072_aisb.t3.072_fedgmt.jpg +0 -0
- package/AISB/image/073_aisb.t3.073_rld.jpg +0 -0
- package/AISB/image/074_aisb.t3.074_lsvi.jpg +0 -0
- package/AISB/image/075_aisb.t3.075_treeslicedentropy.jpg +0 -0
- package/AISB/image/076_aisb.t3.076_aanet.jpg +0 -0
- package/AISB/image/077_aisb.t3.077_cmnn.jpg +0 -0
- package/AISB/image/078_aisb.t3.078_conformalanomaly.jpg +0 -0
- package/AISB/image/079_aisb.t3.079_dpfkmeans.jpg +0 -0
- package/AISB/image/080_aisb.t3.080_latentscorereweight.jpg +0 -0
- package/AISB/image/081_aisb.t3.081_qmamba.jpg +0 -0
- package/AISB/image/082_aisb.t3.082_onlinellmrouting.jpg +0 -0
- package/AISB/image/083_aisb.t3.083_starformer.jpg +0 -0
- package/AISB/image/084_aisb.t3.084_ift.jpg +0 -0
- package/AISB/image/085_aisb.t3.085_neuralsurv.jpg +0 -0
- package/AISB/image/086_aisb.t3.086_stella.jpg +0 -0
- package/AISB/image/087_aisb.t3.087_moses.jpg +0 -0
- package/AISB/image/088_aisb.t3.088_channelnorm.jpg +0 -0
- package/AISB/image/089_aisb.t3.089_causalvelocity.jpg +0 -0
- package/AISB/image/090_aisb.t3.090_rstib.jpg +0 -0
- package/AISB/image/091_aisb.t3.091_timeawarecausal.jpg +0 -0
- package/AISB/image/092_aisb.t3.092_kmeanslocalopt.jpg +0 -0
- package/AISB/image/093_aisb.t3.093_fedwmsam.jpg +0 -0
- package/AISB/image/094_aisb.t3.094_boundre.jpg +0 -0
- package/AISB/image/095_aisb.t3.095_fastfeaturecp.jpg +0 -0
- package/AISB/image/096_aisb.t3.096_m3svm.jpg +0 -0
- package/AISB/image/097_aisb.t3.097_wassersteintl.jpg +0 -0
- package/AISB/image/098_aisb.t3.098_xmahalanobis.jpg +0 -0
- package/AISB/image/099_aisb.t3.099_ollalanding.jpg +0 -0
- package/AISB/image/100_aisb.t3.100_invmissingdata.jpg +0 -0
- package/AISB/image/101_aisb.t3.101_acia.jpg +0 -0
- package/AISB/image/102_aisb.t3.102_stochasticff.jpg +0 -0
- package/AISB/image/103_aisb.t3.103_qdcp.jpg +0 -0
- package/AISB/image/104_aisb.t3.104_balancedactiveinf.jpg +0 -0
- package/AISB/image/105_aisb.t3.105_binaryclasseval.jpg +0 -0
- package/AISB/image/106_aisb.t1.reasoning_lite.jpg +0 -0
- package/AISB/image/107_aisb.t2.paper_audit.jpg +0 -0
- package/AISB/image/108_aisb.t3.multi_gpu_search.jpg +0 -0
- package/AISB/image/109_aisb.t3.tdc_admet.jpg +0 -0
- package/AISB/image/aisb.b1.agentic_coding.svg +16 -0
- package/AISB/image/aisb.b10.climate_earth.svg +16 -0
- package/AISB/image/aisb.b11.model_efficiency.svg +16 -0
- package/AISB/image/aisb.b12.embodied_ai.svg +16 -0
- package/AISB/image/aisb.b2.agent_systems.svg +16 -0
- package/AISB/image/aisb.b3.self_evolving_rl.svg +16 -0
- package/AISB/image/aisb.b4.lm_reasoning.svg +16 -0
- package/AISB/image/aisb.b5.math_proof.svg +16 -0
- package/AISB/image/aisb.b6.research_process.svg +16 -0
- package/AISB/image/aisb.b7.multimodal_fusion.svg +16 -0
- package/AISB/image/aisb.b8.lifesci_drug.svg +16 -0
- package/AISB/image/aisb.b9.material_science.svg +16 -0
- package/README.md +132 -11
- package/bin/ds.js +376 -49
- package/docs/en/00_QUICK_START.md +135 -18
- package/docs/en/01_SETTINGS_REFERENCE.md +468 -96
- package/docs/en/02_START_RESEARCH_GUIDE.md +26 -5
- package/docs/en/03_QQ_CONNECTOR_GUIDE.md +14 -3
- package/docs/en/04_LINGZHU_CONNECTOR_GUIDE.md +2 -0
- package/docs/en/05_TUI_GUIDE.md +171 -2
- package/docs/en/07_MEMORY_AND_MCP.md +38 -2
- package/docs/en/09_DOCTOR.md +64 -4
- package/docs/en/10_WEIXIN_CONNECTOR_GUIDE.md +38 -1
- package/docs/en/11_LICENSE_AND_RISK.md +4 -0
- package/docs/en/12_GUIDED_WORKFLOW_TOUR.md +15 -0
- package/docs/en/14_PROMPT_SKILLS_AND_MCP_GUIDE.md +9 -0
- package/docs/en/15_CODEX_PROVIDER_SETUP.md +622 -187
- package/docs/en/16_TELEGRAM_CONNECTOR_GUIDE.md +14 -0
- package/docs/en/17_WHATSAPP_CONNECTOR_GUIDE.md +14 -0
- package/docs/en/18_FEISHU_CONNECTOR_GUIDE.md +14 -0
- package/docs/en/21_LOCAL_MODEL_BACKENDS_GUIDE.md +105 -2
- package/docs/en/22_BENCHSTORE_YAML_REFERENCE.md +469 -0
- package/docs/en/23_BENCHSTORE_GITHUB_RELEASES_SPEC.md +316 -0
- package/docs/en/24_CLAUDE_CODE_PROVIDER_SETUP.md +469 -0
- package/docs/en/25_OPENCODE_PROVIDER_SETUP.md +653 -0
- package/docs/en/26_CITATION_AND_ATTRIBUTION.md +119 -0
- package/docs/en/27_KIMI_CODE_PROVIDER_SETUP.md +180 -0
- package/docs/en/28_DISCORD_CONNECTOR_GUIDE.md +61 -0
- package/docs/en/29_SLACK_CONNECTOR_GUIDE.md +60 -0
- package/docs/en/30_SETTINGS_CONTROL_CENTER_GUIDE.md +371 -0
- package/docs/en/{19_LOCAL_BROWSER_AUTH.md → 31_LOCAL_BROWSER_AUTH.md} +1 -1
- package/docs/en/32_WINDOWS_WSL2_DEPLOYMENT_GUIDE.md +273 -0
- package/docs/en/33_WORKSPACE_EXPLORER_QA.md +121 -0
- package/docs/en/91_DEVELOPMENT.md +29 -0
- package/docs/en/99_ACKNOWLEDGEMENTS.md +24 -19
- package/docs/en/README.md +44 -7
- package/docs/images/admin/admin-connectors-health-en.png +0 -0
- package/docs/images/admin/admin-controllers-en.png +0 -0
- package/docs/images/admin/admin-diagnostics-en.png +0 -0
- package/docs/images/admin/admin-errors-en.png +0 -0
- package/docs/images/admin/admin-issues-en.png +0 -0
- package/docs/images/admin/admin-logs-en.png +0 -0
- package/docs/images/admin/admin-quest-detail-en.png +0 -0
- package/docs/images/admin/admin-quests-en.png +0 -0
- package/docs/images/admin/admin-repairs-en.png +0 -0
- package/docs/images/admin/admin-runtime-en.png +0 -0
- package/docs/images/admin/admin-search-en.png +0 -0
- package/docs/images/admin/admin-stats-en.png +0 -0
- package/docs/images/admin/admin-summary-en.png +0 -0
- package/docs/images/connectors/connector-discord-en.png +0 -0
- package/docs/images/connectors/connector-feishu-en.png +0 -0
- package/docs/images/connectors/connector-lingzhu-en.png +0 -0
- package/docs/images/connectors/connector-qq-en.png +0 -0
- package/docs/images/connectors/connector-slack-en.png +0 -0
- package/docs/images/connectors/connector-telegram-en.png +0 -0
- package/docs/images/connectors/connector-weixin-en.png +0 -0
- package/docs/images/connectors/connector-whatsapp-en.png +0 -0
- package/docs/images/settings/settings-baselines-en.png +0 -0
- package/docs/images/settings/settings-config-en.png +0 -0
- package/docs/images/settings/settings-connectors-overview-en.png +0 -0
- package/docs/images/settings/settings-deepxiv-en.png +0 -0
- package/docs/images/settings/settings-mcp-servers-en.png +0 -0
- package/docs/images/settings/settings-plugins-en.png +0 -0
- package/docs/images/settings/settings-runners-en.png +0 -0
- package/docs/zh/00_QUICK_START.md +92 -17
- package/docs/zh/01_SETTINGS_REFERENCE.md +219 -98
- package/docs/zh/02_START_RESEARCH_GUIDE.md +26 -5
- package/docs/zh/05_TUI_GUIDE.md +171 -2
- package/docs/zh/07_MEMORY_AND_MCP.md +29 -2
- package/docs/zh/09_DOCTOR.md +39 -4
- package/docs/zh/10_WEIXIN_CONNECTOR_GUIDE.md +24 -1
- package/docs/zh/11_LICENSE_AND_RISK.md +4 -0
- package/docs/zh/12_GUIDED_WORKFLOW_TOUR.md +15 -0
- package/docs/zh/14_PROMPT_SKILLS_AND_MCP_GUIDE.md +9 -0
- package/docs/zh/15_CODEX_PROVIDER_SETUP.md +550 -188
- package/docs/zh/21_LOCAL_MODEL_BACKENDS_GUIDE.md +105 -2
- package/docs/zh/22_BENCHSTORE_YAML_REFERENCE.md +459 -0
- package/docs/zh/23_BENCHSTORE_GITHUB_RELEASES_SPEC.md +287 -0
- package/docs/zh/23_CLAUDE_RUNNER_GUIDE.md +103 -0
- package/docs/zh/24_CLAUDE_CODE_PROVIDER_SETUP.md +460 -0
- package/docs/zh/25_OPENCODE_PROVIDER_SETUP.md +660 -0
- package/docs/zh/26_CITATION_AND_ATTRIBUTION.md +102 -0
- package/docs/zh/27_KIMI_CODE_PROVIDER_SETUP.md +51 -0
- package/docs/zh/{19_LOCAL_BROWSER_AUTH.md → 31_LOCAL_BROWSER_AUTH.md} +1 -1
- package/docs/zh/32_WINDOWS_WSL2_DEPLOYMENT_GUIDE.md +264 -0
- package/docs/zh/33_WORKSPACE_EXPLORER_QA.md +127 -0
- package/docs/zh/99_ACKNOWLEDGEMENTS.md +23 -19
- package/docs/zh/README.md +29 -7
- package/install.sh +122 -16
- package/package.json +4 -1
- package/pyproject.toml +2 -1
- package/src/deepscientist/__init__.py +1 -1
- package/src/deepscientist/acp/envelope.py +13 -0
- package/src/deepscientist/admin/__init__.py +3 -0
- package/src/deepscientist/admin/charts.py +681 -0
- package/src/deepscientist/admin/logs.py +119 -0
- package/src/deepscientist/admin/repairs.py +217 -0
- package/src/deepscientist/admin/service.py +1310 -0
- package/src/deepscientist/admin/system_info.py +700 -0
- package/src/deepscientist/admin/tasks.py +465 -0
- package/src/deepscientist/admin/tool_metrics.py +600 -0
- package/src/deepscientist/artifact/guidance.py +8 -4
- package/src/deepscientist/artifact/schemas.py +115 -0
- package/src/deepscientist/artifact/service.py +4268 -260
- package/src/deepscientist/bash_exec/monitor.py +30 -3
- package/src/deepscientist/bash_exec/service.py +134 -1
- package/src/deepscientist/benchstore/__init__.py +4 -0
- package/src/deepscientist/benchstore/prompt_builder.py +224 -0
- package/src/deepscientist/benchstore/service.py +1716 -0
- package/src/deepscientist/channels/weixin_ilink.py +8 -1
- package/src/deepscientist/cli.py +92 -17
- package/src/deepscientist/codex_cli_compat.py +2 -2
- package/src/deepscientist/config/models.py +82 -11
- package/src/deepscientist/config/service.py +927 -91
- package/src/deepscientist/connector/weixin_support.py +48 -17
- package/src/deepscientist/daemon/api/handlers.py +697 -210
- package/src/deepscientist/daemon/api/router.py +76 -1
- package/src/deepscientist/daemon/app.py +1054 -51
- package/src/deepscientist/diagnostics/runner_failures.py +147 -0
- package/src/deepscientist/doctor.py +212 -65
- package/src/deepscientist/evidence_packets.py +590 -0
- package/src/deepscientist/home.py +52 -4
- package/src/deepscientist/kimi_cli_compat.py +50 -0
- package/src/deepscientist/latex_runtime.py +2 -2
- package/src/deepscientist/mcp/context.py +2 -0
- package/src/deepscientist/mcp/schemas.py +114 -0
- package/src/deepscientist/mcp/server.py +1566 -126
- package/src/deepscientist/memory/service.py +203 -16
- package/src/deepscientist/process_control.py +8 -1
- package/src/deepscientist/prompts/builder.py +836 -92
- package/src/deepscientist/quest/__init__.py +2 -2
- package/src/deepscientist/quest/layout.py +12 -1
- package/src/deepscientist/quest/node_traces.py +10 -0
- package/src/deepscientist/quest/service.py +1430 -139
- package/src/deepscientist/quest/stage_views.py +1 -1
- package/src/deepscientist/runners/__init__.py +18 -0
- package/src/deepscientist/runners/base.py +89 -1
- package/src/deepscientist/runners/builtins.py +13 -1
- package/src/deepscientist/runners/claude.py +391 -0
- package/src/deepscientist/runners/codex.py +421 -21
- package/src/deepscientist/runners/codex_telemetry.py +127 -0
- package/src/deepscientist/runners/kimi.py +334 -0
- package/src/deepscientist/runners/metadata.py +68 -0
- package/src/deepscientist/runners/opencode.py +414 -0
- package/src/deepscientist/runners/runtime_overrides.py +100 -0
- package/src/deepscientist/runners/simple_cli.py +538 -0
- package/src/deepscientist/runtime_storage.py +303 -0
- package/src/deepscientist/shared.py +61 -16
- package/src/deepscientist/skills/installer.py +37 -0
- package/src/deepscientist/skills/registry.py +2 -0
- package/src/deepscientist/tinytex.py +2 -2
- package/src/deepscientist/tui.py +10 -3
- package/src/prompts/benchstore/system.md +77 -0
- package/src/prompts/connectors/qq.md +33 -2
- package/src/prompts/connectors/weixin.md +208 -23
- package/src/prompts/contracts/admin_ops.md +74 -0
- package/src/prompts/contracts/admin_ops_knowledge.md +138 -0
- package/src/prompts/contracts/shared_interaction.md +5 -11
- package/src/prompts/start_setup/system.md +422 -0
- package/src/prompts/system.md +409 -315
- package/src/prompts/system_copilot.md +88 -12
- package/src/skills/analysis-campaign/SKILL.md +239 -578
- package/src/skills/analysis-campaign/references/artifact-flow-examples.md +102 -0
- package/src/skills/analysis-campaign/references/boundary-cases.md +98 -0
- package/src/skills/analysis-campaign/references/campaign-checklist-template.md +39 -24
- package/src/skills/analysis-campaign/references/campaign-design.md +26 -10
- package/src/skills/analysis-campaign/references/campaign-plan-template.md +53 -54
- package/src/skills/analysis-campaign/references/operational-guidance.md +97 -0
- package/src/skills/analysis-campaign/references/writing-facing-slice-examples.md +10 -20
- package/src/skills/baseline/SKILL.md +183 -461
- package/src/skills/baseline/references/artifact-flow-examples.md +106 -0
- package/src/skills/baseline/references/artifact-payload-examples.md +1 -1
- package/src/skills/baseline/references/baseline-checklist-template.md +27 -35
- package/src/skills/baseline/references/baseline-plan-template.md +37 -76
- package/src/skills/baseline/references/boundary-cases.md +86 -0
- package/src/skills/baseline/references/codebase-audit-checklist.md +2 -6
- package/src/skills/baseline/references/comparability-contract.md +7 -12
- package/src/skills/baseline/references/operational-guidance.md +56 -0
- package/src/skills/baseline/references/route-selection.md +5 -25
- package/src/skills/decision/SKILL.md +113 -306
- package/src/skills/decision/references/checkpoint-memory-template.md +47 -0
- package/src/skills/decision/references/operational-guidance.md +94 -0
- package/src/skills/decision/references/research-route-criteria.md +7 -8
- package/src/skills/decision/references/strategic-decision-template.md +13 -26
- package/src/skills/experiment/SKILL.md +132 -670
- package/src/skills/experiment/references/execution-playbook.md +374 -0
- package/src/skills/experiment/references/main-experiment-checklist-template.md +26 -2
- package/src/skills/experiment/references/main-experiment-plan-template.md +28 -17
- package/src/skills/experiment/references/operational-guidance.md +108 -0
- package/src/skills/finalize/SKILL.md +62 -0
- package/src/skills/finalize/references/checkpoint-memory-template.md +49 -0
- package/src/skills/finalize/references/resume-packet-template.md +7 -0
- package/src/skills/idea/SKILL.md +228 -15
- package/src/skills/idea/references/controlled-brainstorming-playbook.md +78 -0
- package/src/skills/idea/references/current-board-packet-template.md +61 -0
- package/src/skills/idea/references/high-value-idea-sourcing.md +119 -0
- package/src/skills/idea/references/idea-generation-playbook.md +21 -0
- package/src/skills/idea/references/idea-thinking-flow.md +6 -0
- package/src/skills/idea/references/literature-survey-template.md +3 -0
- package/src/skills/idea/references/objective-contract-template.md +54 -0
- package/src/skills/idea/references/outline-seeding-example.md +56 -0
- package/src/skills/idea/references/pre-idea-draft-template.md +105 -0
- package/src/skills/idea/references/related-work-playbook.md +75 -2
- package/src/skills/idea/references/research-history-playbook.md +114 -0
- package/src/skills/idea/references/selection-gate.md +58 -6
- package/src/skills/intake-audit/SKILL.md +43 -2
- package/src/skills/intake-audit/references/state-audit-template.md +10 -0
- package/src/skills/nature-data/SKILL.md +128 -0
- package/src/skills/nature-data/UPSTREAM_LICENSE.txt +21 -0
- package/src/skills/nature-data/agents/openai.yaml +4 -0
- package/src/skills/nature-data/references/chinese-author-alignment.md +84 -0
- package/src/skills/nature-data/references/fair-metadata-checklist.md +105 -0
- package/src/skills/nature-data/references/policy-principles.md +103 -0
- package/src/skills/nature-data/references/repository-and-identifiers.md +96 -0
- package/src/skills/nature-data/references/source-basis.md +54 -0
- package/src/skills/nature-data/references/statement-patterns.md +153 -0
- package/src/skills/nature-figure/SKILL.md +197 -0
- package/src/skills/nature-figure/UPSTREAM_LICENSE.txt +21 -0
- package/src/skills/nature-figure/agents/openai.yaml +4 -0
- package/src/skills/nature-figure/evals/evals.json +37 -0
- package/src/skills/nature-figure/references/api.md +428 -0
- package/src/skills/nature-figure/references/backend-selection.md +100 -0
- package/src/skills/nature-figure/references/chart-types.md +281 -0
- package/src/skills/nature-figure/references/common-patterns.md +349 -0
- package/src/skills/nature-figure/references/design-theory.md +436 -0
- package/src/skills/nature-figure/references/figure-contract.md +93 -0
- package/src/skills/nature-figure/references/nature-2026-observations.md +112 -0
- package/src/skills/nature-figure/references/qa-contract.md +119 -0
- package/src/skills/nature-figure/references/r-template-index.md +66 -0
- package/src/skills/nature-figure/references/r-workflow.md +161 -0
- package/src/skills/nature-figure/references/tutorials.md +250 -0
- package/src/skills/nature-paper2ppt/SKILL.md +507 -0
- package/src/skills/nature-paper2ppt/UPSTREAM_LICENSE.txt +21 -0
- package/src/skills/nature-paper2ppt/agents/openai.yaml +4 -0
- package/src/skills/nature-polishing/SKILL.md +385 -0
- package/src/skills/nature-polishing/UPSTREAM_LICENSE.txt +21 -0
- package/src/skills/nature-polishing/agents/openai.yaml +4 -0
- package/src/skills/nature-polishing/references/phrasebank-playbook.md +162 -0
- package/src/skills/nature-polishing/references/section-moves.md +240 -0
- package/src/skills/nature-polishing/references/style-guardrails.md +94 -0
- package/src/skills/nature-polishing/references/writing-strategy.md +148 -0
- package/src/skills/optimize/SKILL.md +177 -1568
- package/src/skills/optimize/references/brief-shaping-playbook.md +95 -0
- package/src/skills/optimize/references/candidate-board-template.md +13 -0
- package/src/skills/optimize/references/candidate-ranking-template.md +51 -0
- package/src/skills/optimize/references/codegen-route-playbook.md +50 -0
- package/src/skills/optimize/references/debug-response-template.md +29 -0
- package/src/skills/optimize/references/frontier-review-template.md +32 -0
- package/src/skills/optimize/references/fusion-playbook.md +36 -0
- package/src/skills/optimize/references/method-brief-template.md +73 -0
- package/src/skills/optimize/references/operational-guidance.md +621 -0
- package/src/skills/optimize/references/optimization-memory-template.md +30 -0
- package/src/skills/optimize/references/optimize-checklist-template.md +18 -0
- package/src/skills/optimize/references/plateau-response-playbook.md +28 -0
- package/src/skills/optimize/references/prompt-patterns.md +49 -0
- package/src/skills/paper-outline/SKILL.md +227 -0
- package/src/skills/paper-outline/references/outline-patterns.md +87 -0
- package/src/skills/paper-plot/SKILL.md +79 -0
- package/src/skills/paper-plot/agents/openai.yaml +4 -0
- package/src/skills/paper-plot/references/bar_grouped_hatch.md +96 -0
- package/src/skills/paper-plot/references/bar_paired_delta.md +72 -0
- package/src/skills/paper-plot/references/line_confidence_band.md +75 -0
- package/src/skills/paper-plot/references/line_loss_with_inset.md +65 -0
- package/src/skills/paper-plot/references/line_training_curve.md +44 -0
- package/src/skills/paper-plot/references/radar_dual_series.md +59 -0
- package/src/skills/paper-plot/references/scatter_broken_axis.md +59 -0
- package/src/skills/paper-plot/references/scatter_tsne_cluster.md +72 -0
- package/src/skills/paper-plot/scripts/bar_memevolve.py +109 -0
- package/src/skills/paper-plot/scripts/bar_spice.py +166 -0
- package/src/skills/paper-plot/scripts/line_aime.py +94 -0
- package/src/skills/paper-plot/scripts/line_loss_inset.py +157 -0
- package/src/skills/paper-plot/scripts/line_selfdistill.py +168 -0
- package/src/skills/paper-plot/scripts/radar_dora.py +151 -0
- package/src/skills/paper-plot/scripts/scatter_break.py +169 -0
- package/src/skills/paper-plot/scripts/scatter_tsne.py +133 -0
- package/src/skills/rebuttal/SKILL.md +9 -0
- package/src/skills/references/tool-usage-by-stage.md +438 -0
- package/src/skills/review/SKILL.md +105 -7
- package/src/skills/science/PROVENANCE.md +44 -0
- package/src/skills/science/SKILL.md +137 -0
- package/src/skills/science/references/artifact-science-tool.md +110 -0
- package/src/skills/science/references/claim-type-discipline.md +56 -0
- package/src/skills/science/references/domain-index.md +422 -0
- package/src/skills/science/references/hpc-via-bash-exec.md +42 -0
- package/src/skills/science/references/package-check-playbook.md +64 -0
- package/src/skills/science/references/package-index.min.json +3616 -0
- package/src/skills/science/references/packages/abinit.md +80 -0
- package/src/skills/science/references/packages/acts.md +73 -0
- package/src/skills/science/references/packages/aiida-core.md +80 -0
- package/src/skills/science/references/packages/alamode.md +80 -0
- package/src/skills/science/references/packages/amuse.md +88 -0
- package/src/skills/science/references/packages/anndata.md +88 -0
- package/src/skills/science/references/packages/arbor.md +80 -0
- package/src/skills/science/references/packages/arc.md +73 -0
- package/src/skills/science/references/packages/astropy.md +88 -0
- package/src/skills/science/references/packages/astroquery.md +88 -0
- package/src/skills/science/references/packages/atomate2.md +80 -0
- package/src/skills/science/references/packages/atomsmltr.md +73 -0
- package/src/skills/science/references/packages/awkward.md +73 -0
- package/src/skills/science/references/packages/batman.md +88 -0
- package/src/skills/science/references/packages/biopython.md +88 -0
- package/src/skills/science/references/packages/bloqade.md +73 -0
- package/src/skills/science/references/packages/brian2.md +73 -0
- package/src/skills/science/references/packages/bullet3.md +73 -0
- package/src/skills/science/references/packages/calculix.md +80 -0
- package/src/skills/science/references/packages/cantera.md +73 -0
- package/src/skills/science/references/packages/cavity-md-ipi.md +80 -0
- package/src/skills/science/references/packages/ccdproc.md +88 -0
- package/src/skills/science/references/packages/celerite2.md +88 -0
- package/src/skills/science/references/packages/cellrank.md +73 -0
- package/src/skills/science/references/packages/cesm.md +80 -0
- package/src/skills/science/references/packages/chemicals.md +73 -0
- package/src/skills/science/references/packages/chempy.md +73 -0
- package/src/skills/science/references/packages/cirq.md +73 -0
- package/src/skills/science/references/packages/coffea.md +73 -0
- package/src/skills/science/references/packages/cp2k.md +88 -0
- package/src/skills/science/references/packages/custodian.md +80 -0
- package/src/skills/science/references/packages/dart.md +73 -0
- package/src/skills/science/references/packages/datamol.md +88 -0
- package/src/skills/science/references/packages/dd4hep.md +73 -0
- package/src/skills/science/references/packages/dealii.md +80 -0
- package/src/skills/science/references/packages/deepchem.md +88 -0
- package/src/skills/science/references/packages/delphes.md +73 -0
- package/src/skills/science/references/packages/devito.md +80 -0
- package/src/skills/science/references/packages/dftb.md +88 -0
- package/src/skills/science/references/packages/dftd4.md +88 -0
- package/src/skills/science/references/packages/dftk-jl.md +80 -0
- package/src/skills/science/references/packages/dolfinx.md +80 -0
- package/src/skills/science/references/packages/drake.md +73 -0
- package/src/skills/science/references/packages/dumux.md +73 -0
- package/src/skills/science/references/packages/elk.md +80 -0
- package/src/skills/science/references/packages/elmerfem.md +80 -0
- package/src/skills/science/references/packages/enzo-e.md +88 -0
- package/src/skills/science/references/packages/espresso.md +80 -0
- package/src/skills/science/references/packages/exoplanet.md +88 -0
- package/src/skills/science/references/packages/fairroot.md +73 -0
- package/src/skills/science/references/packages/fbpic.md +80 -0
- package/src/skills/science/references/packages/fdtdbath-meep.md +80 -0
- package/src/skills/science/references/packages/geant4.md +73 -0
- package/src/skills/science/references/packages/geosx.md +80 -0
- package/src/skills/science/references/packages/gprmax.md +80 -0
- package/src/skills/science/references/packages/gromacs.md +80 -0
- package/src/skills/science/references/packages/gwaslab.md +73 -0
- package/src/skills/science/references/packages/gz-sim.md +73 -0
- package/src/skills/science/references/packages/hail.md +88 -0
- package/src/skills/science/references/packages/hiphive.md +80 -0
- package/src/skills/science/references/packages/hoomd-blue.md +80 -0
- package/src/skills/science/references/packages/itensor.md +73 -0
- package/src/skills/science/references/packages/itensors-jl.md +73 -0
- package/src/skills/science/references/packages/jdftx.md +73 -0
- package/src/skills/science/references/packages/jobflow.md +80 -0
- package/src/skills/science/references/packages/kadanoffbaym-jl.md +73 -0
- package/src/skills/science/references/packages/kite.md +80 -0
- package/src/skills/science/references/packages/kratos.md +80 -0
- package/src/skills/science/references/packages/kwant.md +73 -0
- package/src/skills/science/references/packages/lammps.md +80 -0
- package/src/skills/science/references/packages/lightkurve.md +88 -0
- package/src/skills/science/references/packages/limix.md +73 -0
- package/src/skills/science/references/packages/maxwelllink.md +80 -0
- package/src/skills/science/references/packages/mcdc.md +73 -0
- package/src/skills/science/references/packages/meep.md +80 -0
- package/src/skills/science/references/packages/mfem.md +80 -0
- package/src/skills/science/references/packages/mitgcm.md +73 -0
- package/src/skills/science/references/packages/modflow6.md +73 -0
- package/src/skills/science/references/packages/molecool.md +73 -0
- package/src/skills/science/references/packages/mom6.md +73 -0
- package/src/skills/science/references/packages/moose.md +80 -0
- package/src/skills/science/references/packages/mpas-model.md +73 -0
- package/src/skills/science/references/packages/mujoco.md +73 -0
- package/src/skills/science/references/packages/mumax3.md +73 -0
- package/src/skills/science/references/packages/nekrs.md +80 -0
- package/src/skills/science/references/packages/nessi.md +73 -0
- package/src/skills/science/references/packages/nest-simulator.md +73 -0
- package/src/skills/science/references/packages/netket.md +73 -0
- package/src/skills/science/references/packages/neuron.md +73 -0
- package/src/skills/science/references/packages/nextflow.md +88 -0
- package/src/skills/science/references/packages/nwchem.md +88 -0
- package/src/skills/science/references/packages/openbabel.md +88 -0
- package/src/skills/science/references/packages/openems.md +80 -0
- package/src/skills/science/references/packages/openff-toolkit.md +88 -0
- package/src/skills/science/references/packages/openfoam-dev.md +80 -0
- package/src/skills/science/references/packages/openmc.md +73 -0
- package/src/skills/science/references/packages/openmm.md +80 -0
- package/src/skills/science/references/packages/openmoc.md +73 -0
- package/src/skills/science/references/packages/openmx.md +80 -0
- package/src/skills/science/references/packages/opensees.md +80 -0
- package/src/skills/science/references/packages/opensn.md +80 -0
- package/src/skills/science/references/packages/opm-simulators.md +73 -0
- package/src/skills/science/references/packages/oqupy.md +73 -0
- package/src/skills/science/references/packages/packmol.md +80 -0
- package/src/skills/science/references/packages/palabos.md +80 -0
- package/src/skills/science/references/packages/parflow.md +80 -0
- package/src/skills/science/references/packages/pennylane.md +88 -0
- package/src/skills/science/references/packages/perceval.md +73 -0
- package/src/skills/science/references/packages/phono3py.md +73 -0
- package/src/skills/science/references/packages/phonopy.md +73 -0
- package/src/skills/science/references/packages/photutils.md +88 -0
- package/src/skills/science/references/packages/picongpu.md +80 -0
- package/src/skills/science/references/packages/plink-ng.md +88 -0
- package/src/skills/science/references/packages/precice.md +73 -0
- package/src/skills/science/references/packages/psc.md +80 -0
- package/src/skills/science/references/packages/psi4.md +88 -0
- package/src/skills/science/references/packages/pybinding.md +73 -0
- package/src/skills/science/references/packages/pyfr.md +80 -0
- package/src/skills/science/references/packages/pyhf.md +73 -0
- package/src/skills/science/references/packages/pyiron_base.md +80 -0
- package/src/skills/science/references/packages/pylcp.md +73 -0
- package/src/skills/science/references/packages/pylith.md +80 -0
- package/src/skills/science/references/packages/pynbody.md +88 -0
- package/src/skills/science/references/packages/pysam.md +88 -0
- package/src/skills/science/references/packages/pyscf.md +88 -0
- package/src/skills/science/references/packages/q-e.md +73 -0
- package/src/skills/science/references/packages/qibo.md +73 -0
- package/src/skills/science/references/packages/qiskit.md +73 -0
- package/src/skills/science/references/packages/quantica-jl.md +73 -0
- package/src/skills/science/references/packages/quantumoptics-jl.md +73 -0
- package/src/skills/science/references/packages/quimb.md +73 -0
- package/src/skills/science/references/packages/qulacs.md +73 -0
- package/src/skills/science/references/packages/qutip.md +73 -0
- package/src/skills/science/references/packages/rdkit.md +88 -0
- package/src/skills/science/references/packages/rmg-py.md +73 -0
- package/src/skills/science/references/packages/root.md +73 -0
- package/src/skills/science/references/packages/scanpy.md +88 -0
- package/src/skills/science/references/packages/scikit-allel.md +88 -0
- package/src/skills/science/references/packages/scikit-bio.md +88 -0
- package/src/skills/science/references/packages/scqubits.md +73 -0
- package/src/skills/science/references/packages/scuff-em.md +80 -0
- package/src/skills/science/references/packages/scvi-tools.md +73 -0
- package/src/skills/science/references/packages/seissol.md +73 -0
- package/src/skills/science/references/packages/sfepy.md +80 -0
- package/src/skills/science/references/packages/sisl.md +73 -0
- package/src/skills/science/references/packages/smilei.md +80 -0
- package/src/skills/science/references/packages/snakemake.md +88 -0
- package/src/skills/science/references/packages/specfem3d-globe.md +80 -0
- package/src/skills/science/references/packages/specutils.md +88 -0
- package/src/skills/science/references/packages/spglib.md +80 -0
- package/src/skills/science/references/packages/squidpy.md +88 -0
- package/src/skills/science/references/packages/starry.md +88 -0
- package/src/skills/science/references/packages/strawberryfields.md +73 -0
- package/src/skills/science/references/packages/su2.md +80 -0
- package/src/skills/science/references/packages/sunny-jl.md +73 -0
- package/src/skills/science/references/packages/sw4.md +73 -0
- package/src/skills/science/references/packages/swift.md +88 -0
- package/src/skills/science/references/packages/tdnegf.md +73 -0
- package/src/skills/science/references/packages/tenpy.md +73 -0
- package/src/skills/science/references/packages/thermo.md +73 -0
- package/src/skills/science/references/packages/tkwant.md +73 -0
- package/src/skills/science/references/packages/tvb-root.md +73 -0
- package/src/skills/science/references/packages/uproot5.md +73 -0
- package/src/skills/science/references/packages/vampire.md +80 -0
- package/src/skills/science/references/packages/wannier_tools.md +73 -0
- package/src/skills/science/references/packages/warpx.md +80 -0
- package/src/skills/science/references/packages/wrf.md +73 -0
- package/src/skills/science/references/packages/xtb.md +88 -0
- package/src/skills/science/references/packages/yt.md +73 -0
- package/src/skills/science/references/science-task-brief-template.md +71 -0
- package/src/skills/scout/SKILL.md +83 -425
- package/src/skills/scout/references/literature-scout-template.md +5 -24
- package/src/skills/scout/references/operational-guidance.md +191 -0
- package/src/skills/scout/references/paper-triage-playbook.md +11 -35
- package/src/skills/write/SKILL.md +744 -1246
- package/src/skills/write/references/experiments_analysis_patterns.md +129 -0
- package/src/skills/write/references/oral_package_patterns.md +252 -0
- package/src/skills/write/references/oral_writing_principles.md +291 -0
- package/src/skills/write/references/section_rewrite_checklist.md +234 -0
- package/src/tui/dist/app/AppContainer.js +1314 -27
- package/src/tui/dist/components/Composer.js +26 -1
- package/src/tui/dist/components/ConfigScreen.js +2 -1
- package/src/tui/dist/components/InputPrompt.js +25 -9
- package/src/tui/dist/components/MainContent.js +18 -3
- package/src/tui/dist/components/QuestScreen.js +3 -2
- package/src/tui/dist/components/UtilityScreen.js +37 -0
- package/src/tui/dist/hooks/useSafeInput.js +10 -0
- package/src/tui/dist/index.js +13 -1
- package/src/tui/dist/layouts/DefaultAppLayout.js +11 -8
- package/src/tui/dist/lib/api.js +89 -1
- package/src/tui/package.json +1 -1
- package/src/ui/dist/assets/{AnalysisPlugin-BCKAfjba.js → AnalysisPlugin-CA94NGmI.js} +1 -1
- package/src/ui/dist/assets/CliPlugin-DHBzphZU.js +79 -0
- package/src/ui/dist/assets/CodeEditorPlugin-BOFwD2rn.js +2 -0
- package/src/ui/dist/assets/{CodeViewerPlugin-CbaFRrUU.js → CodeViewerPlugin-CqDpgjik.js} +4 -4
- package/src/ui/dist/assets/{DocViewerPlugin-DAjLVeQD.js → DocViewerPlugin-UDBgt8-4.js} +3 -3
- package/src/ui/dist/assets/GitCommitViewerPlugin-BmHtZ0bZ.js +6 -0
- package/src/ui/dist/assets/{GitDiffViewerPlugin-CQACjoAA.js → GitDiffViewerPlugin-CAxjNorQ.js} +2 -2
- package/src/ui/dist/assets/{GitSnapshotViewer-0r4nLPke.js → GitSnapshotViewer-CweA6VON.js} +2 -2
- package/src/ui/dist/assets/{ImageViewerPlugin-nBOmI2v_.js → ImageViewerPlugin-C8wHGvGN.js} +5 -5
- package/src/ui/dist/assets/LabPlugin-COyyLUol.js +32 -0
- package/src/ui/dist/assets/{LatexPlugin-ZwtV8pIp.js → LatexPlugin-BQjAaA5J.js} +4 -4
- package/src/ui/dist/assets/{MarkdownViewerPlugin-DKqVfKyW.js → MarkdownViewerPlugin-Dy1NE2dI.js} +3 -3
- package/src/ui/dist/assets/{MarketplacePlugin-BwxStZ9D.js → MarketplacePlugin-DMIZtEJ2.js} +2 -2
- package/src/ui/dist/assets/NotebookEditor-CFHMq_Qt.js +91 -0
- package/src/ui/dist/assets/{NotebookEditor-DB9N_T9q.js → NotebookEditor-WFyd8Ybt.js} +3 -3
- package/src/ui/dist/assets/{PdfLoader-eWBONbQP.js → PdfLoader-CLE5u5TS.js} +3 -3
- package/src/ui/dist/assets/{PdfMarkdownPlugin-D22YOZL3.js → PdfMarkdownPlugin-_iNK_H83.js} +1 -1
- package/src/ui/dist/assets/PdfViewerPlugin-DgWsbInT.js +22 -0
- package/src/ui/dist/assets/SearchPlugin-DrZmn5iw.js +11 -0
- package/src/ui/dist/assets/{TextViewerPlugin-C5xqeeUH.js → TextViewerPlugin-D1-T3aC7.js} +4 -4
- package/src/ui/dist/assets/branding/runner-claude.svg +107 -0
- package/src/ui/dist/assets/branding/runner-codex.svg +10 -0
- package/src/ui/dist/assets/branding/runner-kimi.svg +14 -0
- package/src/ui/dist/assets/branding/runner-opencode.svg +7 -0
- package/src/ui/dist/assets/cli-store-CoZ-x5Ip.js +1 -0
- package/src/ui/dist/assets/{code-WlFHE7z_.js → code-DbsmSd3Y.js} +1 -1
- package/src/ui/dist/assets/file-diff-panel-DsvyRz47.js +1 -0
- package/src/ui/dist/assets/{wrap-text-BC-Hltpd.js → file-jump-queue-DeQBikaw.js} +3 -3
- package/src/ui/dist/assets/{file-socket-CfQPKQKj.js → file-socket-DA5XIx88.js} +1 -1
- package/src/ui/dist/assets/fonts/ds-fonts.css +50 -4
- package/src/ui/dist/assets/images/deepxiv/register-guide.png +0 -0
- package/src/ui/dist/assets/index-39vY9LmZ.js +1 -0
- package/src/ui/dist/assets/{index-CwNu1aH4.js → index-BsO46tJA.js} +1 -1
- package/src/ui/dist/assets/index-CHzJ2xtB.js +3530 -0
- package/src/ui/dist/assets/index-DH-zxoZ3.css +33 -0
- package/src/ui/dist/assets/{plugin-notebook-HbW2K-1c.js → plugin-notebook-JRhysCqj.js} +2 -2
- package/src/ui/dist/assets/{project-sync-C9IdzdZW.js → project-sync-DPmWKmKD.js} +1 -1
- package/src/ui/dist/assets/{zoom-out-E_gaeAxL.js → zoom-out-DAukFWen.js} +3 -3
- package/src/ui/dist/index.html +3 -3
- package/src/skills/analysis-campaign/references/artifact-orchestration.md +0 -58
- package/src/skills/baseline/references/memory-playbook.md +0 -40
- package/src/skills/baseline/references/publishable-baseline-package.md +0 -30
- package/src/skills/write/references/outline-evidence-contract-example.md +0 -107
- package/src/skills/write/references/paper-experiment-matrix-template.md +0 -131
- package/src/skills/write/references/paper-section-playbook.md +0 -64
- package/src/skills/write/references/reviewer-first-writing.md +0 -64
- package/src/skills/write/references/revision-checklist.md +0 -70
- package/src/skills/write/references/section-contracts.md +0 -82
- package/src/skills/write/references/sentence-level-proofing.md +0 -49
- package/src/ui/dist/assets/AiManusChatView-Bv-Z8YpU.js +0 -204
- package/src/ui/dist/assets/CliPlugin-BCKcpc35.js +0 -109
- package/src/ui/dist/assets/CodeEditorPlugin-DbOfSJ8K.js +0 -2
- package/src/ui/dist/assets/GitCommitViewerPlugin-CIUqbUDO.js +0 -1
- package/src/ui/dist/assets/LabCopilotPanel-BHxOxF4z.js +0 -14
- package/src/ui/dist/assets/LabPlugin-BKoZGs95.js +0 -22
- package/src/ui/dist/assets/NotebookEditor-BEQhaQbt.js +0 -81
- package/src/ui/dist/assets/PdfViewerPlugin-c-RK9DLM.js +0 -17
- package/src/ui/dist/assets/SearchPlugin-CxF9ytAx.js +0 -16
- package/src/ui/dist/assets/VNCViewer-BoLGLnHz.js +0 -11
- package/src/ui/dist/assets/bot-DREQOxzP.js +0 -6
- package/src/ui/dist/assets/chevron-up-C9Qpx4DE.js +0 -6
- package/src/ui/dist/assets/file-content-BZMz3RYp.js +0 -1
- package/src/ui/dist/assets/file-diff-panel-CQhw0jS2.js +0 -1
- package/src/ui/dist/assets/file-jump-queue-DA-SdG__.js +0 -1
- package/src/ui/dist/assets/git-commit-horizontal-DxZ8DCZh.js +0 -6
- package/src/ui/dist/assets/image-Bgl4VIyx.js +0 -6
- package/src/ui/dist/assets/index-BpV6lusQ.css +0 -33
- package/src/ui/dist/assets/index-CBNVuWcP.js +0 -2496
- package/src/ui/dist/assets/index-DrUnlf6K.js +0 -1
- package/src/ui/dist/assets/index-NW-h8VzN.js +0 -1
- package/src/ui/dist/assets/pdf-effect-queue-J8OnM0jE.js +0 -6
- package/src/ui/dist/assets/popover-CLc0pPP8.js +0 -1
- package/src/ui/dist/assets/select-Cs2PmzwL.js +0 -11
- package/src/ui/dist/assets/sigma-ClKcHAXm.js +0 -6
- package/src/ui/dist/assets/trash-DwpbFr3w.js +0 -11
- package/src/ui/dist/assets/useCliAccess-NQ8m0Let.js +0 -1
- package/src/ui/dist/assets/useFileDiffOverlay-FuhcnKiw.js +0 -1
|
@@ -0,0 +1,146 @@
|
|
|
1
|
+
id: aisb.t3.034_ensemblewm
|
|
2
|
+
name: 大语言模型集成水印技术
|
|
3
|
+
version: 0.1.0
|
|
4
|
+
one_line: 结合红绿水印、藏头诗和感觉运动规范的多特征水印检测,用于鲁棒的LLM输出验证。
|
|
5
|
+
task_description: '该基准测试用于评估大语言模型输出的集成水印检测方法。集成方法结合三种不同的水印特征:红绿水印(令牌级logit操作)、藏头诗嵌入(句子首字母编码)和感觉运动规范(感知/动作类别选择)。任务涉及通过logit修改生成带水印文本,并检测清洁和改写后的输出中的水印。评估涵盖检测率、水印分数分布,以及使用基于T5的攻击管道对改写攻击的鲁棒性。统一检测函数可在所有集成配置下应用,无需修改。
|
|
6
|
+
|
|
7
|
+
'
|
|
8
|
+
task_mode: evaluation_driven
|
|
9
|
+
requires_execution: true
|
|
10
|
+
requires_paper: true
|
|
11
|
+
integrity_level: cas_plus_canary
|
|
12
|
+
snapshot_status: runnable
|
|
13
|
+
support_level: advanced
|
|
14
|
+
time_band: 6-24h
|
|
15
|
+
cost_band: medium
|
|
16
|
+
difficulty: hard
|
|
17
|
+
data_access: public
|
|
18
|
+
primary_outputs:
|
|
19
|
+
- detection_rate
|
|
20
|
+
- watermark_scores
|
|
21
|
+
- benchmark_report
|
|
22
|
+
launch_profiles:
|
|
23
|
+
- id: quick_eval
|
|
24
|
+
label: 快速评估
|
|
25
|
+
description: 在生成的输出上运行一个打包的水印检测评估路线。使用单一模型配置和预定义测试提示来测量基线检测率,无需攻击模拟。
|
|
26
|
+
- id: full_eval
|
|
27
|
+
label: 完整评估
|
|
28
|
+
description: 执行完整的多特征水印检测工作流程,包括生成、软改写攻击(基于T5的可配置百分比单词替换)以及所有特征组合的评估。生成检测率矩阵和水印分数分布。
|
|
29
|
+
dataset_download:
|
|
30
|
+
primary_method: mixed
|
|
31
|
+
sources:
|
|
32
|
+
- url: https://deepscientist.cc/AISB/034_ensemblewm
|
|
33
|
+
type: archive
|
|
34
|
+
format: zip
|
|
35
|
+
- url: https://huggingface.co/datasets/know-center/Lancaster_sensorimotor_norms
|
|
36
|
+
type: external
|
|
37
|
+
format: csv
|
|
38
|
+
description: Lancaster感觉运动规范数据集,包含39,707个词,覆盖11个感知维度和5个动作维度
|
|
39
|
+
notes:
|
|
40
|
+
- Lancaster感觉运动规范CSV已打包在压缩包中
|
|
41
|
+
- 测试提示来源于标准LLM评估数据集
|
|
42
|
+
credential_requirements:
|
|
43
|
+
mode: none
|
|
44
|
+
items: []
|
|
45
|
+
notes:
|
|
46
|
+
- 无需外部API密钥
|
|
47
|
+
- 本地LLM权重需另行获取(例如通过HuggingFace获取Llama-3.1-8B)
|
|
48
|
+
resources:
|
|
49
|
+
minimum:
|
|
50
|
+
cpu_cores: 8
|
|
51
|
+
ram_gb: 32
|
|
52
|
+
disk_gb: 80
|
|
53
|
+
gpu_count: 1
|
|
54
|
+
gpu_vram_gb: 24
|
|
55
|
+
notes: 支持量化模型(GPTQ 4位)以降低显存需求
|
|
56
|
+
recommended:
|
|
57
|
+
cpu_cores: 16
|
|
58
|
+
ram_gb: 64
|
|
59
|
+
disk_gb: 150
|
|
60
|
+
gpu_count: 1
|
|
61
|
+
gpu_vram_gb: 48
|
|
62
|
+
notes: 生成实验建议使用全精度;内存受限场景可使用4位量化
|
|
63
|
+
environment:
|
|
64
|
+
python: '3.10'
|
|
65
|
+
cuda: '11.8'
|
|
66
|
+
pytorch: 2.1.0
|
|
67
|
+
flash_attn: 2.x
|
|
68
|
+
key_packages:
|
|
69
|
+
- transformers
|
|
70
|
+
- torch
|
|
71
|
+
- numpy
|
|
72
|
+
- pandas
|
|
73
|
+
- spacy
|
|
74
|
+
- datasets
|
|
75
|
+
- tqdm
|
|
76
|
+
- bitsandbytes
|
|
77
|
+
- auto-gptq
|
|
78
|
+
notes:
|
|
79
|
+
- 需要spacy en_core_web_sm模型:python -m spacy download en_core_web_sm
|
|
80
|
+
- T5改写攻击模型通过transformers加载(T5-small或T5-base)
|
|
81
|
+
- Llama模型通过AutoModelForCausalLM加载,可选BitsAndBytesConfig实现4位量化
|
|
82
|
+
- 完整依赖规范请参阅打包的requirements.txt
|
|
83
|
+
risk_flags:
|
|
84
|
+
- high_vram
|
|
85
|
+
- extended_runtime
|
|
86
|
+
risk_notes:
|
|
87
|
+
- 生成实验需要大量GPU显存用于LLM推理
|
|
88
|
+
- 包含攻击模拟的完整评估可能需要数小时,具体取决于数据集大小
|
|
89
|
+
- 软攻击(改写)的batch_size参数会影响显存使用
|
|
90
|
+
recommended_when: '当您需要一个专注于水印检测质量的LLM安全评估管道、想要评估集成检测对改写的鲁棒性,或需要比较单特征与多特征水印方案时,使用此基准测试。适用于评估纯检测方法,无需模型微调。
|
|
91
|
+
|
|
92
|
+
'
|
|
93
|
+
not_recommended_when: '如果您无法在本地托管开放LLM检查点、缺乏足够显存的GPU资源,或需要一个不包含生成和检测循环的基准测试,请勿使用此基准测试。不适用于评估文本分类或困惑度-based检测方法。
|
|
94
|
+
|
|
95
|
+
'
|
|
96
|
+
paper:
|
|
97
|
+
title: Ensemble Watermarks for Large Language Models
|
|
98
|
+
authors:
|
|
99
|
+
- Georg Niess
|
|
100
|
+
- Roman Kern
|
|
101
|
+
venue: arXiv preprint
|
|
102
|
+
year: 2024
|
|
103
|
+
url: https://arxiv.org/abs/2411.19563
|
|
104
|
+
code_url: https://github.com/
|
|
105
|
+
abstract: '随着大语言模型达到类似人类的流畅度,可靠地区分AI生成的文本与人类撰写的内容变得越来越困难。我们提出了一种多特征水印生成方法,将藏头诗和感觉运动规范与已建立的红绿水印相结合,实现了98%的检测率。经过改写攻击后,性能保持在95%的检测率,而红绿水印单独仅有49%。
|
|
106
|
+
|
|
107
|
+
'
|
|
108
|
+
download:
|
|
109
|
+
url: https://github.com/ResearAI/DeepScientist/releases/download/aisb-v0.0.1/aisb.t3.034_ensemblewm.zip
|
|
110
|
+
archive_type: zip
|
|
111
|
+
local_dir_name: aisb-t3-034-ensemblewm
|
|
112
|
+
provider: github_release
|
|
113
|
+
repo: ResearAI/DeepScientist
|
|
114
|
+
tag: aisb-v0.0.1
|
|
115
|
+
asset_name: aisb.t3.034_ensemblewm.zip
|
|
116
|
+
sha256: 8eae0196937e9b32feade1727d417158061f0741b7b0b80a9f154bdd6aaba079
|
|
117
|
+
size_bytes: 25404700
|
|
118
|
+
display:
|
|
119
|
+
palette_seed: aqua-ink-watermark
|
|
120
|
+
art_style: verification-grid
|
|
121
|
+
accent_priority: high
|
|
122
|
+
tags:
|
|
123
|
+
- watermarking
|
|
124
|
+
- detection
|
|
125
|
+
- llm-safety
|
|
126
|
+
- robustness
|
|
127
|
+
- ensemble-methods
|
|
128
|
+
image_path: ../image/034_aisb.t3.034_ensemblewm.jpg
|
|
129
|
+
metric_contract:
|
|
130
|
+
primary_metric: detection_rate
|
|
131
|
+
origin_path: detection_notebook.ipynb
|
|
132
|
+
source_ref: detection_rate
|
|
133
|
+
evaluation_protocol:
|
|
134
|
+
code_paths:
|
|
135
|
+
- batch_run/run_experiments_soft.py
|
|
136
|
+
- batch_run/run_attack_soft.py
|
|
137
|
+
- detection_notebook.ipynb
|
|
138
|
+
- modules/text_generation.py
|
|
139
|
+
metrics_summary: []
|
|
140
|
+
execution_status: pending
|
|
141
|
+
execution_notes: '静态代码审计确认所有分阶段指标的代码可执行锚点。本打包过程中未执行基准测试。指标值应视为暂定,需等待可信运行时输出。
|
|
142
|
+
|
|
143
|
+
'
|
|
144
|
+
executive_summary: '该基准测试为LLM输出中的集成水印检测提供了一个全面的评估框架。该方法结合令牌级(红绿)、句子级(藏头诗)和语义级(感觉运动)水印特征,以在改写攻击后仍能实现鲁棒检测。统一检测函数可在所有特征组合上运行,支持不同安全-鲁棒性权衡的灵活配置。
|
|
145
|
+
|
|
146
|
+
'
|
|
@@ -0,0 +1,207 @@
|
|
|
1
|
+
schema_version: 1
|
|
2
|
+
id: aisb.t3.035_moralvalueswa
|
|
3
|
+
name: Comparing Moral Values in Western English-speaking societies and LLMs with Word
|
|
4
|
+
Associations
|
|
5
|
+
version: 0.1.0
|
|
6
|
+
one_line: Graph-based moral-value propagation benchmark comparing human and LLM word-association
|
|
7
|
+
networks against Moral Foundation Theory using Spearman correlations on five moral
|
|
8
|
+
dimensions.
|
|
9
|
+
task_description: 'This benchmark evaluates moral-value alignment between humans and
|
|
10
|
+
LLMs by constructing word-association graphs and propagating moral scores from Moral
|
|
11
|
+
Foundation Theory (MFT) seed words through them via a random-walk algorithm. The
|
|
12
|
+
primary task is to optimize the propagation parameters and graph construction to
|
|
13
|
+
maximize Spearman correlation of predicted moral scores against the extended Moral
|
|
14
|
+
Foundations Dictionary (eMFD) across five dimensions: care, sanctity, fairness,
|
|
15
|
+
authority, and loyalty. The benchmark ships pre-computed LLaMA 3.1 word-association
|
|
16
|
+
data (llama_2.1_association.json, llama_2.1_moral.json) so the expensive LLM generation
|
|
17
|
+
step (100 prompts × 12k cues on GPU) can be skipped. The evaluation route (eval.py)
|
|
18
|
+
loads pre-computed moral scores, computes correlations against eMFD, and reports
|
|
19
|
+
per-dimension results. Two external data files must be obtained before running:
|
|
20
|
+
SWOW-EN.R100.csv (human word associations, requires registration at smallworldofwords.org)
|
|
21
|
+
and emfd_scoring.csv (auto-downloaded from GitHub if missing). The experiments.py
|
|
22
|
+
script provides additional analyses including alpha tuning, MAG comparison, and
|
|
23
|
+
cross-graph experiments from the paper.
|
|
24
|
+
|
|
25
|
+
'
|
|
26
|
+
capability_tags:
|
|
27
|
+
- research_code_optimization
|
|
28
|
+
- moral_reasoning
|
|
29
|
+
- graph_analysis
|
|
30
|
+
- llm_evaluation
|
|
31
|
+
- nlp
|
|
32
|
+
aisb_direction: T3
|
|
33
|
+
track_fit:
|
|
34
|
+
- paper_track
|
|
35
|
+
- benchmark_track
|
|
36
|
+
task_mode: evaluation_driven
|
|
37
|
+
requires_execution: true
|
|
38
|
+
requires_paper: true
|
|
39
|
+
integrity_level: cas_plus_canary
|
|
40
|
+
snapshot_status: runnable
|
|
41
|
+
support_level: turnkey
|
|
42
|
+
cost_band: low
|
|
43
|
+
time_band: 2-6h
|
|
44
|
+
difficulty: medium
|
|
45
|
+
data_access: public
|
|
46
|
+
primary_outputs:
|
|
47
|
+
- correlation_care
|
|
48
|
+
- correlation_sanctity
|
|
49
|
+
- correlation_fairness
|
|
50
|
+
- correlation_authority
|
|
51
|
+
- correlation_loyalty
|
|
52
|
+
- moral_value_graphs
|
|
53
|
+
- analysis_report
|
|
54
|
+
launch_profiles:
|
|
55
|
+
- id: graph_eval
|
|
56
|
+
label: Graph Eval (Quick)
|
|
57
|
+
description: 'Run eval.py with pre-computed moral scores to reproduce Table 1 GMN-L
|
|
58
|
+
correlation results. Requires only emfd_scoring.csv (auto-downloaded). CPU-only,
|
|
59
|
+
completes in minutes.
|
|
60
|
+
|
|
61
|
+
'
|
|
62
|
+
- id: full_experiments
|
|
63
|
+
label: Full Experiments
|
|
64
|
+
description: 'Run experiments.py to reproduce alpha tuning, MAG comparison, varying-alpha
|
|
65
|
+
sweeps, and cross-graph analyses from the paper. Requires SWOW-EN.R100.csv for
|
|
66
|
+
human association graph construction. CPU-feasible but benefits from GPU for getMoralValue
|
|
67
|
+
propagation.
|
|
68
|
+
|
|
69
|
+
'
|
|
70
|
+
- id: llm_generation
|
|
71
|
+
label: LLM Association Generation
|
|
72
|
+
description: 'Re-run produce.py to regenerate LLaMA word associations from scratch.
|
|
73
|
+
Requires a HuggingFace token with access to meta-llama/Meta-Llama-3.1-8B-Instruct,
|
|
74
|
+
a GPU with ≥24 GB VRAM, and significant compute time (100 iterations × 12k cues).
|
|
75
|
+
Not required for evaluation; pre-computed outputs are bundled.
|
|
76
|
+
|
|
77
|
+
'
|
|
78
|
+
dataset_download:
|
|
79
|
+
primary_method: mixed
|
|
80
|
+
sources:
|
|
81
|
+
- kind: bundled
|
|
82
|
+
url: null
|
|
83
|
+
access: public
|
|
84
|
+
note: 'Pre-computed LLaMA word associations (llama_2.1.json, llama_2.1_association.json,
|
|
85
|
+
llama_2.1_moral.json), MFD2 dictionary, cue words, and mag_words.json are included
|
|
86
|
+
in the snapshot under data/.
|
|
87
|
+
|
|
88
|
+
'
|
|
89
|
+
- kind: external_download
|
|
90
|
+
url: https://raw.githubusercontent.com/medianeuroscience/emfd/master/dictionaries/emfd_scoring.csv
|
|
91
|
+
access: public
|
|
92
|
+
note: 'eMFD scoring CSV. Auto-downloaded by eval.py if missing. ~200 KB.
|
|
93
|
+
|
|
94
|
+
'
|
|
95
|
+
- kind: external_registration
|
|
96
|
+
url: https://smallworldofwords.org/en/project/research
|
|
97
|
+
access: public
|
|
98
|
+
note: 'SWOW-EN.R100.csv (human word association data). Requires free registration
|
|
99
|
+
at smallworldofwords.org. Needed only for full_experiments and human graph construction.
|
|
100
|
+
|
|
101
|
+
'
|
|
102
|
+
- kind: external_download
|
|
103
|
+
url: https://provalisresearch.com/products/content-analysis-software/wordstat-dictionary/moral-foundations-dictionary/
|
|
104
|
+
access: public
|
|
105
|
+
note: 'MFD 2.0 dictionary (mfd2.txt). Already bundled in data/ but original source
|
|
106
|
+
listed for reference.
|
|
107
|
+
|
|
108
|
+
'
|
|
109
|
+
notes:
|
|
110
|
+
- Total disk footprint including all external files is well under 1 GB.
|
|
111
|
+
- The graph_eval launch profile needs only the bundled files plus emfd_scoring.csv.
|
|
112
|
+
credential_requirements:
|
|
113
|
+
mode: optional
|
|
114
|
+
items:
|
|
115
|
+
- HuggingFace token (only if re-running LLM association generation via produce.py)
|
|
116
|
+
notes:
|
|
117
|
+
- produce.py requires HF_TOKEN with access to meta-llama/Meta-Llama-3.1-8B-Instruct
|
|
118
|
+
- Evaluation and experiments routes do not require any credentials
|
|
119
|
+
resources:
|
|
120
|
+
minimum:
|
|
121
|
+
cpu_cores: 8
|
|
122
|
+
ram_gb: 16
|
|
123
|
+
disk_gb: 20
|
|
124
|
+
gpu_count: 0
|
|
125
|
+
gpu_vram_gb: 0
|
|
126
|
+
recommended:
|
|
127
|
+
cpu_cores: 16
|
|
128
|
+
ram_gb: 32
|
|
129
|
+
disk_gb: 80
|
|
130
|
+
gpu_count: 1
|
|
131
|
+
gpu_vram_gb: 24
|
|
132
|
+
environment:
|
|
133
|
+
python: null
|
|
134
|
+
cuda: null
|
|
135
|
+
pytorch: null
|
|
136
|
+
flash_attn: null
|
|
137
|
+
key_packages:
|
|
138
|
+
- numpy
|
|
139
|
+
- pandas
|
|
140
|
+
- scipy
|
|
141
|
+
- gensim
|
|
142
|
+
- matplotlib
|
|
143
|
+
- tqdm
|
|
144
|
+
- torch
|
|
145
|
+
- transformers
|
|
146
|
+
- huggingface_hub
|
|
147
|
+
notes:
|
|
148
|
+
- CPU-only execution is sufficient for eval.py and most of experiments.py.
|
|
149
|
+
- GPU and transformers/torch are needed only for produce.py (LLM association generation).
|
|
150
|
+
- getMoralValue.py imports torch.cuda but the propagation algorithm itself is array-based.
|
|
151
|
+
- See bundled requirements.txt for the full dependency set.
|
|
152
|
+
risk_flags:
|
|
153
|
+
- external_data_registration
|
|
154
|
+
- optional_credential
|
|
155
|
+
risk_notes:
|
|
156
|
+
- SWOW-EN.R100.csv requires free registration at smallworldofwords.org; without it
|
|
157
|
+
the human association graph cannot be rebuilt and some experiments.py functions
|
|
158
|
+
will fail.
|
|
159
|
+
- emfd_scoring.csv is auto-fetched from GitHub but network access is needed on first
|
|
160
|
+
run.
|
|
161
|
+
- Re-generating LLM associations via produce.py is computationally expensive (GPU
|
|
162
|
+
hours) and requires HuggingFace model access; this is optional since pre-computed
|
|
163
|
+
outputs are bundled.
|
|
164
|
+
- No benchmark execution was performed during the packaging pass; metric values are
|
|
165
|
+
from paper Table 1.
|
|
166
|
+
recommended_when: 'Use this benchmark when you want a lightweight graph-based NLP
|
|
167
|
+
evaluation task focused on moral reasoning that can reuse bundled word-association
|
|
168
|
+
data without retraining or prompting a language model. Good for testing optimization
|
|
169
|
+
of graph propagation algorithms, exploring moral foundation analysis, or comparing
|
|
170
|
+
human vs. LLM conceptual organization.
|
|
171
|
+
|
|
172
|
+
'
|
|
173
|
+
not_recommended_when: 'Do not use this if you need a large-scale end-to-end LLM fine-tuning
|
|
174
|
+
benchmark, if you require vision or multimodal capabilities, or if you cannot obtain
|
|
175
|
+
the SWOW dataset registration and need the full experiments pipeline including human
|
|
176
|
+
association graph construction.
|
|
177
|
+
|
|
178
|
+
'
|
|
179
|
+
paper:
|
|
180
|
+
title: Comparing Moral Values in Western English-speaking societies and LLMs with
|
|
181
|
+
Word Associations
|
|
182
|
+
authors:
|
|
183
|
+
- Chaoyi Xiang
|
|
184
|
+
- Chunhua Liu
|
|
185
|
+
- Simon De Deyne
|
|
186
|
+
- Lea Frermann
|
|
187
|
+
venue: ACL 2025 Main
|
|
188
|
+
year: 2025
|
|
189
|
+
url: https://aclanthology.org/2025.acl-long.177/
|
|
190
|
+
doi: 10.18653/v1/2025.acl-long.177
|
|
191
|
+
download:
|
|
192
|
+
url: https://github.com/ResearAI/DeepScientist/releases/download/aisb-v0.0.1/aisb.t3.035_moralvalueswa.zip
|
|
193
|
+
archive_type: zip
|
|
194
|
+
local_dir_name: paper-35-MoralValuesWA
|
|
195
|
+
provider: github_release
|
|
196
|
+
repo: ResearAI/DeepScientist
|
|
197
|
+
tag: aisb-v0.0.1
|
|
198
|
+
asset_name: aisb.t3.035_moralvalueswa.zip
|
|
199
|
+
sha256: 49fa6c7fa381c4e8059933d6e2aabda5b93a2d0d1eff27940bfd404a356a4b72
|
|
200
|
+
size_bytes: 48717
|
|
201
|
+
commercial:
|
|
202
|
+
annual_fee: null
|
|
203
|
+
display:
|
|
204
|
+
palette_seed: sepia-emerald-values
|
|
205
|
+
art_style: social-science
|
|
206
|
+
accent_priority: medium
|
|
207
|
+
image_path: ../image/035_aisb.t3.035_moralvalueswa.jpg
|
|
@@ -0,0 +1,165 @@
|
|
|
1
|
+
schema_version: 1
|
|
2
|
+
id: aisb.t3.035_moralvalueswa
|
|
3
|
+
name: 通过词汇联想比较西方英语社会与LLM的道德价值观
|
|
4
|
+
version: 0.1.0
|
|
5
|
+
one_line: 基于图的道德价值观传播基准,通过Spearman相关分析在五个道德维度上比较人类与LLM词汇联想网络与道德基础理论。
|
|
6
|
+
task_description: '本基准通过构建词汇联想图并使用随机游走算法将道德基础理论(MFT)种子词的道德分数传播到图中,来评估人类与LLM之间的道德价值观一致性。主要任务是优化传播参数和图构建,以最大化预测道德分数与扩展道德基础词典(eMFD)在五个维度上的Spearman相关性:关怀、圣洁、公平、权威和忠诚。基准提供了预计算的LLaMA 3.1词汇联想数据(llama_2.1_association.json、llama_2.1_moral.json),因此可以跳过昂贵的LLM生成步骤(100个提示词×12k线索在GPU上运行)。评估路由(eval.py)加载预计算的道德分数,计算与eMFD的相关性,并报告每个维度的结果。运行前必须获取两个外部数据文件:SWOW-EN.R100.csv(人类词汇联想数据,需在smallworldofwords.org注册)和emfd_scoring.csv(缺失时自动从GitHub下载)。experiments.py脚本提供额外分析,包括alpha调优、MAG比较和论文中的跨图实验。
|
|
7
|
+
|
|
8
|
+
'
|
|
9
|
+
capability_tags:
|
|
10
|
+
- research_code_optimization
|
|
11
|
+
- moral_reasoning
|
|
12
|
+
- graph_analysis
|
|
13
|
+
- llm_evaluation
|
|
14
|
+
- nlp
|
|
15
|
+
aisb_direction: T3
|
|
16
|
+
track_fit:
|
|
17
|
+
- paper_track
|
|
18
|
+
- benchmark_track
|
|
19
|
+
task_mode: evaluation_driven
|
|
20
|
+
requires_execution: true
|
|
21
|
+
requires_paper: true
|
|
22
|
+
integrity_level: cas_plus_canary
|
|
23
|
+
snapshot_status: runnable
|
|
24
|
+
support_level: turnkey
|
|
25
|
+
cost_band: low
|
|
26
|
+
time_band: 2-6h
|
|
27
|
+
difficulty: medium
|
|
28
|
+
data_access: public
|
|
29
|
+
primary_outputs:
|
|
30
|
+
- correlation_care
|
|
31
|
+
- correlation_sanctity
|
|
32
|
+
- correlation_fairness
|
|
33
|
+
- correlation_authority
|
|
34
|
+
- correlation_loyalty
|
|
35
|
+
- moral_value_graphs
|
|
36
|
+
- analysis_report
|
|
37
|
+
launch_profiles:
|
|
38
|
+
- id: graph_eval
|
|
39
|
+
label: 图评估(快速)
|
|
40
|
+
description: '运行eval.py,使用预计算的道德分数复现表1中的GMN-L相关性结果。只需emfd_scoring.csv(自动下载)。仅需CPU,几分钟内完成。
|
|
41
|
+
|
|
42
|
+
'
|
|
43
|
+
- id: full_experiments
|
|
44
|
+
label: 完整实验
|
|
45
|
+
description: '运行experiments.py复现论文中的alpha调优、MAG比较、变alpha扫描和跨图分析。需SWOW-EN.R100.csv用于人类联想图构建。CPU可执行,但getMoralValue传播受益于GPU。
|
|
46
|
+
|
|
47
|
+
'
|
|
48
|
+
- id: llm_generation
|
|
49
|
+
label: LLM联想生成
|
|
50
|
+
description: '重新运行produce.py从头生成LLaMA词汇联想。需HuggingFace令牌访问meta-llama/Meta-Llama-3.1-8B-Instruct,需≥24GB显存的GPU,以及大量计算时间(100次迭代×12k线索)。评估不需要此步骤,预计算输出已捆绑。
|
|
51
|
+
|
|
52
|
+
'
|
|
53
|
+
dataset_download:
|
|
54
|
+
primary_method: mixed
|
|
55
|
+
sources:
|
|
56
|
+
- kind: bundled
|
|
57
|
+
url: null
|
|
58
|
+
access: public
|
|
59
|
+
note: '预计算的LLaMA词汇联想数据(llama_2.1.json、llama_2.1_association.json、llama_2.1_moral.json)、MFD2词典、线索词和mag_words.json已包含在快照的data/目录下。
|
|
60
|
+
|
|
61
|
+
'
|
|
62
|
+
- kind: external_download
|
|
63
|
+
url: https://raw.githubusercontent.com/medianeuroscience/emfd/master/dictionaries/emfd_scoring.csv
|
|
64
|
+
access: public
|
|
65
|
+
note: 'eMFD评分CSV。eval.py缺失时自动下载。约200KB。
|
|
66
|
+
|
|
67
|
+
'
|
|
68
|
+
- kind: external_registration
|
|
69
|
+
url: https://smallworldofwords.org/en/project/research
|
|
70
|
+
access: public
|
|
71
|
+
note: 'SWOW-EN.R100.csv(人类词汇联想数据)。需在smallworldofwords.org免费注册。仅在full_experiments和人类图构建时需要。
|
|
72
|
+
|
|
73
|
+
'
|
|
74
|
+
- kind: external_download
|
|
75
|
+
url: https://provalisresearch.com/products/content-analysis-software/wordstat-dictionary/moral-foundations-dictionary/
|
|
76
|
+
access: public
|
|
77
|
+
note: 'MFD 2.0词典(mfd2.txt)。已捆绑在data/中,列为原始参考来源。
|
|
78
|
+
|
|
79
|
+
'
|
|
80
|
+
notes:
|
|
81
|
+
- 包含所有外部文件的总磁盘占用远低于1GB。
|
|
82
|
+
- graph_eval启动配置只需捆绑文件加上emfd_scoring.csv。
|
|
83
|
+
credential_requirements:
|
|
84
|
+
mode: optional
|
|
85
|
+
items:
|
|
86
|
+
- HuggingFace令牌(仅在重新运行produce.py生成LLM联想时需要)
|
|
87
|
+
notes:
|
|
88
|
+
- produce.py需要具有meta-llama/Meta-Llama-3.1-8B-Instruct访问权限的HF_TOKEN
|
|
89
|
+
- 评估和实验路由不需要任何凭据
|
|
90
|
+
resources:
|
|
91
|
+
minimum:
|
|
92
|
+
cpu_cores: 8
|
|
93
|
+
ram_gb: 16
|
|
94
|
+
disk_gb: 20
|
|
95
|
+
gpu_count: 0
|
|
96
|
+
gpu_vram_gb: 0
|
|
97
|
+
recommended:
|
|
98
|
+
cpu_cores: 16
|
|
99
|
+
ram_gb: 32
|
|
100
|
+
disk_gb: 80
|
|
101
|
+
gpu_count: 1
|
|
102
|
+
gpu_vram_gb: 24
|
|
103
|
+
environment:
|
|
104
|
+
python: null
|
|
105
|
+
cuda: null
|
|
106
|
+
pytorch: null
|
|
107
|
+
flash_attn: null
|
|
108
|
+
key_packages:
|
|
109
|
+
- numpy
|
|
110
|
+
- pandas
|
|
111
|
+
- scipy
|
|
112
|
+
- gensim
|
|
113
|
+
- matplotlib
|
|
114
|
+
- tqdm
|
|
115
|
+
- torch
|
|
116
|
+
- transformers
|
|
117
|
+
- huggingface_hub
|
|
118
|
+
notes:
|
|
119
|
+
- 仅CPU执行足以运行eval.py和experiments.py的大部分内容。
|
|
120
|
+
- GPU和transformers/torch仅在produce.py(LLM联想生成)时需要。
|
|
121
|
+
- getMoralValue.py导入torch.cuda,但传播算法本身是基于数组的。
|
|
122
|
+
- 参见捆绑的requirements.txt获取完整依赖集。
|
|
123
|
+
risk_flags:
|
|
124
|
+
- external_data_registration
|
|
125
|
+
- optional_credential
|
|
126
|
+
risk_notes:
|
|
127
|
+
- SWOW-EN.R100.csv需要在smallworldofwords.org免费注册;没有它将无法重建人类联想图,部分experiments.py函数会失败。
|
|
128
|
+
- emfd_scoring.csv从GitHub自动获取,但首次运行需要网络访问。
|
|
129
|
+
- 通过produce.py重新生成LLM联想计算成本高昂(GPU小时级),且需要HuggingFace模型访问权限;这是可选的,因为预计算输出已捆绑。
|
|
130
|
+
- 打包过程中未执行基准测试运行;指标值来自论文表1。
|
|
131
|
+
recommended_when: '当您需要一个轻量级的基于图的NLP评估任务来研究道德推理、且可以重用捆绑的词汇联想数据而无需重新训练或提示语言模型时,可以使用此基准。适合测试图传播算法的优化、探索道德基础分析,或比较人类与LLM的概念组织方式。
|
|
132
|
+
|
|
133
|
+
'
|
|
134
|
+
not_recommended_when: '如果需要大规模端到端LLM微调基准、需要视觉或多模态能力,或无法获取SWOW数据集注册且需要包含人类联想图构建的完整实验管道时,请勿使用。
|
|
135
|
+
|
|
136
|
+
'
|
|
137
|
+
paper:
|
|
138
|
+
title: Comparing Moral Values in Western English-speaking societies and LLMs with
|
|
139
|
+
Word Associations
|
|
140
|
+
authors:
|
|
141
|
+
- Chaoyi Xiang
|
|
142
|
+
- Chunhua Liu
|
|
143
|
+
- Simon De Deyne
|
|
144
|
+
- Lea Frermann
|
|
145
|
+
venue: ACL 2025 Main
|
|
146
|
+
year: 2025
|
|
147
|
+
url: https://aclanthology.org/2025.acl-long.177/
|
|
148
|
+
doi: 10.18653/v1/2025.acl-long.177
|
|
149
|
+
download:
|
|
150
|
+
url: https://github.com/ResearAI/DeepScientist/releases/download/aisb-v0.0.1/aisb.t3.035_moralvalueswa.zip
|
|
151
|
+
archive_type: zip
|
|
152
|
+
local_dir_name: paper-35-MoralValuesWA
|
|
153
|
+
provider: github_release
|
|
154
|
+
repo: ResearAI/DeepScientist
|
|
155
|
+
tag: aisb-v0.0.1
|
|
156
|
+
asset_name: aisb.t3.035_moralvalueswa.zip
|
|
157
|
+
sha256: 49fa6c7fa381c4e8059933d6e2aabda5b93a2d0d1eff27940bfd404a356a4b72
|
|
158
|
+
size_bytes: 48717
|
|
159
|
+
commercial:
|
|
160
|
+
annual_fee: null
|
|
161
|
+
display:
|
|
162
|
+
palette_seed: sepia-emerald-values
|
|
163
|
+
art_style: social-science
|
|
164
|
+
accent_priority: medium
|
|
165
|
+
image_path: ../image/035_aisb.t3.035_moralvalueswa.jpg
|