@researai/deepscientist 1.5.17 → 1.6.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/AGENTS.md +309 -130
- package/AISB/catalog/aisb.b1.agentic_coding.yaml +244 -0
- package/AISB/catalog/aisb.b10.climate_earth.yaml +235 -0
- package/AISB/catalog/aisb.b11.model_efficiency.yaml +231 -0
- package/AISB/catalog/aisb.b12.embodied_ai.yaml +238 -0
- package/AISB/catalog/aisb.b2.agent_systems.yaml +229 -0
- package/AISB/catalog/aisb.b3.self_evolving_rl.yaml +237 -0
- package/AISB/catalog/aisb.b4.lm_reasoning.yaml +240 -0
- package/AISB/catalog/aisb.b5.math_proof.yaml +235 -0
- package/AISB/catalog/aisb.b6.research_process.yaml +243 -0
- package/AISB/catalog/aisb.b7.multimodal_fusion.yaml +232 -0
- package/AISB/catalog/aisb.b8.lifesci_drug.yaml +275 -0
- package/AISB/catalog/aisb.b9.material_science.yaml +237 -0
- package/AISB/catalog/aisb.t3.001_savvy.yaml +159 -0
- package/AISB/catalog/aisb.t3.001_savvy.zh.yaml +121 -0
- package/AISB/catalog/aisb.t3.002_pinet.yaml +189 -0
- package/AISB/catalog/aisb.t3.002_pinet.zh.yaml +130 -0
- package/AISB/catalog/aisb.t3.004_decentralattn.yaml +184 -0
- package/AISB/catalog/aisb.t3.004_decentralattn.zh.yaml +153 -0
- package/AISB/catalog/aisb.t3.005_tsae.yaml +193 -0
- package/AISB/catalog/aisb.t3.005_tsae.zh.yaml +139 -0
- package/AISB/catalog/aisb.t3.006_physense.yaml +194 -0
- package/AISB/catalog/aisb.t3.006_physense.zh.yaml +118 -0
- package/AISB/catalog/aisb.t3.007_reasoningiqa.yaml +169 -0
- package/AISB/catalog/aisb.t3.007_reasoningiqa.zh.yaml +133 -0
- package/AISB/catalog/aisb.t3.008_meanflows.yaml +188 -0
- package/AISB/catalog/aisb.t3.008_meanflows.zh.yaml +140 -0
- package/AISB/catalog/aisb.t3.009_scoremissing.yaml +179 -0
- package/AISB/catalog/aisb.t3.009_scoremissing.zh.yaml +119 -0
- package/AISB/catalog/aisb.t3.010_suitabilityfilter.yaml +221 -0
- package/AISB/catalog/aisb.t3.010_suitabilityfilter.zh.yaml +141 -0
- package/AISB/catalog/aisb.t3.011_osd.yaml +206 -0
- package/AISB/catalog/aisb.t3.011_osd.zh.yaml +163 -0
- package/AISB/catalog/aisb.t3.012_efficientqat.yaml +206 -0
- package/AISB/catalog/aisb.t3.012_efficientqat.zh.yaml +159 -0
- package/AISB/catalog/aisb.t3.013_appl.yaml +152 -0
- package/AISB/catalog/aisb.t3.013_appl.zh.yaml +126 -0
- package/AISB/catalog/aisb.t3.014_piguard.yaml +207 -0
- package/AISB/catalog/aisb.t3.014_piguard.zh.yaml +164 -0
- package/AISB/catalog/aisb.t3.015_frspec.yaml +209 -0
- package/AISB/catalog/aisb.t3.015_frspec.zh.yaml +163 -0
- package/AISB/catalog/aisb.t3.016_mathfusion.yaml +166 -0
- package/AISB/catalog/aisb.t3.016_mathfusion.zh.yaml +145 -0
- package/AISB/catalog/aisb.t3.017_multimodalglp.yaml +171 -0
- package/AISB/catalog/aisb.t3.017_multimodalglp.zh.yaml +122 -0
- package/AISB/catalog/aisb.t3.018_cotsynth.yaml +206 -0
- package/AISB/catalog/aisb.t3.018_cotsynth.zh.yaml +162 -0
- package/AISB/catalog/aisb.t3.019_dyscaleut.yaml +211 -0
- package/AISB/catalog/aisb.t3.019_dyscaleut.zh.yaml +148 -0
- package/AISB/catalog/aisb.t3.020_aristotle.yaml +173 -0
- package/AISB/catalog/aisb.t3.020_aristotle.zh.yaml +119 -0
- package/AISB/catalog/aisb.t3.021_tokenrecycling.yaml +160 -0
- package/AISB/catalog/aisb.t3.021_tokenrecycling.zh.yaml +129 -0
- package/AISB/catalog/aisb.t3.022_chainofreasoning.yaml +204 -0
- package/AISB/catalog/aisb.t3.022_chainofreasoning.zh.yaml +161 -0
- package/AISB/catalog/aisb.t3.023_guidedembed.yaml +211 -0
- package/AISB/catalog/aisb.t3.023_guidedembed.zh.yaml +189 -0
- package/AISB/catalog/aisb.t3.024_outputcentric.yaml +148 -0
- package/AISB/catalog/aisb.t3.024_outputcentric.zh.yaml +131 -0
- package/AISB/catalog/aisb.t3.025_deeper.yaml +143 -0
- package/AISB/catalog/aisb.t3.025_deeper.zh.yaml +116 -0
- package/AISB/catalog/aisb.t3.026_gartkg.yaml +195 -0
- package/AISB/catalog/aisb.t3.026_gartkg.zh.yaml +127 -0
- package/AISB/catalog/aisb.t3.027_citeeval.yaml +182 -0
- package/AISB/catalog/aisb.t3.027_citeeval.zh.yaml +135 -0
- package/AISB/catalog/aisb.t3.028_sbam.yaml +206 -0
- package/AISB/catalog/aisb.t3.028_sbam.zh.yaml +166 -0
- package/AISB/catalog/aisb.t3.029_cdqgeoembed.yaml +224 -0
- package/AISB/catalog/aisb.t3.029_cdqgeoembed.zh.yaml +142 -0
- package/AISB/catalog/aisb.t3.030_processrm.yaml +211 -0
- package/AISB/catalog/aisb.t3.030_processrm.zh.yaml +166 -0
- package/AISB/catalog/aisb.t3.031_circuitstability.yaml +172 -0
- package/AISB/catalog/aisb.t3.031_circuitstability.zh.yaml +134 -0
- package/AISB/catalog/aisb.t3.032_ptsolver.yaml +169 -0
- package/AISB/catalog/aisb.t3.032_ptsolver.zh.yaml +135 -0
- package/AISB/catalog/aisb.t3.033_gcse.yaml +144 -0
- package/AISB/catalog/aisb.t3.033_gcse.zh.yaml +126 -0
- package/AISB/catalog/aisb.t3.034_ensemblewm.yaml +183 -0
- package/AISB/catalog/aisb.t3.034_ensemblewm.zh.yaml +146 -0
- package/AISB/catalog/aisb.t3.035_moralvalueswa.yaml +207 -0
- package/AISB/catalog/aisb.t3.035_moralvalueswa.zh.yaml +165 -0
- package/AISB/catalog/aisb.t3.036_weakstrongpref.yaml +210 -0
- package/AISB/catalog/aisb.t3.036_weakstrongpref.zh.yaml +194 -0
- package/AISB/catalog/aisb.t3.037_dementiamask.yaml +172 -0
- package/AISB/catalog/aisb.t3.037_dementiamask.zh.yaml +132 -0
- package/AISB/catalog/aisb.t3.038_tinysam.yaml +284 -0
- package/AISB/catalog/aisb.t3.038_tinysam.zh.yaml +240 -0
- package/AISB/catalog/aisb.t3.039_calf.yaml +224 -0
- package/AISB/catalog/aisb.t3.039_calf.zh.yaml +194 -0
- package/AISB/catalog/aisb.t3.040_graniteguardian.yaml +199 -0
- package/AISB/catalog/aisb.t3.040_graniteguardian.zh.yaml +174 -0
- package/AISB/catalog/aisb.t3.041_amdm.yaml +149 -0
- package/AISB/catalog/aisb.t3.041_amdm.zh.yaml +137 -0
- package/AISB/catalog/aisb.t3.042_xpatch.yaml +216 -0
- package/AISB/catalog/aisb.t3.042_xpatch.zh.yaml +182 -0
- package/AISB/catalog/aisb.t3.043_vhm.yaml +268 -0
- package/AISB/catalog/aisb.t3.043_vhm.zh.yaml +193 -0
- package/AISB/catalog/aisb.t3.044_rgvi.yaml +224 -0
- package/AISB/catalog/aisb.t3.044_rgvi.zh.yaml +176 -0
- package/AISB/catalog/aisb.t3.045_pslstm.yaml +203 -0
- package/AISB/catalog/aisb.t3.045_pslstm.zh.yaml +179 -0
- package/AISB/catalog/aisb.t3.046_nonstatts.yaml +208 -0
- package/AISB/catalog/aisb.t3.046_nonstatts.zh.yaml +194 -0
- package/AISB/catalog/aisb.t3.047_timepfn.yaml +156 -0
- package/AISB/catalog/aisb.t3.047_timepfn.zh.yaml +124 -0
- package/AISB/catalog/aisb.t3.048_proxyspex.yaml +148 -0
- package/AISB/catalog/aisb.t3.048_proxyspex.zh.yaml +125 -0
- package/AISB/catalog/aisb.t3.049_hogwildinference.yaml +183 -0
- package/AISB/catalog/aisb.t3.049_hogwildinference.zh.yaml +138 -0
- package/AISB/catalog/aisb.t3.050_causalpfn.yaml +214 -0
- package/AISB/catalog/aisb.t3.050_causalpfn.zh.yaml +190 -0
- package/AISB/catalog/aisb.t3.051_flashtp.yaml +169 -0
- package/AISB/catalog/aisb.t3.051_flashtp.zh.yaml +124 -0
- package/AISB/catalog/aisb.t3.052_nsdiff.yaml +155 -0
- package/AISB/catalog/aisb.t3.052_nsdiff.zh.yaml +138 -0
- package/AISB/catalog/aisb.t3.053_k2vae.yaml +158 -0
- package/AISB/catalog/aisb.t3.053_k2vae.zh.yaml +132 -0
- package/AISB/catalog/aisb.t3.054_timebase.yaml +178 -0
- package/AISB/catalog/aisb.t3.054_timebase.zh.yaml +158 -0
- package/AISB/catalog/aisb.t3.055_csbrain.yaml +238 -0
- package/AISB/catalog/aisb.t3.055_csbrain.zh.yaml +184 -0
- package/AISB/catalog/aisb.t3.056_infosam.yaml +224 -0
- package/AISB/catalog/aisb.t3.056_infosam.zh.yaml +189 -0
- package/AISB/catalog/aisb.t3.057_mdreid.yaml +129 -0
- package/AISB/catalog/aisb.t3.057_mdreid.zh.yaml +117 -0
- package/AISB/catalog/aisb.t3.058_mindglitch.yaml +171 -0
- package/AISB/catalog/aisb.t3.058_mindglitch.zh.yaml +145 -0
- package/AISB/catalog/aisb.t3.059_selfsupervised.yaml +154 -0
- package/AISB/catalog/aisb.t3.059_selfsupervised.zh.yaml +125 -0
- package/AISB/catalog/aisb.t3.060_iaggad.yaml +121 -0
- package/AISB/catalog/aisb.t3.060_iaggad.zh.yaml +100 -0
- package/AISB/catalog/aisb.t3.061_hsgkn.yaml +136 -0
- package/AISB/catalog/aisb.t3.061_hsgkn.zh.yaml +113 -0
- package/AISB/catalog/aisb.t3.062_visionts.yaml +237 -0
- package/AISB/catalog/aisb.t3.062_visionts.zh.yaml +216 -0
- package/AISB/catalog/aisb.t3.063_tsrag.yaml +162 -0
- package/AISB/catalog/aisb.t3.063_tsrag.zh.yaml +138 -0
- package/AISB/catalog/aisb.t3.064_pir.yaml +221 -0
- package/AISB/catalog/aisb.t3.064_pir.zh.yaml +197 -0
- package/AISB/catalog/aisb.t3.065_proteinbinding.yaml +234 -0
- package/AISB/catalog/aisb.t3.065_proteinbinding.zh.yaml +167 -0
- package/AISB/catalog/aisb.t3.066_tropicalattention.yaml +267 -0
- package/AISB/catalog/aisb.t3.066_tropicalattention.zh.yaml +229 -0
- package/AISB/catalog/aisb.t3.067_kanad.yaml +193 -0
- package/AISB/catalog/aisb.t3.067_kanad.zh.yaml +167 -0
- package/AISB/catalog/aisb.t3.068_sempo.yaml +187 -0
- package/AISB/catalog/aisb.t3.068_sempo.zh.yaml +148 -0
- package/AISB/catalog/aisb.t3.069_treehfd.yaml +129 -0
- package/AISB/catalog/aisb.t3.069_treehfd.zh.yaml +111 -0
- package/AISB/catalog/aisb.t3.070_certifiedunlearning.yaml +224 -0
- package/AISB/catalog/aisb.t3.070_certifiedunlearning.zh.yaml +171 -0
- package/AISB/catalog/aisb.t3.071_neuralmjd.yaml +142 -0
- package/AISB/catalog/aisb.t3.071_neuralmjd.zh.yaml +120 -0
- package/AISB/catalog/aisb.t3.072_fedgmt.yaml +181 -0
- package/AISB/catalog/aisb.t3.072_fedgmt.zh.yaml +158 -0
- package/AISB/catalog/aisb.t3.073_rld.yaml +161 -0
- package/AISB/catalog/aisb.t3.073_rld.zh.yaml +129 -0
- package/AISB/catalog/aisb.t3.074_lsvi.yaml +163 -0
- package/AISB/catalog/aisb.t3.074_lsvi.zh.yaml +129 -0
- package/AISB/catalog/aisb.t3.075_treeslicedentropy.yaml +201 -0
- package/AISB/catalog/aisb.t3.075_treeslicedentropy.zh.yaml +148 -0
- package/AISB/catalog/aisb.t3.076_aanet.yaml +169 -0
- package/AISB/catalog/aisb.t3.076_aanet.zh.yaml +129 -0
- package/AISB/catalog/aisb.t3.077_cmnn.yaml +199 -0
- package/AISB/catalog/aisb.t3.077_cmnn.zh.yaml +165 -0
- package/AISB/catalog/aisb.t3.078_conformalanomaly.yaml +146 -0
- package/AISB/catalog/aisb.t3.078_conformalanomaly.zh.yaml +117 -0
- package/AISB/catalog/aisb.t3.079_dpfkmeans.yaml +131 -0
- package/AISB/catalog/aisb.t3.079_dpfkmeans.zh.yaml +104 -0
- package/AISB/catalog/aisb.t3.080_latentscorereweight.yaml +169 -0
- package/AISB/catalog/aisb.t3.080_latentscorereweight.zh.yaml +123 -0
- package/AISB/catalog/aisb.t3.081_qmamba.yaml +150 -0
- package/AISB/catalog/aisb.t3.081_qmamba.zh.yaml +117 -0
- package/AISB/catalog/aisb.t3.082_onlinellmrouting.yaml +160 -0
- package/AISB/catalog/aisb.t3.082_onlinellmrouting.zh.yaml +133 -0
- package/AISB/catalog/aisb.t3.083_starformer.yaml +178 -0
- package/AISB/catalog/aisb.t3.083_starformer.zh.yaml +140 -0
- package/AISB/catalog/aisb.t3.084_ift.yaml +139 -0
- package/AISB/catalog/aisb.t3.084_ift.zh.yaml +111 -0
- package/AISB/catalog/aisb.t3.085_neuralsurv.yaml +183 -0
- package/AISB/catalog/aisb.t3.085_neuralsurv.zh.yaml +143 -0
- package/AISB/catalog/aisb.t3.086_stella.yaml +197 -0
- package/AISB/catalog/aisb.t3.086_stella.zh.yaml +142 -0
- package/AISB/catalog/aisb.t3.087_moses.yaml +167 -0
- package/AISB/catalog/aisb.t3.087_moses.zh.yaml +132 -0
- package/AISB/catalog/aisb.t3.088_channelnorm.yaml +140 -0
- package/AISB/catalog/aisb.t3.088_channelnorm.zh.yaml +109 -0
- package/AISB/catalog/aisb.t3.089_causalvelocity.yaml +730 -0
- package/AISB/catalog/aisb.t3.089_causalvelocity.zh.yaml +668 -0
- package/AISB/catalog/aisb.t3.090_rstib.yaml +144 -0
- package/AISB/catalog/aisb.t3.090_rstib.zh.yaml +109 -0
- package/AISB/catalog/aisb.t3.091_timeawarecausal.yaml +132 -0
- package/AISB/catalog/aisb.t3.091_timeawarecausal.zh.yaml +107 -0
- package/AISB/catalog/aisb.t3.092_kmeanslocalopt.yaml +138 -0
- package/AISB/catalog/aisb.t3.092_kmeanslocalopt.zh.yaml +110 -0
- package/AISB/catalog/aisb.t3.093_fedwmsam.yaml +134 -0
- package/AISB/catalog/aisb.t3.093_fedwmsam.zh.yaml +106 -0
- package/AISB/catalog/aisb.t3.094_boundre.yaml +147 -0
- package/AISB/catalog/aisb.t3.094_boundre.zh.yaml +114 -0
- package/AISB/catalog/aisb.t3.095_fastfeaturecp.yaml +153 -0
- package/AISB/catalog/aisb.t3.095_fastfeaturecp.zh.yaml +118 -0
- package/AISB/catalog/aisb.t3.096_m3svm.yaml +189 -0
- package/AISB/catalog/aisb.t3.096_m3svm.zh.yaml +149 -0
- package/AISB/catalog/aisb.t3.097_wassersteintl.yaml +212 -0
- package/AISB/catalog/aisb.t3.097_wassersteintl.zh.yaml +169 -0
- package/AISB/catalog/aisb.t3.098_xmahalanobis.yaml +171 -0
- package/AISB/catalog/aisb.t3.098_xmahalanobis.zh.yaml +127 -0
- package/AISB/catalog/aisb.t3.099_ollalanding.yaml +248 -0
- package/AISB/catalog/aisb.t3.099_ollalanding.zh.yaml +182 -0
- package/AISB/catalog/aisb.t3.100_invmissingdata.yaml +179 -0
- package/AISB/catalog/aisb.t3.100_invmissingdata.zh.yaml +150 -0
- package/AISB/catalog/aisb.t3.101_acia.yaml +164 -0
- package/AISB/catalog/aisb.t3.101_acia.zh.yaml +109 -0
- package/AISB/catalog/aisb.t3.102_stochasticff.yaml +178 -0
- package/AISB/catalog/aisb.t3.102_stochasticff.zh.yaml +130 -0
- package/AISB/catalog/aisb.t3.103_qdcp.yaml +150 -0
- package/AISB/catalog/aisb.t3.103_qdcp.zh.yaml +116 -0
- package/AISB/catalog/aisb.t3.104_balancedactiveinf.yaml +137 -0
- package/AISB/catalog/aisb.t3.104_balancedactiveinf.zh.yaml +104 -0
- package/AISB/catalog/aisb.t3.105_binaryclasseval.yaml +161 -0
- package/AISB/catalog/aisb.t3.105_binaryclasseval.zh.yaml +130 -0
- package/AISB/image/001_aisb.t3.001_savvy.jpg +0 -0
- package/AISB/image/002_aisb.t3.002_pinet.jpg +0 -0
- package/AISB/image/003_aisb.t3.003_dmsqd.jpg +0 -0
- package/AISB/image/004_aisb.t3.004_decentralattn.jpg +0 -0
- package/AISB/image/005_aisb.t3.005_tsae.jpg +0 -0
- package/AISB/image/006_aisb.t3.006_physense.jpg +0 -0
- package/AISB/image/007_aisb.t3.007_reasoningiqa.jpg +0 -0
- package/AISB/image/008_aisb.t3.008_meanflows.jpg +0 -0
- package/AISB/image/009_aisb.t3.009_scoremissing.jpg +0 -0
- package/AISB/image/010_aisb.t3.010_suitabilityfilter.jpg +0 -0
- package/AISB/image/011_aisb.t3.011_osd.jpg +0 -0
- package/AISB/image/012_aisb.t3.012_efficientqat.jpg +0 -0
- package/AISB/image/013_aisb.t3.013_appl.jpg +0 -0
- package/AISB/image/014_aisb.t3.014_piguard.jpg +0 -0
- package/AISB/image/015_aisb.t3.015_frspec.jpg +0 -0
- package/AISB/image/016_aisb.t3.016_mathfusion.jpg +0 -0
- package/AISB/image/017_aisb.t3.017_multimodalglp.jpg +0 -0
- package/AISB/image/018_aisb.t3.018_cotsynth.jpg +0 -0
- package/AISB/image/019_aisb.t3.019_dyscaleut.jpg +0 -0
- package/AISB/image/020_aisb.t3.020_aristotle.jpg +0 -0
- package/AISB/image/021_aisb.t3.021_tokenrecycling.jpg +0 -0
- package/AISB/image/022_aisb.t3.022_chainofreasoning.jpg +0 -0
- package/AISB/image/023_aisb.t3.023_guidedembed.jpg +0 -0
- package/AISB/image/024_aisb.t3.024_outputcentric.jpg +0 -0
- package/AISB/image/025_aisb.t3.025_deeper.jpg +0 -0
- package/AISB/image/026_aisb.t3.026_gartkg.jpg +0 -0
- package/AISB/image/027_aisb.t3.027_citeeval.jpg +0 -0
- package/AISB/image/028_aisb.t3.028_sbam.jpg +0 -0
- package/AISB/image/029_aisb.t3.029_cdqgeoembed.jpg +0 -0
- package/AISB/image/030_aisb.t3.030_processrm.jpg +0 -0
- package/AISB/image/031_aisb.t3.031_circuitstability.jpg +0 -0
- package/AISB/image/032_aisb.t3.032_ptsolver.jpg +0 -0
- package/AISB/image/033_aisb.t3.033_gcse.jpg +0 -0
- package/AISB/image/034_aisb.t3.034_ensemblewm.jpg +0 -0
- package/AISB/image/035_aisb.t3.035_moralvalueswa.jpg +0 -0
- package/AISB/image/036_aisb.t3.036_weakstrongpref.jpg +0 -0
- package/AISB/image/037_aisb.t3.037_dementiamask.jpg +0 -0
- package/AISB/image/038_aisb.t3.038_tinysam.jpg +0 -0
- package/AISB/image/039_aisb.t3.039_calf.jpg +0 -0
- package/AISB/image/040_aisb.t3.040_graniteguardian.jpg +0 -0
- package/AISB/image/041_aisb.t3.041_amdm.jpg +0 -0
- package/AISB/image/042_aisb.t3.042_xpatch.jpg +0 -0
- package/AISB/image/043_aisb.t3.043_vhm.jpg +0 -0
- package/AISB/image/044_aisb.t3.044_rgvi.jpg +0 -0
- package/AISB/image/045_aisb.t3.045_pslstm.jpg +0 -0
- package/AISB/image/046_aisb.t3.046_nonstatts.jpg +0 -0
- package/AISB/image/047_aisb.t3.047_timepfn.jpg +0 -0
- package/AISB/image/048_aisb.t3.048_proxyspex.jpg +0 -0
- package/AISB/image/049_aisb.t3.049_hogwildinference.jpg +0 -0
- package/AISB/image/050_aisb.t3.050_causalpfn.jpg +0 -0
- package/AISB/image/051_aisb.t3.051_flashtp.jpg +0 -0
- package/AISB/image/052_aisb.t3.052_nsdiff.jpg +0 -0
- package/AISB/image/053_aisb.t3.053_k2vae.jpg +0 -0
- package/AISB/image/054_aisb.t3.054_timebase.jpg +0 -0
- package/AISB/image/055_aisb.t3.055_csbrain.jpg +0 -0
- package/AISB/image/056_aisb.t3.056_infosam.jpg +0 -0
- package/AISB/image/057_aisb.t3.057_mdreid.jpg +0 -0
- package/AISB/image/058_aisb.t3.058_mindglitch.jpg +0 -0
- package/AISB/image/059_aisb.t3.059_selfsupervised.jpg +0 -0
- package/AISB/image/060_aisb.t3.060_iaggad.jpg +0 -0
- package/AISB/image/061_aisb.t3.061_hsgkn.jpg +0 -0
- package/AISB/image/062_aisb.t3.062_visionts.jpg +0 -0
- package/AISB/image/063_aisb.t3.063_tsrag.jpg +0 -0
- package/AISB/image/064_aisb.t3.064_pir.jpg +0 -0
- package/AISB/image/065_aisb.t3.065_proteinbinding.jpg +0 -0
- package/AISB/image/066_aisb.t3.066_tropicalattention.jpg +0 -0
- package/AISB/image/067_aisb.t3.067_kanad.jpg +0 -0
- package/AISB/image/068_aisb.t3.068_sempo.jpg +0 -0
- package/AISB/image/069_aisb.t3.069_treehfd.jpg +0 -0
- package/AISB/image/070_aisb.t3.070_certifiedunlearning.jpg +0 -0
- package/AISB/image/071_aisb.t3.071_neuralmjd.jpg +0 -0
- package/AISB/image/072_aisb.t3.072_fedgmt.jpg +0 -0
- package/AISB/image/073_aisb.t3.073_rld.jpg +0 -0
- package/AISB/image/074_aisb.t3.074_lsvi.jpg +0 -0
- package/AISB/image/075_aisb.t3.075_treeslicedentropy.jpg +0 -0
- package/AISB/image/076_aisb.t3.076_aanet.jpg +0 -0
- package/AISB/image/077_aisb.t3.077_cmnn.jpg +0 -0
- package/AISB/image/078_aisb.t3.078_conformalanomaly.jpg +0 -0
- package/AISB/image/079_aisb.t3.079_dpfkmeans.jpg +0 -0
- package/AISB/image/080_aisb.t3.080_latentscorereweight.jpg +0 -0
- package/AISB/image/081_aisb.t3.081_qmamba.jpg +0 -0
- package/AISB/image/082_aisb.t3.082_onlinellmrouting.jpg +0 -0
- package/AISB/image/083_aisb.t3.083_starformer.jpg +0 -0
- package/AISB/image/084_aisb.t3.084_ift.jpg +0 -0
- package/AISB/image/085_aisb.t3.085_neuralsurv.jpg +0 -0
- package/AISB/image/086_aisb.t3.086_stella.jpg +0 -0
- package/AISB/image/087_aisb.t3.087_moses.jpg +0 -0
- package/AISB/image/088_aisb.t3.088_channelnorm.jpg +0 -0
- package/AISB/image/089_aisb.t3.089_causalvelocity.jpg +0 -0
- package/AISB/image/090_aisb.t3.090_rstib.jpg +0 -0
- package/AISB/image/091_aisb.t3.091_timeawarecausal.jpg +0 -0
- package/AISB/image/092_aisb.t3.092_kmeanslocalopt.jpg +0 -0
- package/AISB/image/093_aisb.t3.093_fedwmsam.jpg +0 -0
- package/AISB/image/094_aisb.t3.094_boundre.jpg +0 -0
- package/AISB/image/095_aisb.t3.095_fastfeaturecp.jpg +0 -0
- package/AISB/image/096_aisb.t3.096_m3svm.jpg +0 -0
- package/AISB/image/097_aisb.t3.097_wassersteintl.jpg +0 -0
- package/AISB/image/098_aisb.t3.098_xmahalanobis.jpg +0 -0
- package/AISB/image/099_aisb.t3.099_ollalanding.jpg +0 -0
- package/AISB/image/100_aisb.t3.100_invmissingdata.jpg +0 -0
- package/AISB/image/101_aisb.t3.101_acia.jpg +0 -0
- package/AISB/image/102_aisb.t3.102_stochasticff.jpg +0 -0
- package/AISB/image/103_aisb.t3.103_qdcp.jpg +0 -0
- package/AISB/image/104_aisb.t3.104_balancedactiveinf.jpg +0 -0
- package/AISB/image/105_aisb.t3.105_binaryclasseval.jpg +0 -0
- package/AISB/image/106_aisb.t1.reasoning_lite.jpg +0 -0
- package/AISB/image/107_aisb.t2.paper_audit.jpg +0 -0
- package/AISB/image/108_aisb.t3.multi_gpu_search.jpg +0 -0
- package/AISB/image/109_aisb.t3.tdc_admet.jpg +0 -0
- package/AISB/image/aisb.b1.agentic_coding.svg +16 -0
- package/AISB/image/aisb.b10.climate_earth.svg +16 -0
- package/AISB/image/aisb.b11.model_efficiency.svg +16 -0
- package/AISB/image/aisb.b12.embodied_ai.svg +16 -0
- package/AISB/image/aisb.b2.agent_systems.svg +16 -0
- package/AISB/image/aisb.b3.self_evolving_rl.svg +16 -0
- package/AISB/image/aisb.b4.lm_reasoning.svg +16 -0
- package/AISB/image/aisb.b5.math_proof.svg +16 -0
- package/AISB/image/aisb.b6.research_process.svg +16 -0
- package/AISB/image/aisb.b7.multimodal_fusion.svg +16 -0
- package/AISB/image/aisb.b8.lifesci_drug.svg +16 -0
- package/AISB/image/aisb.b9.material_science.svg +16 -0
- package/README.md +132 -11
- package/bin/ds.js +376 -49
- package/docs/en/00_QUICK_START.md +135 -18
- package/docs/en/01_SETTINGS_REFERENCE.md +468 -96
- package/docs/en/02_START_RESEARCH_GUIDE.md +26 -5
- package/docs/en/03_QQ_CONNECTOR_GUIDE.md +14 -3
- package/docs/en/04_LINGZHU_CONNECTOR_GUIDE.md +2 -0
- package/docs/en/05_TUI_GUIDE.md +171 -2
- package/docs/en/07_MEMORY_AND_MCP.md +38 -2
- package/docs/en/09_DOCTOR.md +64 -4
- package/docs/en/10_WEIXIN_CONNECTOR_GUIDE.md +38 -1
- package/docs/en/11_LICENSE_AND_RISK.md +4 -0
- package/docs/en/12_GUIDED_WORKFLOW_TOUR.md +15 -0
- package/docs/en/14_PROMPT_SKILLS_AND_MCP_GUIDE.md +9 -0
- package/docs/en/15_CODEX_PROVIDER_SETUP.md +622 -187
- package/docs/en/16_TELEGRAM_CONNECTOR_GUIDE.md +14 -0
- package/docs/en/17_WHATSAPP_CONNECTOR_GUIDE.md +14 -0
- package/docs/en/18_FEISHU_CONNECTOR_GUIDE.md +14 -0
- package/docs/en/21_LOCAL_MODEL_BACKENDS_GUIDE.md +105 -2
- package/docs/en/22_BENCHSTORE_YAML_REFERENCE.md +469 -0
- package/docs/en/23_BENCHSTORE_GITHUB_RELEASES_SPEC.md +316 -0
- package/docs/en/24_CLAUDE_CODE_PROVIDER_SETUP.md +469 -0
- package/docs/en/25_OPENCODE_PROVIDER_SETUP.md +653 -0
- package/docs/en/26_CITATION_AND_ATTRIBUTION.md +119 -0
- package/docs/en/27_KIMI_CODE_PROVIDER_SETUP.md +180 -0
- package/docs/en/28_DISCORD_CONNECTOR_GUIDE.md +61 -0
- package/docs/en/29_SLACK_CONNECTOR_GUIDE.md +60 -0
- package/docs/en/30_SETTINGS_CONTROL_CENTER_GUIDE.md +371 -0
- package/docs/en/{19_LOCAL_BROWSER_AUTH.md → 31_LOCAL_BROWSER_AUTH.md} +1 -1
- package/docs/en/32_WINDOWS_WSL2_DEPLOYMENT_GUIDE.md +273 -0
- package/docs/en/33_WORKSPACE_EXPLORER_QA.md +121 -0
- package/docs/en/91_DEVELOPMENT.md +29 -0
- package/docs/en/99_ACKNOWLEDGEMENTS.md +24 -19
- package/docs/en/README.md +44 -7
- package/docs/images/admin/admin-connectors-health-en.png +0 -0
- package/docs/images/admin/admin-controllers-en.png +0 -0
- package/docs/images/admin/admin-diagnostics-en.png +0 -0
- package/docs/images/admin/admin-errors-en.png +0 -0
- package/docs/images/admin/admin-issues-en.png +0 -0
- package/docs/images/admin/admin-logs-en.png +0 -0
- package/docs/images/admin/admin-quest-detail-en.png +0 -0
- package/docs/images/admin/admin-quests-en.png +0 -0
- package/docs/images/admin/admin-repairs-en.png +0 -0
- package/docs/images/admin/admin-runtime-en.png +0 -0
- package/docs/images/admin/admin-search-en.png +0 -0
- package/docs/images/admin/admin-stats-en.png +0 -0
- package/docs/images/admin/admin-summary-en.png +0 -0
- package/docs/images/connectors/connector-discord-en.png +0 -0
- package/docs/images/connectors/connector-feishu-en.png +0 -0
- package/docs/images/connectors/connector-lingzhu-en.png +0 -0
- package/docs/images/connectors/connector-qq-en.png +0 -0
- package/docs/images/connectors/connector-slack-en.png +0 -0
- package/docs/images/connectors/connector-telegram-en.png +0 -0
- package/docs/images/connectors/connector-weixin-en.png +0 -0
- package/docs/images/connectors/connector-whatsapp-en.png +0 -0
- package/docs/images/settings/settings-baselines-en.png +0 -0
- package/docs/images/settings/settings-config-en.png +0 -0
- package/docs/images/settings/settings-connectors-overview-en.png +0 -0
- package/docs/images/settings/settings-deepxiv-en.png +0 -0
- package/docs/images/settings/settings-mcp-servers-en.png +0 -0
- package/docs/images/settings/settings-plugins-en.png +0 -0
- package/docs/images/settings/settings-runners-en.png +0 -0
- package/docs/zh/00_QUICK_START.md +92 -17
- package/docs/zh/01_SETTINGS_REFERENCE.md +219 -98
- package/docs/zh/02_START_RESEARCH_GUIDE.md +26 -5
- package/docs/zh/05_TUI_GUIDE.md +171 -2
- package/docs/zh/07_MEMORY_AND_MCP.md +29 -2
- package/docs/zh/09_DOCTOR.md +39 -4
- package/docs/zh/10_WEIXIN_CONNECTOR_GUIDE.md +24 -1
- package/docs/zh/11_LICENSE_AND_RISK.md +4 -0
- package/docs/zh/12_GUIDED_WORKFLOW_TOUR.md +15 -0
- package/docs/zh/14_PROMPT_SKILLS_AND_MCP_GUIDE.md +9 -0
- package/docs/zh/15_CODEX_PROVIDER_SETUP.md +550 -188
- package/docs/zh/21_LOCAL_MODEL_BACKENDS_GUIDE.md +105 -2
- package/docs/zh/22_BENCHSTORE_YAML_REFERENCE.md +459 -0
- package/docs/zh/23_BENCHSTORE_GITHUB_RELEASES_SPEC.md +287 -0
- package/docs/zh/23_CLAUDE_RUNNER_GUIDE.md +103 -0
- package/docs/zh/24_CLAUDE_CODE_PROVIDER_SETUP.md +460 -0
- package/docs/zh/25_OPENCODE_PROVIDER_SETUP.md +660 -0
- package/docs/zh/26_CITATION_AND_ATTRIBUTION.md +102 -0
- package/docs/zh/27_KIMI_CODE_PROVIDER_SETUP.md +51 -0
- package/docs/zh/{19_LOCAL_BROWSER_AUTH.md → 31_LOCAL_BROWSER_AUTH.md} +1 -1
- package/docs/zh/32_WINDOWS_WSL2_DEPLOYMENT_GUIDE.md +264 -0
- package/docs/zh/33_WORKSPACE_EXPLORER_QA.md +127 -0
- package/docs/zh/99_ACKNOWLEDGEMENTS.md +23 -19
- package/docs/zh/README.md +29 -7
- package/install.sh +122 -16
- package/package.json +4 -1
- package/pyproject.toml +2 -1
- package/src/deepscientist/__init__.py +1 -1
- package/src/deepscientist/acp/envelope.py +13 -0
- package/src/deepscientist/admin/__init__.py +3 -0
- package/src/deepscientist/admin/charts.py +681 -0
- package/src/deepscientist/admin/logs.py +119 -0
- package/src/deepscientist/admin/repairs.py +217 -0
- package/src/deepscientist/admin/service.py +1310 -0
- package/src/deepscientist/admin/system_info.py +700 -0
- package/src/deepscientist/admin/tasks.py +465 -0
- package/src/deepscientist/admin/tool_metrics.py +600 -0
- package/src/deepscientist/artifact/guidance.py +8 -4
- package/src/deepscientist/artifact/schemas.py +115 -0
- package/src/deepscientist/artifact/service.py +4268 -260
- package/src/deepscientist/bash_exec/monitor.py +30 -3
- package/src/deepscientist/bash_exec/service.py +134 -1
- package/src/deepscientist/benchstore/__init__.py +4 -0
- package/src/deepscientist/benchstore/prompt_builder.py +224 -0
- package/src/deepscientist/benchstore/service.py +1716 -0
- package/src/deepscientist/channels/weixin_ilink.py +8 -1
- package/src/deepscientist/cli.py +92 -17
- package/src/deepscientist/codex_cli_compat.py +2 -2
- package/src/deepscientist/config/models.py +82 -11
- package/src/deepscientist/config/service.py +927 -91
- package/src/deepscientist/connector/weixin_support.py +48 -17
- package/src/deepscientist/daemon/api/handlers.py +697 -210
- package/src/deepscientist/daemon/api/router.py +76 -1
- package/src/deepscientist/daemon/app.py +1054 -51
- package/src/deepscientist/diagnostics/runner_failures.py +147 -0
- package/src/deepscientist/doctor.py +212 -65
- package/src/deepscientist/evidence_packets.py +590 -0
- package/src/deepscientist/home.py +52 -4
- package/src/deepscientist/kimi_cli_compat.py +50 -0
- package/src/deepscientist/latex_runtime.py +2 -2
- package/src/deepscientist/mcp/context.py +2 -0
- package/src/deepscientist/mcp/schemas.py +114 -0
- package/src/deepscientist/mcp/server.py +1566 -126
- package/src/deepscientist/memory/service.py +203 -16
- package/src/deepscientist/process_control.py +8 -1
- package/src/deepscientist/prompts/builder.py +836 -92
- package/src/deepscientist/quest/__init__.py +2 -2
- package/src/deepscientist/quest/layout.py +12 -1
- package/src/deepscientist/quest/node_traces.py +10 -0
- package/src/deepscientist/quest/service.py +1430 -139
- package/src/deepscientist/quest/stage_views.py +1 -1
- package/src/deepscientist/runners/__init__.py +18 -0
- package/src/deepscientist/runners/base.py +89 -1
- package/src/deepscientist/runners/builtins.py +13 -1
- package/src/deepscientist/runners/claude.py +391 -0
- package/src/deepscientist/runners/codex.py +421 -21
- package/src/deepscientist/runners/codex_telemetry.py +127 -0
- package/src/deepscientist/runners/kimi.py +334 -0
- package/src/deepscientist/runners/metadata.py +68 -0
- package/src/deepscientist/runners/opencode.py +414 -0
- package/src/deepscientist/runners/runtime_overrides.py +100 -0
- package/src/deepscientist/runners/simple_cli.py +538 -0
- package/src/deepscientist/runtime_storage.py +303 -0
- package/src/deepscientist/shared.py +61 -16
- package/src/deepscientist/skills/installer.py +37 -0
- package/src/deepscientist/skills/registry.py +2 -0
- package/src/deepscientist/tinytex.py +2 -2
- package/src/deepscientist/tui.py +10 -3
- package/src/prompts/benchstore/system.md +77 -0
- package/src/prompts/connectors/qq.md +33 -2
- package/src/prompts/connectors/weixin.md +208 -23
- package/src/prompts/contracts/admin_ops.md +74 -0
- package/src/prompts/contracts/admin_ops_knowledge.md +138 -0
- package/src/prompts/contracts/shared_interaction.md +5 -11
- package/src/prompts/start_setup/system.md +422 -0
- package/src/prompts/system.md +409 -315
- package/src/prompts/system_copilot.md +88 -12
- package/src/skills/analysis-campaign/SKILL.md +239 -578
- package/src/skills/analysis-campaign/references/artifact-flow-examples.md +102 -0
- package/src/skills/analysis-campaign/references/boundary-cases.md +98 -0
- package/src/skills/analysis-campaign/references/campaign-checklist-template.md +39 -24
- package/src/skills/analysis-campaign/references/campaign-design.md +26 -10
- package/src/skills/analysis-campaign/references/campaign-plan-template.md +53 -54
- package/src/skills/analysis-campaign/references/operational-guidance.md +97 -0
- package/src/skills/analysis-campaign/references/writing-facing-slice-examples.md +10 -20
- package/src/skills/baseline/SKILL.md +183 -461
- package/src/skills/baseline/references/artifact-flow-examples.md +106 -0
- package/src/skills/baseline/references/artifact-payload-examples.md +1 -1
- package/src/skills/baseline/references/baseline-checklist-template.md +27 -35
- package/src/skills/baseline/references/baseline-plan-template.md +37 -76
- package/src/skills/baseline/references/boundary-cases.md +86 -0
- package/src/skills/baseline/references/codebase-audit-checklist.md +2 -6
- package/src/skills/baseline/references/comparability-contract.md +7 -12
- package/src/skills/baseline/references/operational-guidance.md +56 -0
- package/src/skills/baseline/references/route-selection.md +5 -25
- package/src/skills/decision/SKILL.md +113 -306
- package/src/skills/decision/references/checkpoint-memory-template.md +47 -0
- package/src/skills/decision/references/operational-guidance.md +94 -0
- package/src/skills/decision/references/research-route-criteria.md +7 -8
- package/src/skills/decision/references/strategic-decision-template.md +13 -26
- package/src/skills/experiment/SKILL.md +132 -670
- package/src/skills/experiment/references/execution-playbook.md +374 -0
- package/src/skills/experiment/references/main-experiment-checklist-template.md +26 -2
- package/src/skills/experiment/references/main-experiment-plan-template.md +28 -17
- package/src/skills/experiment/references/operational-guidance.md +108 -0
- package/src/skills/finalize/SKILL.md +62 -0
- package/src/skills/finalize/references/checkpoint-memory-template.md +49 -0
- package/src/skills/finalize/references/resume-packet-template.md +7 -0
- package/src/skills/idea/SKILL.md +228 -15
- package/src/skills/idea/references/controlled-brainstorming-playbook.md +78 -0
- package/src/skills/idea/references/current-board-packet-template.md +61 -0
- package/src/skills/idea/references/high-value-idea-sourcing.md +119 -0
- package/src/skills/idea/references/idea-generation-playbook.md +21 -0
- package/src/skills/idea/references/idea-thinking-flow.md +6 -0
- package/src/skills/idea/references/literature-survey-template.md +3 -0
- package/src/skills/idea/references/objective-contract-template.md +54 -0
- package/src/skills/idea/references/outline-seeding-example.md +56 -0
- package/src/skills/idea/references/pre-idea-draft-template.md +105 -0
- package/src/skills/idea/references/related-work-playbook.md +75 -2
- package/src/skills/idea/references/research-history-playbook.md +114 -0
- package/src/skills/idea/references/selection-gate.md +58 -6
- package/src/skills/intake-audit/SKILL.md +43 -2
- package/src/skills/intake-audit/references/state-audit-template.md +10 -0
- package/src/skills/nature-data/SKILL.md +128 -0
- package/src/skills/nature-data/UPSTREAM_LICENSE.txt +21 -0
- package/src/skills/nature-data/agents/openai.yaml +4 -0
- package/src/skills/nature-data/references/chinese-author-alignment.md +84 -0
- package/src/skills/nature-data/references/fair-metadata-checklist.md +105 -0
- package/src/skills/nature-data/references/policy-principles.md +103 -0
- package/src/skills/nature-data/references/repository-and-identifiers.md +96 -0
- package/src/skills/nature-data/references/source-basis.md +54 -0
- package/src/skills/nature-data/references/statement-patterns.md +153 -0
- package/src/skills/nature-figure/SKILL.md +197 -0
- package/src/skills/nature-figure/UPSTREAM_LICENSE.txt +21 -0
- package/src/skills/nature-figure/agents/openai.yaml +4 -0
- package/src/skills/nature-figure/evals/evals.json +37 -0
- package/src/skills/nature-figure/references/api.md +428 -0
- package/src/skills/nature-figure/references/backend-selection.md +100 -0
- package/src/skills/nature-figure/references/chart-types.md +281 -0
- package/src/skills/nature-figure/references/common-patterns.md +349 -0
- package/src/skills/nature-figure/references/design-theory.md +436 -0
- package/src/skills/nature-figure/references/figure-contract.md +93 -0
- package/src/skills/nature-figure/references/nature-2026-observations.md +112 -0
- package/src/skills/nature-figure/references/qa-contract.md +119 -0
- package/src/skills/nature-figure/references/r-template-index.md +66 -0
- package/src/skills/nature-figure/references/r-workflow.md +161 -0
- package/src/skills/nature-figure/references/tutorials.md +250 -0
- package/src/skills/nature-paper2ppt/SKILL.md +507 -0
- package/src/skills/nature-paper2ppt/UPSTREAM_LICENSE.txt +21 -0
- package/src/skills/nature-paper2ppt/agents/openai.yaml +4 -0
- package/src/skills/nature-polishing/SKILL.md +385 -0
- package/src/skills/nature-polishing/UPSTREAM_LICENSE.txt +21 -0
- package/src/skills/nature-polishing/agents/openai.yaml +4 -0
- package/src/skills/nature-polishing/references/phrasebank-playbook.md +162 -0
- package/src/skills/nature-polishing/references/section-moves.md +240 -0
- package/src/skills/nature-polishing/references/style-guardrails.md +94 -0
- package/src/skills/nature-polishing/references/writing-strategy.md +148 -0
- package/src/skills/optimize/SKILL.md +177 -1568
- package/src/skills/optimize/references/brief-shaping-playbook.md +95 -0
- package/src/skills/optimize/references/candidate-board-template.md +13 -0
- package/src/skills/optimize/references/candidate-ranking-template.md +51 -0
- package/src/skills/optimize/references/codegen-route-playbook.md +50 -0
- package/src/skills/optimize/references/debug-response-template.md +29 -0
- package/src/skills/optimize/references/frontier-review-template.md +32 -0
- package/src/skills/optimize/references/fusion-playbook.md +36 -0
- package/src/skills/optimize/references/method-brief-template.md +73 -0
- package/src/skills/optimize/references/operational-guidance.md +621 -0
- package/src/skills/optimize/references/optimization-memory-template.md +30 -0
- package/src/skills/optimize/references/optimize-checklist-template.md +18 -0
- package/src/skills/optimize/references/plateau-response-playbook.md +28 -0
- package/src/skills/optimize/references/prompt-patterns.md +49 -0
- package/src/skills/paper-outline/SKILL.md +227 -0
- package/src/skills/paper-outline/references/outline-patterns.md +87 -0
- package/src/skills/paper-plot/SKILL.md +79 -0
- package/src/skills/paper-plot/agents/openai.yaml +4 -0
- package/src/skills/paper-plot/references/bar_grouped_hatch.md +96 -0
- package/src/skills/paper-plot/references/bar_paired_delta.md +72 -0
- package/src/skills/paper-plot/references/line_confidence_band.md +75 -0
- package/src/skills/paper-plot/references/line_loss_with_inset.md +65 -0
- package/src/skills/paper-plot/references/line_training_curve.md +44 -0
- package/src/skills/paper-plot/references/radar_dual_series.md +59 -0
- package/src/skills/paper-plot/references/scatter_broken_axis.md +59 -0
- package/src/skills/paper-plot/references/scatter_tsne_cluster.md +72 -0
- package/src/skills/paper-plot/scripts/bar_memevolve.py +109 -0
- package/src/skills/paper-plot/scripts/bar_spice.py +166 -0
- package/src/skills/paper-plot/scripts/line_aime.py +94 -0
- package/src/skills/paper-plot/scripts/line_loss_inset.py +157 -0
- package/src/skills/paper-plot/scripts/line_selfdistill.py +168 -0
- package/src/skills/paper-plot/scripts/radar_dora.py +151 -0
- package/src/skills/paper-plot/scripts/scatter_break.py +169 -0
- package/src/skills/paper-plot/scripts/scatter_tsne.py +133 -0
- package/src/skills/rebuttal/SKILL.md +9 -0
- package/src/skills/references/tool-usage-by-stage.md +438 -0
- package/src/skills/review/SKILL.md +105 -7
- package/src/skills/science/PROVENANCE.md +44 -0
- package/src/skills/science/SKILL.md +137 -0
- package/src/skills/science/references/artifact-science-tool.md +110 -0
- package/src/skills/science/references/claim-type-discipline.md +56 -0
- package/src/skills/science/references/domain-index.md +422 -0
- package/src/skills/science/references/hpc-via-bash-exec.md +42 -0
- package/src/skills/science/references/package-check-playbook.md +64 -0
- package/src/skills/science/references/package-index.min.json +3616 -0
- package/src/skills/science/references/packages/abinit.md +80 -0
- package/src/skills/science/references/packages/acts.md +73 -0
- package/src/skills/science/references/packages/aiida-core.md +80 -0
- package/src/skills/science/references/packages/alamode.md +80 -0
- package/src/skills/science/references/packages/amuse.md +88 -0
- package/src/skills/science/references/packages/anndata.md +88 -0
- package/src/skills/science/references/packages/arbor.md +80 -0
- package/src/skills/science/references/packages/arc.md +73 -0
- package/src/skills/science/references/packages/astropy.md +88 -0
- package/src/skills/science/references/packages/astroquery.md +88 -0
- package/src/skills/science/references/packages/atomate2.md +80 -0
- package/src/skills/science/references/packages/atomsmltr.md +73 -0
- package/src/skills/science/references/packages/awkward.md +73 -0
- package/src/skills/science/references/packages/batman.md +88 -0
- package/src/skills/science/references/packages/biopython.md +88 -0
- package/src/skills/science/references/packages/bloqade.md +73 -0
- package/src/skills/science/references/packages/brian2.md +73 -0
- package/src/skills/science/references/packages/bullet3.md +73 -0
- package/src/skills/science/references/packages/calculix.md +80 -0
- package/src/skills/science/references/packages/cantera.md +73 -0
- package/src/skills/science/references/packages/cavity-md-ipi.md +80 -0
- package/src/skills/science/references/packages/ccdproc.md +88 -0
- package/src/skills/science/references/packages/celerite2.md +88 -0
- package/src/skills/science/references/packages/cellrank.md +73 -0
- package/src/skills/science/references/packages/cesm.md +80 -0
- package/src/skills/science/references/packages/chemicals.md +73 -0
- package/src/skills/science/references/packages/chempy.md +73 -0
- package/src/skills/science/references/packages/cirq.md +73 -0
- package/src/skills/science/references/packages/coffea.md +73 -0
- package/src/skills/science/references/packages/cp2k.md +88 -0
- package/src/skills/science/references/packages/custodian.md +80 -0
- package/src/skills/science/references/packages/dart.md +73 -0
- package/src/skills/science/references/packages/datamol.md +88 -0
- package/src/skills/science/references/packages/dd4hep.md +73 -0
- package/src/skills/science/references/packages/dealii.md +80 -0
- package/src/skills/science/references/packages/deepchem.md +88 -0
- package/src/skills/science/references/packages/delphes.md +73 -0
- package/src/skills/science/references/packages/devito.md +80 -0
- package/src/skills/science/references/packages/dftb.md +88 -0
- package/src/skills/science/references/packages/dftd4.md +88 -0
- package/src/skills/science/references/packages/dftk-jl.md +80 -0
- package/src/skills/science/references/packages/dolfinx.md +80 -0
- package/src/skills/science/references/packages/drake.md +73 -0
- package/src/skills/science/references/packages/dumux.md +73 -0
- package/src/skills/science/references/packages/elk.md +80 -0
- package/src/skills/science/references/packages/elmerfem.md +80 -0
- package/src/skills/science/references/packages/enzo-e.md +88 -0
- package/src/skills/science/references/packages/espresso.md +80 -0
- package/src/skills/science/references/packages/exoplanet.md +88 -0
- package/src/skills/science/references/packages/fairroot.md +73 -0
- package/src/skills/science/references/packages/fbpic.md +80 -0
- package/src/skills/science/references/packages/fdtdbath-meep.md +80 -0
- package/src/skills/science/references/packages/geant4.md +73 -0
- package/src/skills/science/references/packages/geosx.md +80 -0
- package/src/skills/science/references/packages/gprmax.md +80 -0
- package/src/skills/science/references/packages/gromacs.md +80 -0
- package/src/skills/science/references/packages/gwaslab.md +73 -0
- package/src/skills/science/references/packages/gz-sim.md +73 -0
- package/src/skills/science/references/packages/hail.md +88 -0
- package/src/skills/science/references/packages/hiphive.md +80 -0
- package/src/skills/science/references/packages/hoomd-blue.md +80 -0
- package/src/skills/science/references/packages/itensor.md +73 -0
- package/src/skills/science/references/packages/itensors-jl.md +73 -0
- package/src/skills/science/references/packages/jdftx.md +73 -0
- package/src/skills/science/references/packages/jobflow.md +80 -0
- package/src/skills/science/references/packages/kadanoffbaym-jl.md +73 -0
- package/src/skills/science/references/packages/kite.md +80 -0
- package/src/skills/science/references/packages/kratos.md +80 -0
- package/src/skills/science/references/packages/kwant.md +73 -0
- package/src/skills/science/references/packages/lammps.md +80 -0
- package/src/skills/science/references/packages/lightkurve.md +88 -0
- package/src/skills/science/references/packages/limix.md +73 -0
- package/src/skills/science/references/packages/maxwelllink.md +80 -0
- package/src/skills/science/references/packages/mcdc.md +73 -0
- package/src/skills/science/references/packages/meep.md +80 -0
- package/src/skills/science/references/packages/mfem.md +80 -0
- package/src/skills/science/references/packages/mitgcm.md +73 -0
- package/src/skills/science/references/packages/modflow6.md +73 -0
- package/src/skills/science/references/packages/molecool.md +73 -0
- package/src/skills/science/references/packages/mom6.md +73 -0
- package/src/skills/science/references/packages/moose.md +80 -0
- package/src/skills/science/references/packages/mpas-model.md +73 -0
- package/src/skills/science/references/packages/mujoco.md +73 -0
- package/src/skills/science/references/packages/mumax3.md +73 -0
- package/src/skills/science/references/packages/nekrs.md +80 -0
- package/src/skills/science/references/packages/nessi.md +73 -0
- package/src/skills/science/references/packages/nest-simulator.md +73 -0
- package/src/skills/science/references/packages/netket.md +73 -0
- package/src/skills/science/references/packages/neuron.md +73 -0
- package/src/skills/science/references/packages/nextflow.md +88 -0
- package/src/skills/science/references/packages/nwchem.md +88 -0
- package/src/skills/science/references/packages/openbabel.md +88 -0
- package/src/skills/science/references/packages/openems.md +80 -0
- package/src/skills/science/references/packages/openff-toolkit.md +88 -0
- package/src/skills/science/references/packages/openfoam-dev.md +80 -0
- package/src/skills/science/references/packages/openmc.md +73 -0
- package/src/skills/science/references/packages/openmm.md +80 -0
- package/src/skills/science/references/packages/openmoc.md +73 -0
- package/src/skills/science/references/packages/openmx.md +80 -0
- package/src/skills/science/references/packages/opensees.md +80 -0
- package/src/skills/science/references/packages/opensn.md +80 -0
- package/src/skills/science/references/packages/opm-simulators.md +73 -0
- package/src/skills/science/references/packages/oqupy.md +73 -0
- package/src/skills/science/references/packages/packmol.md +80 -0
- package/src/skills/science/references/packages/palabos.md +80 -0
- package/src/skills/science/references/packages/parflow.md +80 -0
- package/src/skills/science/references/packages/pennylane.md +88 -0
- package/src/skills/science/references/packages/perceval.md +73 -0
- package/src/skills/science/references/packages/phono3py.md +73 -0
- package/src/skills/science/references/packages/phonopy.md +73 -0
- package/src/skills/science/references/packages/photutils.md +88 -0
- package/src/skills/science/references/packages/picongpu.md +80 -0
- package/src/skills/science/references/packages/plink-ng.md +88 -0
- package/src/skills/science/references/packages/precice.md +73 -0
- package/src/skills/science/references/packages/psc.md +80 -0
- package/src/skills/science/references/packages/psi4.md +88 -0
- package/src/skills/science/references/packages/pybinding.md +73 -0
- package/src/skills/science/references/packages/pyfr.md +80 -0
- package/src/skills/science/references/packages/pyhf.md +73 -0
- package/src/skills/science/references/packages/pyiron_base.md +80 -0
- package/src/skills/science/references/packages/pylcp.md +73 -0
- package/src/skills/science/references/packages/pylith.md +80 -0
- package/src/skills/science/references/packages/pynbody.md +88 -0
- package/src/skills/science/references/packages/pysam.md +88 -0
- package/src/skills/science/references/packages/pyscf.md +88 -0
- package/src/skills/science/references/packages/q-e.md +73 -0
- package/src/skills/science/references/packages/qibo.md +73 -0
- package/src/skills/science/references/packages/qiskit.md +73 -0
- package/src/skills/science/references/packages/quantica-jl.md +73 -0
- package/src/skills/science/references/packages/quantumoptics-jl.md +73 -0
- package/src/skills/science/references/packages/quimb.md +73 -0
- package/src/skills/science/references/packages/qulacs.md +73 -0
- package/src/skills/science/references/packages/qutip.md +73 -0
- package/src/skills/science/references/packages/rdkit.md +88 -0
- package/src/skills/science/references/packages/rmg-py.md +73 -0
- package/src/skills/science/references/packages/root.md +73 -0
- package/src/skills/science/references/packages/scanpy.md +88 -0
- package/src/skills/science/references/packages/scikit-allel.md +88 -0
- package/src/skills/science/references/packages/scikit-bio.md +88 -0
- package/src/skills/science/references/packages/scqubits.md +73 -0
- package/src/skills/science/references/packages/scuff-em.md +80 -0
- package/src/skills/science/references/packages/scvi-tools.md +73 -0
- package/src/skills/science/references/packages/seissol.md +73 -0
- package/src/skills/science/references/packages/sfepy.md +80 -0
- package/src/skills/science/references/packages/sisl.md +73 -0
- package/src/skills/science/references/packages/smilei.md +80 -0
- package/src/skills/science/references/packages/snakemake.md +88 -0
- package/src/skills/science/references/packages/specfem3d-globe.md +80 -0
- package/src/skills/science/references/packages/specutils.md +88 -0
- package/src/skills/science/references/packages/spglib.md +80 -0
- package/src/skills/science/references/packages/squidpy.md +88 -0
- package/src/skills/science/references/packages/starry.md +88 -0
- package/src/skills/science/references/packages/strawberryfields.md +73 -0
- package/src/skills/science/references/packages/su2.md +80 -0
- package/src/skills/science/references/packages/sunny-jl.md +73 -0
- package/src/skills/science/references/packages/sw4.md +73 -0
- package/src/skills/science/references/packages/swift.md +88 -0
- package/src/skills/science/references/packages/tdnegf.md +73 -0
- package/src/skills/science/references/packages/tenpy.md +73 -0
- package/src/skills/science/references/packages/thermo.md +73 -0
- package/src/skills/science/references/packages/tkwant.md +73 -0
- package/src/skills/science/references/packages/tvb-root.md +73 -0
- package/src/skills/science/references/packages/uproot5.md +73 -0
- package/src/skills/science/references/packages/vampire.md +80 -0
- package/src/skills/science/references/packages/wannier_tools.md +73 -0
- package/src/skills/science/references/packages/warpx.md +80 -0
- package/src/skills/science/references/packages/wrf.md +73 -0
- package/src/skills/science/references/packages/xtb.md +88 -0
- package/src/skills/science/references/packages/yt.md +73 -0
- package/src/skills/science/references/science-task-brief-template.md +71 -0
- package/src/skills/scout/SKILL.md +83 -425
- package/src/skills/scout/references/literature-scout-template.md +5 -24
- package/src/skills/scout/references/operational-guidance.md +191 -0
- package/src/skills/scout/references/paper-triage-playbook.md +11 -35
- package/src/skills/write/SKILL.md +744 -1246
- package/src/skills/write/references/experiments_analysis_patterns.md +129 -0
- package/src/skills/write/references/oral_package_patterns.md +252 -0
- package/src/skills/write/references/oral_writing_principles.md +291 -0
- package/src/skills/write/references/section_rewrite_checklist.md +234 -0
- package/src/tui/dist/app/AppContainer.js +1314 -27
- package/src/tui/dist/components/Composer.js +26 -1
- package/src/tui/dist/components/ConfigScreen.js +2 -1
- package/src/tui/dist/components/InputPrompt.js +25 -9
- package/src/tui/dist/components/MainContent.js +18 -3
- package/src/tui/dist/components/QuestScreen.js +3 -2
- package/src/tui/dist/components/UtilityScreen.js +37 -0
- package/src/tui/dist/hooks/useSafeInput.js +10 -0
- package/src/tui/dist/index.js +13 -1
- package/src/tui/dist/layouts/DefaultAppLayout.js +11 -8
- package/src/tui/dist/lib/api.js +89 -1
- package/src/tui/package.json +1 -1
- package/src/ui/dist/assets/{AnalysisPlugin-BCKAfjba.js → AnalysisPlugin-CA94NGmI.js} +1 -1
- package/src/ui/dist/assets/CliPlugin-DHBzphZU.js +79 -0
- package/src/ui/dist/assets/CodeEditorPlugin-BOFwD2rn.js +2 -0
- package/src/ui/dist/assets/{CodeViewerPlugin-CbaFRrUU.js → CodeViewerPlugin-CqDpgjik.js} +4 -4
- package/src/ui/dist/assets/{DocViewerPlugin-DAjLVeQD.js → DocViewerPlugin-UDBgt8-4.js} +3 -3
- package/src/ui/dist/assets/GitCommitViewerPlugin-BmHtZ0bZ.js +6 -0
- package/src/ui/dist/assets/{GitDiffViewerPlugin-CQACjoAA.js → GitDiffViewerPlugin-CAxjNorQ.js} +2 -2
- package/src/ui/dist/assets/{GitSnapshotViewer-0r4nLPke.js → GitSnapshotViewer-CweA6VON.js} +2 -2
- package/src/ui/dist/assets/{ImageViewerPlugin-nBOmI2v_.js → ImageViewerPlugin-C8wHGvGN.js} +5 -5
- package/src/ui/dist/assets/LabPlugin-COyyLUol.js +32 -0
- package/src/ui/dist/assets/{LatexPlugin-ZwtV8pIp.js → LatexPlugin-BQjAaA5J.js} +4 -4
- package/src/ui/dist/assets/{MarkdownViewerPlugin-DKqVfKyW.js → MarkdownViewerPlugin-Dy1NE2dI.js} +3 -3
- package/src/ui/dist/assets/{MarketplacePlugin-BwxStZ9D.js → MarketplacePlugin-DMIZtEJ2.js} +2 -2
- package/src/ui/dist/assets/NotebookEditor-CFHMq_Qt.js +91 -0
- package/src/ui/dist/assets/{NotebookEditor-DB9N_T9q.js → NotebookEditor-WFyd8Ybt.js} +3 -3
- package/src/ui/dist/assets/{PdfLoader-eWBONbQP.js → PdfLoader-CLE5u5TS.js} +3 -3
- package/src/ui/dist/assets/{PdfMarkdownPlugin-D22YOZL3.js → PdfMarkdownPlugin-_iNK_H83.js} +1 -1
- package/src/ui/dist/assets/PdfViewerPlugin-DgWsbInT.js +22 -0
- package/src/ui/dist/assets/SearchPlugin-DrZmn5iw.js +11 -0
- package/src/ui/dist/assets/{TextViewerPlugin-C5xqeeUH.js → TextViewerPlugin-D1-T3aC7.js} +4 -4
- package/src/ui/dist/assets/branding/runner-claude.svg +107 -0
- package/src/ui/dist/assets/branding/runner-codex.svg +10 -0
- package/src/ui/dist/assets/branding/runner-kimi.svg +14 -0
- package/src/ui/dist/assets/branding/runner-opencode.svg +7 -0
- package/src/ui/dist/assets/cli-store-CoZ-x5Ip.js +1 -0
- package/src/ui/dist/assets/{code-WlFHE7z_.js → code-DbsmSd3Y.js} +1 -1
- package/src/ui/dist/assets/file-diff-panel-DsvyRz47.js +1 -0
- package/src/ui/dist/assets/{wrap-text-BC-Hltpd.js → file-jump-queue-DeQBikaw.js} +3 -3
- package/src/ui/dist/assets/{file-socket-CfQPKQKj.js → file-socket-DA5XIx88.js} +1 -1
- package/src/ui/dist/assets/fonts/ds-fonts.css +50 -4
- package/src/ui/dist/assets/images/deepxiv/register-guide.png +0 -0
- package/src/ui/dist/assets/index-39vY9LmZ.js +1 -0
- package/src/ui/dist/assets/{index-CwNu1aH4.js → index-BsO46tJA.js} +1 -1
- package/src/ui/dist/assets/index-CHzJ2xtB.js +3530 -0
- package/src/ui/dist/assets/index-DH-zxoZ3.css +33 -0
- package/src/ui/dist/assets/{plugin-notebook-HbW2K-1c.js → plugin-notebook-JRhysCqj.js} +2 -2
- package/src/ui/dist/assets/{project-sync-C9IdzdZW.js → project-sync-DPmWKmKD.js} +1 -1
- package/src/ui/dist/assets/{zoom-out-E_gaeAxL.js → zoom-out-DAukFWen.js} +3 -3
- package/src/ui/dist/index.html +3 -3
- package/src/skills/analysis-campaign/references/artifact-orchestration.md +0 -58
- package/src/skills/baseline/references/memory-playbook.md +0 -40
- package/src/skills/baseline/references/publishable-baseline-package.md +0 -30
- package/src/skills/write/references/outline-evidence-contract-example.md +0 -107
- package/src/skills/write/references/paper-experiment-matrix-template.md +0 -131
- package/src/skills/write/references/paper-section-playbook.md +0 -64
- package/src/skills/write/references/reviewer-first-writing.md +0 -64
- package/src/skills/write/references/revision-checklist.md +0 -70
- package/src/skills/write/references/section-contracts.md +0 -82
- package/src/skills/write/references/sentence-level-proofing.md +0 -49
- package/src/ui/dist/assets/AiManusChatView-Bv-Z8YpU.js +0 -204
- package/src/ui/dist/assets/CliPlugin-BCKcpc35.js +0 -109
- package/src/ui/dist/assets/CodeEditorPlugin-DbOfSJ8K.js +0 -2
- package/src/ui/dist/assets/GitCommitViewerPlugin-CIUqbUDO.js +0 -1
- package/src/ui/dist/assets/LabCopilotPanel-BHxOxF4z.js +0 -14
- package/src/ui/dist/assets/LabPlugin-BKoZGs95.js +0 -22
- package/src/ui/dist/assets/NotebookEditor-BEQhaQbt.js +0 -81
- package/src/ui/dist/assets/PdfViewerPlugin-c-RK9DLM.js +0 -17
- package/src/ui/dist/assets/SearchPlugin-CxF9ytAx.js +0 -16
- package/src/ui/dist/assets/VNCViewer-BoLGLnHz.js +0 -11
- package/src/ui/dist/assets/bot-DREQOxzP.js +0 -6
- package/src/ui/dist/assets/chevron-up-C9Qpx4DE.js +0 -6
- package/src/ui/dist/assets/file-content-BZMz3RYp.js +0 -1
- package/src/ui/dist/assets/file-diff-panel-CQhw0jS2.js +0 -1
- package/src/ui/dist/assets/file-jump-queue-DA-SdG__.js +0 -1
- package/src/ui/dist/assets/git-commit-horizontal-DxZ8DCZh.js +0 -6
- package/src/ui/dist/assets/image-Bgl4VIyx.js +0 -6
- package/src/ui/dist/assets/index-BpV6lusQ.css +0 -33
- package/src/ui/dist/assets/index-CBNVuWcP.js +0 -2496
- package/src/ui/dist/assets/index-DrUnlf6K.js +0 -1
- package/src/ui/dist/assets/index-NW-h8VzN.js +0 -1
- package/src/ui/dist/assets/pdf-effect-queue-J8OnM0jE.js +0 -6
- package/src/ui/dist/assets/popover-CLc0pPP8.js +0 -1
- package/src/ui/dist/assets/select-Cs2PmzwL.js +0 -11
- package/src/ui/dist/assets/sigma-ClKcHAXm.js +0 -6
- package/src/ui/dist/assets/trash-DwpbFr3w.js +0 -11
- package/src/ui/dist/assets/useCliAccess-NQ8m0Let.js +0 -1
- package/src/ui/dist/assets/useFileDiffOverlay-FuhcnKiw.js +0 -1
|
@@ -0,0 +1,143 @@
|
|
|
1
|
+
id: aisb.t3.025_deeper
|
|
2
|
+
name: 'DEEPER: Directed Persona Refinement for Dynamic Persona Modeling'
|
|
3
|
+
version: 0.1.0
|
|
4
|
+
one_line: Novel refinement-based dynamic persona modeling using offline reinforcement
|
|
5
|
+
learning and discrepancy-driven update direction search across 10 domains.
|
|
6
|
+
task_description: 'This packaged benchmark covers dynamic persona modeling with a
|
|
7
|
+
novel refinement-based paradigm that addresses fundamental limitations of traditional
|
|
8
|
+
regeneration (replacing personas) and extension (incrementally appending behaviors)
|
|
9
|
+
approaches. DEEPER leverages prediction-behavior discrepancies to guide update directions
|
|
10
|
+
in a structured, reward-driven way, enabling continual persona optimization over
|
|
11
|
+
time. The method employs a tri-objective reward design balancing Previous Preservation,
|
|
12
|
+
Current Reflection, and Future Advancement, trained via two-stage iterative offline
|
|
13
|
+
RL with DPO fine-tuning. Evaluated on 4,800+ users across 10 domains including 4
|
|
14
|
+
unseen domains for cross-domain generalization testing. Primary evaluation metric
|
|
15
|
+
is MAE on future behavior prediction across four refinement rounds.
|
|
16
|
+
|
|
17
|
+
'
|
|
18
|
+
task_mode: experiment_driven
|
|
19
|
+
requires_execution: true
|
|
20
|
+
requires_paper: true
|
|
21
|
+
integrity_level: cas_plus_canary
|
|
22
|
+
snapshot_status: external_eval_required
|
|
23
|
+
support_level: recovery
|
|
24
|
+
time_band: 2-4d
|
|
25
|
+
cost_band: high
|
|
26
|
+
difficulty: hard
|
|
27
|
+
data_access: public
|
|
28
|
+
primary_outputs:
|
|
29
|
+
- mae_round4
|
|
30
|
+
- persona_refinement_model
|
|
31
|
+
- evaluation_report
|
|
32
|
+
launch_profiles:
|
|
33
|
+
- id: quick_eval
|
|
34
|
+
label: Quick Eval
|
|
35
|
+
description: 'Run one packaged evaluation pass on the DEEPER persona modeling stack
|
|
36
|
+
using eval_round4.py. Evaluates MAE on Round 4 (predicting window 5 ratings using
|
|
37
|
+
persona S4 after 4 DEEPER updates). Uses meta-llama/llama-3.3-70b-instruct via
|
|
38
|
+
OpenRouter API with configurable concurrency.
|
|
39
|
+
|
|
40
|
+
'
|
|
41
|
+
- id: full_refinement
|
|
42
|
+
label: Full Refinement
|
|
43
|
+
description: 'Run the complete persona refinement workflow including two-stage offline
|
|
44
|
+
training (iteration 1 and iteration 2) via LLaMA-Factory, persona updates across
|
|
45
|
+
all rounds, and downstream evaluation with MAE reporting per domain and round.
|
|
46
|
+
|
|
47
|
+
'
|
|
48
|
+
dataset_download:
|
|
49
|
+
primary_method: huggingface
|
|
50
|
+
sources:
|
|
51
|
+
- https://huggingface.co/datasets/deeper-team/DEEPER_preprocess_data
|
|
52
|
+
- https://huggingface.co/datasets/deeper-team/DEEPER_user_context_data
|
|
53
|
+
- https://huggingface.co/datasets/deeper-team/DEEPER_train_data
|
|
54
|
+
- https://huggingface.co/deeper-team/DEEPER-llama-8B
|
|
55
|
+
notes:
|
|
56
|
+
- Four dataset components: Preprocessed Data, User Context Data, DEEPER Training
|
|
57
|
+
Data, Evolving Personas
|
|
58
|
+
- Preprocessed data spans 10 domains with chronological user rating histories (50+
|
|
59
|
+
entries per user)
|
|
60
|
+
- User context data organized by iteration (iteration_1 through iteration_4 for
|
|
61
|
+
eval, iteration_1 through iteration_2 for train)
|
|
62
|
+
- Training data constructed via self-sampling and iterative optimization over two
|
|
63
|
+
training iterations
|
|
64
|
+
credential_requirements:
|
|
65
|
+
mode: api_key
|
|
66
|
+
items:
|
|
67
|
+
- openrouter_api_key
|
|
68
|
+
notes:
|
|
69
|
+
- OpenRouter API key required for LLM-based persona generation and behavior prediction
|
|
70
|
+
during evaluation
|
|
71
|
+
- API calls use meta-llama/llama-3.3-70b-instruct model
|
|
72
|
+
resources:
|
|
73
|
+
minimum:
|
|
74
|
+
cpu_cores: 16
|
|
75
|
+
ram_gb: 64
|
|
76
|
+
disk_gb: 150
|
|
77
|
+
gpu_count: 1
|
|
78
|
+
gpu_vram_gb: 48
|
|
79
|
+
gpu_type: NVIDIA A100 or equivalent with high VRAM
|
|
80
|
+
recommended:
|
|
81
|
+
cpu_cores: 32
|
|
82
|
+
ram_gb: 128
|
|
83
|
+
disk_gb: 300
|
|
84
|
+
gpu_count: 2
|
|
85
|
+
gpu_vram_gb: 80
|
|
86
|
+
gpu_type: Multi-GPU setup for parallel training and inference
|
|
87
|
+
environment:
|
|
88
|
+
python: '3.10'
|
|
89
|
+
cuda: '11.8'
|
|
90
|
+
pytorch: 2.1.0
|
|
91
|
+
flash_attn: null
|
|
92
|
+
key_packages:
|
|
93
|
+
- transformers==4.45.0
|
|
94
|
+
- vllm==0.7.3
|
|
95
|
+
- openai>=1.0.0
|
|
96
|
+
- llama-factory @ git+https://github.com/hiyouga/LLaMA-Factory.git
|
|
97
|
+
notes:
|
|
98
|
+
- LLaMA-Factory required for training pipeline (install via git clone and pip install
|
|
99
|
+
-e ".[torch,metrics]")
|
|
100
|
+
- Conda environment strongly recommended for dependency isolation
|
|
101
|
+
- See bundled README/requirements.txt for complete dependency set
|
|
102
|
+
risk_flags:
|
|
103
|
+
- multi_gpu_training
|
|
104
|
+
- llm_api_dependency
|
|
105
|
+
- rate_limiting
|
|
106
|
+
- long_runtime
|
|
107
|
+
risk_notes:
|
|
108
|
+
- Multi-GPU training orchestration increases failure complexity; checkpointing recommended
|
|
109
|
+
- Evaluation relies on OpenRouter API calls; rate limits may cause timeouts at MAX_WORKERS=8
|
|
110
|
+
- Full refinement workflow spans multiple training iterations and 4 evaluation rounds
|
|
111
|
+
- Train iterations use DPO fine-tuning which is sensitive to hyperparameters
|
|
112
|
+
recommended_when: 'Use this benchmark when you need a personalization task that combines
|
|
113
|
+
behavior modeling, preference optimization, and cross-domain generalization testing.
|
|
114
|
+
Ideal for evaluating offline RL methods on dynamic user modeling scenarios requiring
|
|
115
|
+
iterative persona refinement with explicit direction search capabilities.
|
|
116
|
+
|
|
117
|
+
'
|
|
118
|
+
not_recommended_when: 'Do not use this benchmark if you cannot host or access 8B-class
|
|
119
|
+
LLMs, if you lack API infrastructure for OpenRouter calls, or if you need a quick
|
|
120
|
+
baseline task without RL components. Not suitable for environments without GPU capability
|
|
121
|
+
or with strict compute budgets below the minimum resource requirements.
|
|
122
|
+
|
|
123
|
+
'
|
|
124
|
+
paper:
|
|
125
|
+
title: 'DEEPER: Directed Persona Refinement for Dynamic Persona Modeling'
|
|
126
|
+
venue: ACL 2025
|
|
127
|
+
year: 2025
|
|
128
|
+
url: https://arxiv.org/abs/2502.11078
|
|
129
|
+
download:
|
|
130
|
+
url: https://github.com/ResearAI/DeepScientist/releases/download/aisb-v0.0.1/aisb.t3.025_deeper.zip
|
|
131
|
+
archive_type: zip
|
|
132
|
+
local_dir_name: paper-25-DEEPER
|
|
133
|
+
provider: github_release
|
|
134
|
+
repo: ResearAI/DeepScientist
|
|
135
|
+
tag: aisb-v0.0.1
|
|
136
|
+
asset_name: aisb.t3.025_deeper.zip
|
|
137
|
+
sha256: 1c2f13d7dc01d3ff88b03778010e1d29b20d5bb4b687e9953a0b0b0977b0057e
|
|
138
|
+
size_bytes: 3096009
|
|
139
|
+
display:
|
|
140
|
+
palette_seed: plum-sand-persona
|
|
141
|
+
art_style: human-centered
|
|
142
|
+
accent_priority: high
|
|
143
|
+
image_path: ../image/025_aisb.t3.025_deeper.jpg
|
|
@@ -0,0 +1,116 @@
|
|
|
1
|
+
id: aisb.t3.025_deeper
|
|
2
|
+
name: 'DEEPER:用于动态人格建模的导向性人格优化方法'
|
|
3
|
+
version: 0.1.0
|
|
4
|
+
one_line: 基于优化的新型动态人格建模方法,采用离线强化学习和差异驱动的更新方向搜索,涵盖10个领域。
|
|
5
|
+
task_description: '该基准测试包涵盖动态人格建模,采用一种基于优化的新范式,解决了传统再生(替换人格)和扩展(增量追加行为)方法的根本性局限。DEEPER利用预测-行为差异以结构化、奖励驱动的方式引导更新方向,实现人格的持续优化。该方法采用三目标奖励设计,平衡"历史保持"、"当前反映"和"未来推进",通过两阶段迭代离线强化学习和DPO微调进行训练。在4800+用户和10个领域(包括4个未见领域用于跨域泛化测试)上进行评估。主要评估指标是在四轮优化过程中对未来行为预测的MAE。
|
|
6
|
+
|
|
7
|
+
'
|
|
8
|
+
task_mode: experiment_driven
|
|
9
|
+
requires_execution: true
|
|
10
|
+
requires_paper: true
|
|
11
|
+
integrity_level: cas_plus_canary
|
|
12
|
+
snapshot_status: external_eval_required
|
|
13
|
+
support_level: recovery
|
|
14
|
+
time_band: 2-4d
|
|
15
|
+
cost_band: high
|
|
16
|
+
difficulty: hard
|
|
17
|
+
data_access: public
|
|
18
|
+
primary_outputs:
|
|
19
|
+
- mae_round4
|
|
20
|
+
- persona_refinement_model
|
|
21
|
+
- evaluation_report
|
|
22
|
+
launch_profiles:
|
|
23
|
+
- id: quick_eval
|
|
24
|
+
label: 快速评估
|
|
25
|
+
description: '使用eval_round4.py对DEEPER人格建模栈运行一次打包评估。评估第4轮MAE(使用经过4次DEEPER更新后的人格S4预测第5窗口评分)。通过OpenRouter API使用meta-llama/llama-3.3-70b-instruct,支持可配置的并发数。
|
|
26
|
+
|
|
27
|
+
'
|
|
28
|
+
- id: full_refinement
|
|
29
|
+
label: 完整优化流程
|
|
30
|
+
description: '运行完整的人格优化工作流,包括通过LLaMA-Factory进行两阶段离线训练(迭代1和迭代2)、所有轮次的人格更新,以及包含各领域和各轮次MAE报告的下游评估。
|
|
31
|
+
|
|
32
|
+
'
|
|
33
|
+
dataset_download:
|
|
34
|
+
primary_method: huggingface
|
|
35
|
+
sources:
|
|
36
|
+
- https://huggingface.co/datasets/deeper-team/DEEPER_preprocess_data
|
|
37
|
+
- https://huggingface.co/datasets/deeper-team/DEEPER_user_context_data
|
|
38
|
+
- https://huggingface.co/datasets/deeper-team/DEEPER_train_data
|
|
39
|
+
- https://huggingface.co/deeper-team/DEEPER-llama-8B
|
|
40
|
+
notes:
|
|
41
|
+
- 四个数据集组件:预处理数据、用户上下文数据、DEEPER训练数据、演化人格
|
|
42
|
+
- 预处理数据涵盖10个领域,包含按时间顺序的用户评分历史(每用户50+条记录)
|
|
43
|
+
- 用户上下文数据按迭代组织(评估用iteration_1到iteration_4,训练用iteration_1到iteration_2)
|
|
44
|
+
- 训练数据通过自采样和两轮迭代优化构建
|
|
45
|
+
credential_requirements:
|
|
46
|
+
mode: api_key
|
|
47
|
+
items:
|
|
48
|
+
- openrouter_api_key
|
|
49
|
+
notes:
|
|
50
|
+
- 需要OpenRouter API密钥用于评估期间基于LLM的人格生成和行为预测
|
|
51
|
+
- API调用使用meta-llama/llama-3.3-70b-instruct模型
|
|
52
|
+
resources:
|
|
53
|
+
minimum:
|
|
54
|
+
cpu_cores: 16
|
|
55
|
+
ram_gb: 64
|
|
56
|
+
disk_gb: 150
|
|
57
|
+
gpu_count: 1
|
|
58
|
+
gpu_vram_gb: 48
|
|
59
|
+
gpu_type: NVIDIA A100或等效高显存GPU
|
|
60
|
+
recommended:
|
|
61
|
+
cpu_cores: 32
|
|
62
|
+
ram_gb: 128
|
|
63
|
+
disk_gb: 300
|
|
64
|
+
gpu_count: 2
|
|
65
|
+
gpu_vram_gb: 80
|
|
66
|
+
gpu_type: 多GPU配置用于并行训练和推理
|
|
67
|
+
environment:
|
|
68
|
+
python: '3.10'
|
|
69
|
+
cuda: '11.8'
|
|
70
|
+
pytorch: 2.1.0
|
|
71
|
+
flash_attn: null
|
|
72
|
+
key_packages:
|
|
73
|
+
- transformers==4.45.0
|
|
74
|
+
- vllm==0.7.3
|
|
75
|
+
- openai>=1.0.0
|
|
76
|
+
- llama-factory @ git+https://github.com/hiyouga/LLaMA-Factory.git
|
|
77
|
+
notes:
|
|
78
|
+
- LLaMA-Factory是训练流程必需的(通过git clone和pip install -e ".[torch,metrics]"安装)
|
|
79
|
+
- 强烈建议使用Conda环境进行依赖隔离
|
|
80
|
+
- 参见附带的README/requirements.txt获取完整的依赖列表
|
|
81
|
+
risk_flags:
|
|
82
|
+
- multi_gpu_training
|
|
83
|
+
- llm_api_dependency
|
|
84
|
+
- rate_limiting
|
|
85
|
+
- long_runtime
|
|
86
|
+
risk_notes:
|
|
87
|
+
- 多GPU训练编排增加故障复杂性;建议使用检查点
|
|
88
|
+
- 评估依赖OpenRouter API调用;在MAX_WORKERS=8时可能因速率限制导致超时
|
|
89
|
+
- 完整优化工作流跨越多轮训练迭代和4轮评估
|
|
90
|
+
- 训练迭代使用DPO微调,对超参数敏感
|
|
91
|
+
recommended_when: '当您需要一个结合行为建模、偏好优化和跨域泛化测试的个性化任务时使用此基准。特别适合评估离线强化学习方法在需要迭代人格优化和显式方向搜索能力的动态用户建模场景中的表现。
|
|
92
|
+
|
|
93
|
+
'
|
|
94
|
+
not_recommended_when: '如果您无法托管或访问8B级LLM、缺乏OpenRouter调用的API基础设施,或需要一个不含RL组件的快速基线任务,请勿使用此基准。不适合没有GPU能力或计算预算低于最低资源要求的环境。
|
|
95
|
+
|
|
96
|
+
'
|
|
97
|
+
paper:
|
|
98
|
+
title: 'DEEPER: Directed Persona Refinement for Dynamic Persona Modeling'
|
|
99
|
+
venue: ACL 2025
|
|
100
|
+
year: 2025
|
|
101
|
+
url: https://arxiv.org/abs/2502.11078
|
|
102
|
+
download:
|
|
103
|
+
url: https://github.com/ResearAI/DeepScientist/releases/download/aisb-v0.0.1/aisb.t3.025_deeper.zip
|
|
104
|
+
archive_type: zip
|
|
105
|
+
local_dir_name: paper-25-DEEPER
|
|
106
|
+
provider: github_release
|
|
107
|
+
repo: ResearAI/DeepScientist
|
|
108
|
+
tag: aisb-v0.0.1
|
|
109
|
+
asset_name: aisb.t3.025_deeper.zip
|
|
110
|
+
sha256: 1c2f13d7dc01d3ff88b03778010e1d29b20d5bb4b687e9953a0b0b0977b0057e
|
|
111
|
+
size_bytes: 3096009
|
|
112
|
+
display:
|
|
113
|
+
palette_seed: plum-sand-persona
|
|
114
|
+
art_style: human-centered
|
|
115
|
+
accent_priority: high
|
|
116
|
+
image_path: ../image/025_aisb.t3.025_deeper.jpg
|
|
@@ -0,0 +1,195 @@
|
|
|
1
|
+
schema_version: 1
|
|
2
|
+
id: aisb.t3.026_gartkg
|
|
3
|
+
name: DGAR – Deep Generative Adaptive Replay for Temporal Knowledge Graph Reasoning
|
|
4
|
+
version: 0.1.0
|
|
5
|
+
one_line: 'Continual temporal knowledge-graph reasoning benchmark: train a diffusion-guided
|
|
6
|
+
adaptive replay model (DGAR) on ICEWS14s-style TKG snapshots and measure link-prediction
|
|
7
|
+
accuracy (MRR, Hits@1/3/10) under sequential task arrival with catastrophic-forgetting
|
|
8
|
+
mitigation.
|
|
9
|
+
|
|
10
|
+
'
|
|
11
|
+
task_description: 'This benchmark packages the DGAR method for continual Temporal
|
|
12
|
+
Knowledge Graph Reasoning (TKGR). The agent must (1) build Historical Context Prompts
|
|
13
|
+
from past KG snapshots, (2) pre-train a diffusion model that generates historical
|
|
14
|
+
entity distributions, (3) run continual learning over a stream of TKG snapshots
|
|
15
|
+
using adaptive replay to mitigate catastrophic forgetting, and (4) evaluate link-prediction
|
|
16
|
+
quality (MRR, Hits@1, Hits@3, Hits@10) on held-out test splits at each time step.
|
|
17
|
+
The core entry point is `src/main.py --gpu 0 --dataset ICEWS14s`. The model combines
|
|
18
|
+
RE-GCN graph reasoning with a Transformer-based DDPM for entity-distribution generation
|
|
19
|
+
and a layer-by-layer deep adaptive replay mechanism. Hyperparameter ranges are defined
|
|
20
|
+
in `src/hyperparameter_range.py`. The primary metric (`accuracy` / MRR) is code-backed
|
|
21
|
+
via `diffusion/difffu_21.py`. No benchmark execution was performed during the packaging
|
|
22
|
+
pass; the agent must execute and validate all metrics at runtime.
|
|
23
|
+
|
|
24
|
+
'
|
|
25
|
+
capability_tags:
|
|
26
|
+
- research_code_optimization
|
|
27
|
+
- temporal_kg_reasoning
|
|
28
|
+
- graph_learning
|
|
29
|
+
- continual_learning
|
|
30
|
+
- generative_replay
|
|
31
|
+
- diffusion_models
|
|
32
|
+
aisb_direction: T3
|
|
33
|
+
track_fit:
|
|
34
|
+
- paper_track
|
|
35
|
+
- benchmark_track
|
|
36
|
+
task_mode: experiment_driven
|
|
37
|
+
requires_execution: true
|
|
38
|
+
requires_paper: true
|
|
39
|
+
integrity_level: cas_plus_canary
|
|
40
|
+
snapshot_status: runnable_not_verified
|
|
41
|
+
support_level: advanced
|
|
42
|
+
cost_band: medium
|
|
43
|
+
time_band: 6-24h
|
|
44
|
+
difficulty: hard
|
|
45
|
+
data_access: public
|
|
46
|
+
primary_outputs:
|
|
47
|
+
- mrr_filter
|
|
48
|
+
- hits_at_1
|
|
49
|
+
- hits_at_3
|
|
50
|
+
- hits_at_10
|
|
51
|
+
- temporal_kg_checkpoint
|
|
52
|
+
- reasoning_report
|
|
53
|
+
launch_profiles:
|
|
54
|
+
- id: quick_check
|
|
55
|
+
label: Quick Check
|
|
56
|
+
description: 'Run a single dataset (e.g. ICEWS14s) through `src/main.py` with default
|
|
57
|
+
hyperparameters for one pass over the continual task stream. Verifies that the
|
|
58
|
+
training loop, diffusion generation, and evaluation pipeline execute end-to-end.
|
|
59
|
+
Expected wall-time: 1–3 hours on a single GPU.
|
|
60
|
+
|
|
61
|
+
'
|
|
62
|
+
- id: full_train_eval
|
|
63
|
+
label: Full Train + Eval
|
|
64
|
+
description: 'Run the full continual training and evaluation workflow including
|
|
65
|
+
hyperparameter sweeps over n_hidden, n_layers, and dropout (see `src/hyperparameter_range.py`).
|
|
66
|
+
Covers all TKG snapshot tasks with adaptive replay, diffusion-enhanced generation,
|
|
67
|
+
and deep adaptive replay. Expected wall-time: 6–24 hours depending on dataset
|
|
68
|
+
size and GPU.
|
|
69
|
+
|
|
70
|
+
'
|
|
71
|
+
- id: hp_sweep
|
|
72
|
+
label: Hyperparameter Sweep
|
|
73
|
+
description: 'Systematic grid search over the ranges in `src/hyperparameter_range.py`:
|
|
74
|
+
n_hidden ∈ {100,200,300,400}, n_layers ∈ {1,2}, dropout ∈ {0.2,0.4}, n_bases=100.
|
|
75
|
+
Produces a comparison table of MRR and Hits@k across configurations.
|
|
76
|
+
|
|
77
|
+
'
|
|
78
|
+
dataset_download:
|
|
79
|
+
primary_method: bundled
|
|
80
|
+
sources:
|
|
81
|
+
- kind: local
|
|
82
|
+
url: null
|
|
83
|
+
access: public
|
|
84
|
+
note: 'Dataset files (e.g. ICEWS14s) are expected to be bundled in the snapshot
|
|
85
|
+
or loadable via the code''s built-in data-reading utilities (see `rgcn/knowledge_graph.py`).
|
|
86
|
+
The README indicates running with `--dataset ICEWS14s`.
|
|
87
|
+
|
|
88
|
+
'
|
|
89
|
+
notes:
|
|
90
|
+
- Dataset size is modest (tens of MB for ICEWS14s). No large external download is
|
|
91
|
+
required.
|
|
92
|
+
- If datasets are not bundled, check the original ICEWS14s public sources.
|
|
93
|
+
credential_requirements:
|
|
94
|
+
mode: none
|
|
95
|
+
items: []
|
|
96
|
+
notes: []
|
|
97
|
+
resources:
|
|
98
|
+
minimum:
|
|
99
|
+
cpu_cores: 8
|
|
100
|
+
ram_gb: 32
|
|
101
|
+
disk_gb: 80
|
|
102
|
+
gpu_count: 1
|
|
103
|
+
gpu_vram_gb: 16
|
|
104
|
+
recommended:
|
|
105
|
+
cpu_cores: 16
|
|
106
|
+
ram_gb: 64
|
|
107
|
+
disk_gb: 150
|
|
108
|
+
gpu_count: 1
|
|
109
|
+
gpu_vram_gb: 24
|
|
110
|
+
environment:
|
|
111
|
+
python: '3.10'
|
|
112
|
+
cuda: '11.8'
|
|
113
|
+
pytorch: 2.1.0
|
|
114
|
+
flash_attn: null
|
|
115
|
+
key_packages:
|
|
116
|
+
- dgl
|
|
117
|
+
- fitlog
|
|
118
|
+
- umap-learn
|
|
119
|
+
- scikit-learn
|
|
120
|
+
- scipy
|
|
121
|
+
- tqdm
|
|
122
|
+
- matplotlib
|
|
123
|
+
notes:
|
|
124
|
+
- See bundled requirements for exact pinned versions.
|
|
125
|
+
- Code imports dgl, fitlog, umap, sklearn, scipy – all must be installed.
|
|
126
|
+
- The diffusion module uses a Transformer-based architecture; no external pretrained
|
|
127
|
+
weights are referenced.
|
|
128
|
+
risk_flags:
|
|
129
|
+
- unverified_execution
|
|
130
|
+
- implicit_dataset_assumption
|
|
131
|
+
- fitlog_dependency
|
|
132
|
+
risk_notes:
|
|
133
|
+
- 'No benchmark execution was performed during the packaging pass. All metric values
|
|
134
|
+
must be produced and validated by the agent at runtime.
|
|
135
|
+
|
|
136
|
+
'
|
|
137
|
+
- 'The README entry point has a formatting artifact (`--dataset ICEWS14s# DGAR`);
|
|
138
|
+
the actual flag is likely `--dataset ICEWS14s`. The agent should verify the correct
|
|
139
|
+
dataset name.
|
|
140
|
+
|
|
141
|
+
'
|
|
142
|
+
- 'fitlog is used for experiment tracking; if not installed or configured it may cause
|
|
143
|
+
import errors.
|
|
144
|
+
|
|
145
|
+
'
|
|
146
|
+
- 'Code depends on dgl (Deep Graph Library) which has CUDA-version-specific builds.
|
|
147
|
+
Ensure the dgl version matches the installed CUDA toolkit.
|
|
148
|
+
|
|
149
|
+
'
|
|
150
|
+
- 'The diffusion model pre-training and TKGR training are interleaved in the continual
|
|
151
|
+
learning loop; GPU memory may spike during the diffusion generation + gradient guidance
|
|
152
|
+
step (Eq. 8–10 in paper).
|
|
153
|
+
|
|
154
|
+
'
|
|
155
|
+
recommended_when: 'Use this benchmark when you want a graph-learning task that combines
|
|
156
|
+
continual learning with generative replay and temporal knowledge graph reasoning.
|
|
157
|
+
Suitable for evaluating agents that must navigate multi-component ML pipelines (GNN
|
|
158
|
+
+ diffusion + replay) with hyperparameter optimization under a continual-learning
|
|
159
|
+
protocol.
|
|
160
|
+
|
|
161
|
+
'
|
|
162
|
+
not_recommended_when: 'Not suitable if you need a lightweight text-only task, cannot
|
|
163
|
+
provide ≥16 GB GPU VRAM for the combined graph + diffusion components, or require
|
|
164
|
+
a benchmark with pre-verified baseline metric values. Also not ideal if you need
|
|
165
|
+
multi-GPU distributed training support (code appears single-GPU only).
|
|
166
|
+
|
|
167
|
+
'
|
|
168
|
+
paper:
|
|
169
|
+
title: A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge
|
|
170
|
+
Graph Reasoning
|
|
171
|
+
authors:
|
|
172
|
+
- Zhiyu Zhang
|
|
173
|
+
- Wei Chen
|
|
174
|
+
- Youfang Lin
|
|
175
|
+
- Huaiyu Wan
|
|
176
|
+
venue: ACL 2025
|
|
177
|
+
year: 2025
|
|
178
|
+
url: https://aclanthology.org/2025.acl-long.537/
|
|
179
|
+
download:
|
|
180
|
+
url: https://github.com/ResearAI/DeepScientist/releases/download/aisb-v0.0.1/aisb.t3.026_gartkg.zip
|
|
181
|
+
archive_type: zip
|
|
182
|
+
local_dir_name: paper-26-GARTKG
|
|
183
|
+
provider: github_release
|
|
184
|
+
repo: ResearAI/DeepScientist
|
|
185
|
+
tag: aisb-v0.0.1
|
|
186
|
+
asset_name: aisb.t3.026_gartkg.zip
|
|
187
|
+
sha256: 010bac738fef854266a59273862289b34ebafc7890602fd29f73613783e594f2
|
|
188
|
+
size_bytes: 81352
|
|
189
|
+
commercial:
|
|
190
|
+
annual_fee: null
|
|
191
|
+
display:
|
|
192
|
+
palette_seed: forest-copper-temporal
|
|
193
|
+
art_style: graph-science
|
|
194
|
+
accent_priority: high
|
|
195
|
+
image_path: ../image/026_aisb.t3.026_gartkg.jpg
|
|
@@ -0,0 +1,127 @@
|
|
|
1
|
+
schema_version: 1
|
|
2
|
+
id: aisb.t3.026_gartkg
|
|
3
|
+
name: DGAR – 用于时序知识图谱推理的深度生成式自适应回放
|
|
4
|
+
version: 0.1.0
|
|
5
|
+
one_line: 持续性时序知识图谱推理基准:通过自适应回放训练扩散引导模型(DGAR),在ICEWS14s风格的时间知识图谱快照上完成链接预测任务,在顺序任务到达场景下评估MRR和Hits@1/3/10指标,同时实现灾难性遗忘缓解。
|
|
6
|
+
task_description: 本基准测试封装了DGAR方法用于持续性时序知识图谱推理(TKGR)。智能体需要完成以下工作:(1)从历史知识图谱快照构建上下文提示,(2)预训练扩散模型以生成历史实体分布,(3)在时间知识图谱快照流上运行持续学习,使用自适应回放机制缓解灾难性遗忘,(4)在每个时间步骤的保留测试集上评估链接预测质量(MRR, Hits@1, Hits@3, Hits@10)。核心入口为 `src/main.py --gpu 0 --dataset ICEWS14s`。模型结合了RE-GCN图推理、基于Transformer的DDPM用于实体分布生成,以及逐层深度自适应回放机制。超参数范围定义在 `src/hyperparameter_range.py` 中。主指标(`accuracy`/MRR)通过 `diffusion/difffu_21.py` 实现代码验证。打包过程中未执行基准测试;智能体必须在运行时执行并验证所有指标。
|
|
7
|
+
capability_tags:
|
|
8
|
+
- research_code_optimization
|
|
9
|
+
- temporal_kg_reasoning
|
|
10
|
+
- graph_learning
|
|
11
|
+
- continual_learning
|
|
12
|
+
- generative_replay
|
|
13
|
+
- diffusion_models
|
|
14
|
+
aisb_direction: T3
|
|
15
|
+
track_fit:
|
|
16
|
+
- paper_track
|
|
17
|
+
- benchmark_track
|
|
18
|
+
task_mode: experiment_driven
|
|
19
|
+
requires_execution: true
|
|
20
|
+
requires_paper: true
|
|
21
|
+
integrity_level: cas_plus_canary
|
|
22
|
+
snapshot_status: runnable_not_verified
|
|
23
|
+
support_level: advanced
|
|
24
|
+
cost_band: medium
|
|
25
|
+
time_band: 6-24h
|
|
26
|
+
difficulty: hard
|
|
27
|
+
data_access: public
|
|
28
|
+
primary_outputs:
|
|
29
|
+
- mrr_filter
|
|
30
|
+
- hits_at_1
|
|
31
|
+
- hits_at_3
|
|
32
|
+
- hits_at_10
|
|
33
|
+
- temporal_kg_checkpoint
|
|
34
|
+
- reasoning_report
|
|
35
|
+
launch_profiles:
|
|
36
|
+
- id: quick_check
|
|
37
|
+
label: 快速检查
|
|
38
|
+
description: 使用默认超参数在单个数据集(如ICEWS14s)上运行 `src/main.py`,对持续任务流执行一次完整遍历。验证训练循环、扩散生成和评估管道的端到端执行。预计耗时:单GPU 1-3小时。
|
|
39
|
+
- id: full_train_eval
|
|
40
|
+
label: 完整训练+评估
|
|
41
|
+
description: 运行完整的持续训练和评估工作流,包括对n_hidden、n_layers和dropout的超参数搜索(见 `src/hyperparameter_range.py`)。覆盖所有时间知识图谱快照任务,包含自适应回放、扩散增强生成和深度自适应回放。预计耗时:6-24小时,取决于数据集大小和GPU配置。
|
|
42
|
+
- id: hp_sweep
|
|
43
|
+
label: 超参数搜索
|
|
44
|
+
description: 对 `src/hyperparameter_range.py` 中的范围进行系统性网格搜索:n_hidden ∈ {100,200,300,400}, n_layers ∈ {1,2}, dropout ∈ {0.2,0.4}, n_bases=100。生成不同配置下MRR和Hits@k的对比表。
|
|
45
|
+
dataset_download:
|
|
46
|
+
primary_method: bundled
|
|
47
|
+
sources:
|
|
48
|
+
- kind: local
|
|
49
|
+
url: null
|
|
50
|
+
access: public
|
|
51
|
+
note: 数据集文件(如ICEWS14s)预期随快照打包提供,或可通过代码内置的数据读取工具加载(见 `rgcn/knowledge_graph.py`)。README指示使用 `--dataset ICEWS14s` 运行。
|
|
52
|
+
notes:
|
|
53
|
+
- 数据集规模较小(ICEWS14s仅数十MB)。无需下载大型外部文件。
|
|
54
|
+
- 如数据集未打包,请查阅原始ICEWS14s公开数据源。
|
|
55
|
+
credential_requirements:
|
|
56
|
+
mode: none
|
|
57
|
+
items: []
|
|
58
|
+
notes: []
|
|
59
|
+
resources:
|
|
60
|
+
minimum:
|
|
61
|
+
cpu_cores: 8
|
|
62
|
+
ram_gb: 32
|
|
63
|
+
disk_gb: 80
|
|
64
|
+
gpu_count: 1
|
|
65
|
+
gpu_vram_gb: 16
|
|
66
|
+
recommended:
|
|
67
|
+
cpu_cores: 16
|
|
68
|
+
ram_gb: 64
|
|
69
|
+
disk_gb: 150
|
|
70
|
+
gpu_count: 1
|
|
71
|
+
gpu_vram_gb: 24
|
|
72
|
+
environment:
|
|
73
|
+
python: '3.10'
|
|
74
|
+
cuda: '11.8'
|
|
75
|
+
pytorch: 2.1.0
|
|
76
|
+
flash_attn: null
|
|
77
|
+
key_packages:
|
|
78
|
+
- dgl
|
|
79
|
+
- fitlog
|
|
80
|
+
- umap-learn
|
|
81
|
+
- scikit-learn
|
|
82
|
+
- scipy
|
|
83
|
+
- tqdm
|
|
84
|
+
- matplotlib
|
|
85
|
+
notes:
|
|
86
|
+
- 详见打包的requirements文件以获取精确的版本锁定。
|
|
87
|
+
- 代码依赖dgl、fitlog、umap、sklearn、scipy——全部必须安装。
|
|
88
|
+
- 扩散模块使用基于Transformer的架构;不引用外部预训练权重。
|
|
89
|
+
risk_flags:
|
|
90
|
+
- unverified_execution
|
|
91
|
+
- implicit_dataset_assumption
|
|
92
|
+
- fitlog_dependency
|
|
93
|
+
risk_notes:
|
|
94
|
+
- 打包过程中未执行基准测试。所有指标值必须由智能体在运行时生成并验证。
|
|
95
|
+
- README入口点存在格式问题(`--dataset ICEWS14s# DGAR`);实际参数可能是 `--dataset ICEWS14s`。智能体应验证正确的数据集名称。
|
|
96
|
+
- fitlog用于实验追踪;如果未安装或未配置可能导致导入错误。
|
|
97
|
+
- 代码依赖dgl(深度图库),它有针对CUDA版本特定构建的版本。请确保dgl版本与已安装的CUDA工具包匹配。
|
|
98
|
+
- 扩散模型预训练和知识图谱推理训练在持续学习循环中交错进行;在扩散生成+梯度引导步骤(论文中的公式8-10)期间GPU显存可能达到峰值。
|
|
99
|
+
recommended_when: 当你需要结合持续学习和生成式回放的图学习任务,用于时间知识图谱推理时使用此基准。适用于评估需要处理多组件ML流程(GNN + 扩散 + 回放)且在持续学习协议下进行超参数优化的智能体。
|
|
100
|
+
not_recommended_when: 如果你需要轻量级纯文本任务、无法提供≥16GB GPU显存来支持图+扩散组件组合,或需要一个预验证基准指标值的基准测试,则不适合使用。同样,如果你需要多GPU分布式训练支持(代码似乎仅支持单GPU),也不理想。
|
|
101
|
+
paper:
|
|
102
|
+
title: A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning
|
|
103
|
+
authors:
|
|
104
|
+
- Zhiyu Zhang
|
|
105
|
+
- Wei Chen
|
|
106
|
+
- Youfang Lin
|
|
107
|
+
- Huaiyu Wan
|
|
108
|
+
venue: ACL 2025
|
|
109
|
+
year: 2025
|
|
110
|
+
url: https://aclanthology.org/2025.acl-long.537/
|
|
111
|
+
download:
|
|
112
|
+
url: https://github.com/ResearAI/DeepScientist/releases/download/aisb-v0.0.1/aisb.t3.026_gartkg.zip
|
|
113
|
+
archive_type: zip
|
|
114
|
+
local_dir_name: paper-26-GARTKG
|
|
115
|
+
provider: github_release
|
|
116
|
+
repo: ResearAI/DeepScientist
|
|
117
|
+
tag: aisb-v0.0.1
|
|
118
|
+
asset_name: aisb.t3.026_gartkg.zip
|
|
119
|
+
sha256: 010bac738fef854266a59273862289b34ebafc7890602fd29f73613783e594f2
|
|
120
|
+
size_bytes: 81352
|
|
121
|
+
commercial:
|
|
122
|
+
annual_fee: null
|
|
123
|
+
display:
|
|
124
|
+
palette_seed: forest-copper-temporal
|
|
125
|
+
art_style: graph-science
|
|
126
|
+
accent_priority: high
|
|
127
|
+
image_path: ../image/026_aisb.t3.026_gartkg.jpg
|