verifiers 0.1.15.dev168__tar.gz → 0.1.15.dev170__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/PKG-INFO +5 -1
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/pyproject.toml +4 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/README.md +37 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/__init__.py +9 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/openseeker/README.md +33 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/openseeker/__init__.py +5 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/openseeker/taskset.py +603 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/README.md +52 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/__init__.py +5 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/__init__.py +17 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/api_tools/__init__.py +5 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/api_tools/tool_pdf.py +275 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/eval_toolkit.py +1119 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/evaluator.py +1271 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/llm_client/__init__.py +5 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/llm_client/base_client.py +15 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/prompts/__init__.py +4 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/prompts/cache_prompts.py +15 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/utils/__init__.py +7 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/utils/cache_filesys.py +45 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/utils/load_eval_script.py +107 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/utils/misc.py +106 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/utils/tool_visit.py +69 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/utils/url_tools.py +27 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/obj_task_eval/verification_tree.py +153 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/quest/taskset.py +667 -0
- verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/search_tasksets.py +36 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/.gitignore +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/LICENSE +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/README.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/AGENTS.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/README.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/conftest.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_browser_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_build_script.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_cli_agent_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_client_auth_errors.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_client_config.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_client_multimodal_types.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_composable_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_context_token_metrics.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_decorator_ranks.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_endpoint_registry.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_env_group.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_env_server.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_environment.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_environment_extra.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_envs.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_error_chain.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_eval_cli.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_eval_display.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_eval_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_gepa_cli.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_gepa_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_gym_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_harbor_env_mcp.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_imports.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_init_script.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_install_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_interception_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_langchain_deep_agents_wikispeedia.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_lean_task.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_logging.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_math_rubric.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_maybe_think_parser.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_mcp_search_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_message_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_message_utils_multimodal.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_multiturn_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_nemorl_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_openai_chat_completions_token_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_openai_responses_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_opencode_harbor.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_opencode_rlm_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_openenv_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_parser.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_path_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_per_turn_timing.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_pricing_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_prime_plugin.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_renderer_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_renderer_e2e.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_rlm_composable_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_rubric.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_rubric_group.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_sandbox_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_sandbox_mixin.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_save_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_setup_script.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_singleturn_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_stateful_tool_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_think_parser.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_tool_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_tool_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_trajectory_processing.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_tui_info_formatting.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_types.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_bfcl.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_config_extension.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_empty_completions.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_endpoint_protocols.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_example_counts.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_group_reward_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_harbor_cli.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_mini_swe_agent.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_nemo_gym_harness.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_openenv_taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_openreward_taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_replay_harness.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_rlm_swe.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_runtime_lifecycle.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_scoring_functions.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_taskset_bindings.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_taskset_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_v1_textarena_taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_wiki_search_v1.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_wordle_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_wordle_v1_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/tests/test_xml_parser.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/AGENTS.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/commands/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/commands/build.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/commands/eval.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/commands/gepa.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/commands/init.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/commands/install.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/commands/setup.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/plugins/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/plugins/prime.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/cli/tui.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/clients/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/clients/anthropic_messages_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/clients/client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/clients/nemorl_chat_completions_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/clients/openai_chat_completions_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/clients/openai_chat_completions_token_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/clients/openai_completions_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/clients/openai_responses_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/clients/renderer_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/decorators.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/AGENTS.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/env_group.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/environment.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/README.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/cli_agent_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/README.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/_filter.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/composable_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/harness.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/harnesses/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/harnesses/mini_swe_agent.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/harnesses/opencode.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/harnesses/prompt.txt +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/harnesses/rlm.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/swe_debug_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/task.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/cp/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/cp/cp_task.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/cp/test_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/harbor/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/harbor/harbor.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/lean/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/lean/lean_task.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/math/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/math/math_task.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/README.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/multi_swe/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/multi_swe/extract_fix_patch.sh +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/multi_swe/taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/openswe/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/openswe/taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/r2e_gym/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/r2e_gym/log_parser.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/r2e_gym/taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/scale_swe/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/scale_swe/taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/shared/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/shared/test_patch.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/swe_bench/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/swe_bench/taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/swe_lego/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/swe_lego/taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/swe_rebench_v2/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/swe_rebench_v2/log_parsers.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/swe_rebench_v2/taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/swe_smith/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/swe_smith/taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/composable/tasksets/swe/swe_tasksets.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/gym_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/harbor_env/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/harbor_env/env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/harbor_env/mcp.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/mcp_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/opencode_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/opencode_qa_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/opencode_rlm_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/sandbox_mixin.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/utils/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/utils/file_locks.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/experimental/utils/git_checkout_cache.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/README.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/browser_env/README.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/browser_env/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/browser_env/browser_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/browser_env/modes/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/browser_env/modes/base.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/browser_env/modes/cua_mode.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/browser_env/modes/dom_mode.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/openenv_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/reasoninggym_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/integrations/textarena_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/multiturn_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/python_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/sandbox_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/singleturn_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/stateful_tool_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/envs/tool_env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/errors.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/gepa/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/gepa/adapter.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/gepa/config.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/gepa/display.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/gepa/gepa_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/parsers/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/parsers/maybe_think_parser.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/parsers/parser.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/parsers/think_parser.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/parsers/xml_parser.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rl/README.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rl/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rl/inference/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rl/inference/client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rl/inference/server.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rl/trainer/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rl/trainer/config.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rl/trainer/orchestrator.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rl/trainer/trainer.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rl/trainer/utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rubrics/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rubrics/experimental/hybrid_math_rubric.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rubrics/judge_rubric.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rubrics/math_rubric.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rubrics/rubric.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/rubrics/rubric_group.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/build.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/eval.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/gepa.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/init.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/install.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/rl.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/setup.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/train.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/tui.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/scripts/vllm.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/serve/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/serve/client/env_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/serve/client/zmq_env_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/serve/server/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/serve/server/env_router.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/serve/server/env_server.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/serve/server/env_worker.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/serve/server/zmq_env_server.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/serve/types.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/types.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/async_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/client_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/config_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/data_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/display_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/env_config_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/env_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/error_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/eval_display.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/eval_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/heartbeat.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/import_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/install_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/interception_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/logging_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/message_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/metric_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/path_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/pricing_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/process_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/response_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/save_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/serve_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/thread_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/threaded_sandbox_client.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/tool_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/usage_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/utils/version_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/ENVIRONMENT_BEST_PRACTICES.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/README.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/RE_MIGRATION.md +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/artifact.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/config.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/env.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/harness.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/model.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/program.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/runtime.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/runtime_handles.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/sandbox.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/state.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/task.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/taskset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/toolset.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/types.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/user.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/__init__.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/binding_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/config_callable_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/config_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/endpoint_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/json_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/judge_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/lifecycle_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/logging_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/mcp_proxy_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/mcp_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/object_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/program_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/prompt_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/runtime_owner_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/runtime_registry.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/sandbox_program_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/sandbox_python_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/sandbox_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/scoring_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/serialization_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/task_freeze_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/taskset_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/tool_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/toolset_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/trajectory_utils.py +0 -0
- {verifiers-0.1.15.dev168 → verifiers-0.1.15.dev170}/verifiers/v1/utils/usage_utils.py +0 -0
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.4
|
|
2
2
|
Name: verifiers
|
|
3
|
-
Version: 0.1.15.
|
|
3
|
+
Version: 0.1.15.dev170
|
|
4
4
|
Summary: Verifiers: Environments for LLM Reinforcement Learning
|
|
5
5
|
Project-URL: Homepage, https://github.com/primeintellect-ai/verifiers
|
|
6
6
|
Project-URL: Documentation, https://github.com/primeintellect-ai/verifiers
|
|
@@ -22,8 +22,10 @@ Classifier: Programming Language :: Python :: 3.13
|
|
|
22
22
|
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
|
|
23
23
|
Classifier: Topic :: Software Development :: Libraries :: Python Modules
|
|
24
24
|
Requires-Python: <3.14,>=3.10
|
|
25
|
+
Requires-Dist: aiohttp>=3.9.0
|
|
25
26
|
Requires-Dist: aiolimiter>=1.2.1
|
|
26
27
|
Requires-Dist: anthropic>=0.78.0
|
|
28
|
+
Requires-Dist: certifi
|
|
27
29
|
Requires-Dist: datasets<4.7.0,>=3.0.0
|
|
28
30
|
Requires-Dist: gepa
|
|
29
31
|
Requires-Dist: httpx>=0.27.0
|
|
@@ -35,10 +37,12 @@ Requires-Dist: nest-asyncio>=1.6.0
|
|
|
35
37
|
Requires-Dist: numpy
|
|
36
38
|
Requires-Dist: openai-agents>=0.0.7
|
|
37
39
|
Requires-Dist: openai>=1.108.1
|
|
40
|
+
Requires-Dist: pillow
|
|
38
41
|
Requires-Dist: prime-pydantic-config[toml]
|
|
39
42
|
Requires-Dist: prime-sandboxes>=0.2.25
|
|
40
43
|
Requires-Dist: prime-tunnel>=0.1.6
|
|
41
44
|
Requires-Dist: pydantic>=2.11.9
|
|
45
|
+
Requires-Dist: pymupdf
|
|
42
46
|
Requires-Dist: pyzmq>=27.1.0
|
|
43
47
|
Requires-Dist: regex<2026.4.4
|
|
44
48
|
Requires-Dist: requests
|
|
@@ -53,6 +53,10 @@ dependencies = [
|
|
|
53
53
|
"setproctitle>=1.3.0",
|
|
54
54
|
"regex<2026.4.4",
|
|
55
55
|
"httpx>=0.27.0",
|
|
56
|
+
"aiohttp>=3.9.0",
|
|
57
|
+
"pymupdf",
|
|
58
|
+
"pillow",
|
|
59
|
+
"certifi",
|
|
56
60
|
"prime-pydantic-config[toml]",
|
|
57
61
|
"uvloop>=0.21.0; sys_platform != 'win32' and sys_platform != 'cygwin' and platform_python_implementation != 'PyPy'",
|
|
58
62
|
]
|
|
@@ -0,0 +1,37 @@
|
|
|
1
|
+
# Search Tasksets
|
|
2
|
+
|
|
3
|
+
Composable search/research tasksets for agents that solve live information-seeking tasks in a sandbox.
|
|
4
|
+
|
|
5
|
+
The search family is intentionally backend-oriented, mirroring the SWE taskset pattern while keeping the task contract research-centric: each task expects a single final answer rather than a code patch. Agents may use web/search tools, browser helpers, or other sandbox resources provided by the paired environment.
|
|
6
|
+
|
|
7
|
+
## Backends
|
|
8
|
+
|
|
9
|
+
| Backend | Source | Default dataset | Status |
|
|
10
|
+
|---|---|---|---|
|
|
11
|
+
| `openseeker` | [PolarSeeker/OpenSeeker](https://github.com/PolarSeeker/OpenSeeker) | [`PolarSeeker/OpenSeeker-v1-Data`](https://huggingface.co/datasets/PolarSeeker/OpenSeeker-v1-Data) | Binary semantic answer judge |
|
|
12
|
+
| `quest` | [OSU-NLP-Group/QUEST](https://github.com/OSU-NLP-Group/QUEST) | [`osunlp/QUEST-RL-Data`](https://huggingface.co/datasets/osunlp/QUEST-RL-Data) | Objective tasks supported |
|
|
13
|
+
|
|
14
|
+
## Usage
|
|
15
|
+
|
|
16
|
+
```python
|
|
17
|
+
from verifiers.envs.experimental.composable.tasksets.search import make_search_taskset
|
|
18
|
+
|
|
19
|
+
taskset = make_search_taskset(backend="openseeker")
|
|
20
|
+
taskset = make_search_taskset(backend="quest", category="objective")
|
|
21
|
+
```
|
|
22
|
+
|
|
23
|
+
`make_search_taskset()` dispatches by backend name. Unknown backends raise `ValueError` with the available backend list.
|
|
24
|
+
|
|
25
|
+
## Output Contract
|
|
26
|
+
|
|
27
|
+
Search tasksets should define their own output contract. The `quest` and `openseeker` backends expect the agent to write one final researched response to `/task/answer.txt`, including supporting URLs/citations when available. Scratch reasoning, tool traces, and logs should not be written as the final answer.
|
|
28
|
+
|
|
29
|
+
## Error Handling
|
|
30
|
+
|
|
31
|
+
Search tasksets should use the framework error taxonomy for infrastructure failures:
|
|
32
|
+
|
|
33
|
+
- `vf.SandboxError` for sandbox setup, command, or lifecycle failures.
|
|
34
|
+
- `vf.ModelError` for judge/model provider failures.
|
|
35
|
+
- `vf.InfraError` for dataset, evaluator, or external runtime failures.
|
|
36
|
+
|
|
37
|
+
Incorrect answers should not set `state["error"]`; they should score normally, often as `0.0`.
|
verifiers-0.1.15.dev170/verifiers/envs/experimental/composable/tasksets/search/openseeker/README.md
ADDED
|
@@ -0,0 +1,33 @@
|
|
|
1
|
+
# OpenSeeker Search Taskset
|
|
2
|
+
|
|
3
|
+
Composable search taskset for [`PolarSeeker/OpenSeeker-v1-Data`](https://huggingface.co/datasets/PolarSeeker/OpenSeeker-v1-Data), the OpenSeeker v1 release associated with arXiv `2603.15594`.
|
|
4
|
+
|
|
5
|
+
OpenSeeker v1 data contains synthesized deep-search QA pairs plus trajectories generated with `search` and `visit` tools. The public OpenSeeker evaluator scores only the final answer: it sends the question, gold answer, and model response to an LLM judge and expects `A` for correct or `B` for incorrect. This backend preserves that binary semantic answer-judge contract.
|
|
6
|
+
|
|
7
|
+
## Usage
|
|
8
|
+
|
|
9
|
+
```python
|
|
10
|
+
from verifiers.envs.experimental.composable.tasksets.search import make_search_taskset
|
|
11
|
+
|
|
12
|
+
taskset = make_search_taskset(backend="openseeker")
|
|
13
|
+
```
|
|
14
|
+
|
|
15
|
+
## Arguments
|
|
16
|
+
|
|
17
|
+
| Argument | Default | Description |
|
|
18
|
+
|---|---:|---|
|
|
19
|
+
| `dataset_name` | `PolarSeeker/OpenSeeker-v1-Data` | Hugging Face dataset name. |
|
|
20
|
+
| `split` | `train` | Dataset split. |
|
|
21
|
+
| `trajectory_correctness` | `Correct` | Keep rows with this trajectory label. Use `None` or `all` for all rows. |
|
|
22
|
+
| `min_tool_calls` | `None` | Optional lower bound for `number of tool calls`. |
|
|
23
|
+
| `max_tool_calls` | `None` | Optional upper bound for `number of tool calls`. |
|
|
24
|
+
| `include_trajectory` | `False` | Include the large source trajectory in task metadata. |
|
|
25
|
+
| `answer_file` | `/task/answer.txt` | Final answer path in the sandbox. |
|
|
26
|
+
| `judge_model` | `openai/gpt-5.4-mini` | OpenAI-compatible model used for binary answer judging. |
|
|
27
|
+
| `judge_base_url` | `https://api.pinference.ai/api/v1` | Judge API base URL. |
|
|
28
|
+
| `judge_api_key_var` | `PRIME_API_KEY` | Env var containing the judge API key. |
|
|
29
|
+
| `judge_sampling_args` | `None` | Extra sampling args for judge calls. |
|
|
30
|
+
|
|
31
|
+
## Output Contract
|
|
32
|
+
|
|
33
|
+
Agents should write one final answer to `/task/answer.txt`. The answer should directly satisfy the question and may include supporting URLs/citations. The judge ignores citation verification and evaluates whether the final response semantically contains the gold answer without contradictions.
|