biblicus 0.15.0__tar.gz → 0.15.1__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (335) hide show
  1. {biblicus-0.15.0/src/biblicus.egg-info → biblicus-0.15.1}/PKG-INFO +11 -4
  2. {biblicus-0.15.0 → biblicus-0.15.1}/README.md +10 -3
  3. {biblicus-0.15.0 → biblicus-0.15.1}/docs/ROADMAP.md +32 -2
  4. {biblicus-0.15.0 → biblicus-0.15.1}/docs/index.rst +9 -0
  5. {biblicus-0.15.0 → biblicus-0.15.1}/pyproject.toml +1 -1
  6. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/__init__.py +1 -1
  7. {biblicus-0.15.0 → biblicus-0.15.1/src/biblicus.egg-info}/PKG-INFO +11 -4
  8. {biblicus-0.15.0 → biblicus-0.15.1}/LICENSE +0 -0
  9. {biblicus-0.15.0 → biblicus-0.15.1}/MANIFEST.in +0 -0
  10. {biblicus-0.15.0 → biblicus-0.15.1}/THIRD_PARTY_NOTICES.md +0 -0
  11. {biblicus-0.15.0 → biblicus-0.15.1}/datasets/extraction_lab/labels.json +0 -0
  12. {biblicus-0.15.0 → biblicus-0.15.1}/datasets/retrieval_lab/labels.json +0 -0
  13. {biblicus-0.15.0 → biblicus-0.15.1}/datasets/wikipedia_mini.json +0 -0
  14. {biblicus-0.15.0 → biblicus-0.15.1}/docs/ANALYSIS.md +0 -0
  15. {biblicus-0.15.0 → biblicus-0.15.1}/docs/ARCHITECTURE.md +0 -0
  16. {biblicus-0.15.0 → biblicus-0.15.1}/docs/ARCHITECTURE_DETAIL.md +0 -0
  17. {biblicus-0.15.0 → biblicus-0.15.1}/docs/BACKENDS.md +0 -0
  18. {biblicus-0.15.0 → biblicus-0.15.1}/docs/CONTEXT_PACK.md +0 -0
  19. {biblicus-0.15.0 → biblicus-0.15.1}/docs/CORPUS.md +0 -0
  20. {biblicus-0.15.0 → biblicus-0.15.1}/docs/CORPUS_DESIGN.md +0 -0
  21. {biblicus-0.15.0 → biblicus-0.15.1}/docs/DEMOS.md +0 -0
  22. {biblicus-0.15.0 → biblicus-0.15.1}/docs/EXTRACTION.md +0 -0
  23. {biblicus-0.15.0 → biblicus-0.15.1}/docs/EXTRACTION_EVALUATION.md +0 -0
  24. {biblicus-0.15.0 → biblicus-0.15.1}/docs/FEATURE_INDEX.md +0 -0
  25. {biblicus-0.15.0 → biblicus-0.15.1}/docs/KNOWLEDGE_BASE.md +0 -0
  26. {biblicus-0.15.0 → biblicus-0.15.1}/docs/MARKOV_ANALYSIS.md +0 -0
  27. {biblicus-0.15.0 → biblicus-0.15.1}/docs/PROFILING.md +0 -0
  28. {biblicus-0.15.0 → biblicus-0.15.1}/docs/PR_FAQ_TEXT_ANNOTATE.md +0 -0
  29. {biblicus-0.15.0 → biblicus-0.15.1}/docs/RETRIEVAL.md +0 -0
  30. {biblicus-0.15.0 → biblicus-0.15.1}/docs/RETRIEVAL_EVALUATION.md +0 -0
  31. {biblicus-0.15.0 → biblicus-0.15.1}/docs/RETRIEVAL_QUALITY.md +0 -0
  32. {biblicus-0.15.0 → biblicus-0.15.1}/docs/STT.md +0 -0
  33. {biblicus-0.15.0 → biblicus-0.15.1}/docs/TESTING.md +0 -0
  34. {biblicus-0.15.0 → biblicus-0.15.1}/docs/TEXT_ANNOTATE.md +0 -0
  35. {biblicus-0.15.0 → biblicus-0.15.1}/docs/TEXT_EXTRACT.md +0 -0
  36. {biblicus-0.15.0 → biblicus-0.15.1}/docs/TEXT_LINK.md +0 -0
  37. {biblicus-0.15.0 → biblicus-0.15.1}/docs/TEXT_REDACT.md +0 -0
  38. {biblicus-0.15.0 → biblicus-0.15.1}/docs/TEXT_SLICE.md +0 -0
  39. {biblicus-0.15.0 → biblicus-0.15.1}/docs/TEXT_UTILITIES.md +0 -0
  40. {biblicus-0.15.0 → biblicus-0.15.1}/docs/TOPIC_MODELING.md +0 -0
  41. {biblicus-0.15.0 → biblicus-0.15.1}/docs/USER_CONFIGURATION.md +0 -0
  42. {biblicus-0.15.0 → biblicus-0.15.1}/docs/USE_CASES.md +0 -0
  43. {biblicus-0.15.0 → biblicus-0.15.1}/docs/UTILITIES.md +0 -0
  44. {biblicus-0.15.0 → biblicus-0.15.1}/docs/api.rst +0 -0
  45. {biblicus-0.15.0 → biblicus-0.15.1}/docs/backends/index.md +0 -0
  46. {biblicus-0.15.0 → biblicus-0.15.1}/docs/backends/scan.md +0 -0
  47. {biblicus-0.15.0 → biblicus-0.15.1}/docs/backends/sqlite-full-text-search.md +0 -0
  48. {biblicus-0.15.0 → biblicus-0.15.1}/docs/backends/vector.md +0 -0
  49. {biblicus-0.15.0 → biblicus-0.15.1}/docs/conf.py +0 -0
  50. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/index.md +0 -0
  51. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/ocr/index.md +0 -0
  52. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/ocr/paddleocr-vl.md +0 -0
  53. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/ocr/rapidocr.md +0 -0
  54. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/pipeline-utilities/index.md +0 -0
  55. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/pipeline-utilities/pipeline.md +0 -0
  56. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/pipeline-utilities/select-longest.md +0 -0
  57. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/pipeline-utilities/select-override.md +0 -0
  58. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/pipeline-utilities/select-smart-override.md +0 -0
  59. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/pipeline-utilities/select-text.md +0 -0
  60. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/speech-to-text/deepgram.md +0 -0
  61. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/speech-to-text/index.md +0 -0
  62. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/speech-to-text/openai.md +0 -0
  63. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/text-document/index.md +0 -0
  64. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/text-document/markitdown.md +0 -0
  65. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/text-document/metadata.md +0 -0
  66. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/text-document/pass-through.md +0 -0
  67. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/text-document/pdf.md +0 -0
  68. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/text-document/unstructured.md +0 -0
  69. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/vlm-document/docling-granite.md +0 -0
  70. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/vlm-document/docling-smol.md +0 -0
  71. {biblicus-0.15.0 → biblicus-0.15.1}/docs/extractors/vlm-document/index.md +0 -0
  72. {biblicus-0.15.0 → biblicus-0.15.1}/docs/use_cases/notes_to_context_pack.md +0 -0
  73. {biblicus-0.15.0 → biblicus-0.15.1}/docs/use_cases/sequence_markov.md +0 -0
  74. {biblicus-0.15.0 → biblicus-0.15.1}/docs/use_cases/text_folder_search.md +0 -0
  75. {biblicus-0.15.0 → biblicus-0.15.1}/docs/use_cases/text_redact.md +0 -0
  76. {biblicus-0.15.0 → biblicus-0.15.1}/features/ai_llm.feature +0 -0
  77. {biblicus-0.15.0 → biblicus-0.15.1}/features/ai_models.feature +0 -0
  78. {biblicus-0.15.0 → biblicus-0.15.1}/features/analysis_schema.feature +0 -0
  79. {biblicus-0.15.0 → biblicus-0.15.1}/features/backend_validation.feature +0 -0
  80. {biblicus-0.15.0 → biblicus-0.15.1}/features/biblicus_corpus.feature +0 -0
  81. {biblicus-0.15.0 → biblicus-0.15.1}/features/cli_entrypoint.feature +0 -0
  82. {biblicus-0.15.0 → biblicus-0.15.1}/features/cli_parsing.feature +0 -0
  83. {biblicus-0.15.0 → biblicus-0.15.1}/features/cli_step_spec_parsing.feature +0 -0
  84. {biblicus-0.15.0 → biblicus-0.15.1}/features/content_sniffing.feature +0 -0
  85. {biblicus-0.15.0 → biblicus-0.15.1}/features/context_pack.feature +0 -0
  86. {biblicus-0.15.0 → biblicus-0.15.1}/features/context_pack_cli.feature +0 -0
  87. {biblicus-0.15.0 → biblicus-0.15.1}/features/context_pack_policies.feature +0 -0
  88. {biblicus-0.15.0 → biblicus-0.15.1}/features/corpus_edge_cases.feature +0 -0
  89. {biblicus-0.15.0 → biblicus-0.15.1}/features/corpus_identity.feature +0 -0
  90. {biblicus-0.15.0 → biblicus-0.15.1}/features/corpus_purge.feature +0 -0
  91. {biblicus-0.15.0 → biblicus-0.15.1}/features/crawl.feature +0 -0
  92. {biblicus-0.15.0 → biblicus-0.15.1}/features/docling_granite_extractor.feature +0 -0
  93. {biblicus-0.15.0 → biblicus-0.15.1}/features/docling_smol_extractor.feature +0 -0
  94. {biblicus-0.15.0 → biblicus-0.15.1}/features/embeddings.feature +0 -0
  95. {biblicus-0.15.0 → biblicus-0.15.1}/features/environment.py +0 -0
  96. {biblicus-0.15.0 → biblicus-0.15.1}/features/error_cases.feature +0 -0
  97. {biblicus-0.15.0 → biblicus-0.15.1}/features/evaluation.feature +0 -0
  98. {biblicus-0.15.0 → biblicus-0.15.1}/features/evidence_processing.feature +0 -0
  99. {biblicus-0.15.0 → biblicus-0.15.1}/features/extraction_error_handling.feature +0 -0
  100. {biblicus-0.15.0 → biblicus-0.15.1}/features/extraction_evaluation.feature +0 -0
  101. {biblicus-0.15.0 → biblicus-0.15.1}/features/extraction_evaluation_lab.feature +0 -0
  102. {biblicus-0.15.0 → biblicus-0.15.1}/features/extraction_run_lifecycle.feature +0 -0
  103. {biblicus-0.15.0 → biblicus-0.15.1}/features/extraction_selection.feature +0 -0
  104. {biblicus-0.15.0 → biblicus-0.15.1}/features/extraction_selection_longest.feature +0 -0
  105. {biblicus-0.15.0 → biblicus-0.15.1}/features/extractor_pipeline.feature +0 -0
  106. {biblicus-0.15.0 → biblicus-0.15.1}/features/extractor_validation.feature +0 -0
  107. {biblicus-0.15.0 → biblicus-0.15.1}/features/frontmatter.feature +0 -0
  108. {biblicus-0.15.0 → biblicus-0.15.1}/features/hook_config_validation.feature +0 -0
  109. {biblicus-0.15.0 → biblicus-0.15.1}/features/hook_error_handling.feature +0 -0
  110. {biblicus-0.15.0 → biblicus-0.15.1}/features/import_tree.feature +0 -0
  111. {biblicus-0.15.0 → biblicus-0.15.1}/features/inference_backend.feature +0 -0
  112. {biblicus-0.15.0 → biblicus-0.15.1}/features/ingest_sources.feature +0 -0
  113. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_audio_samples.feature +0 -0
  114. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_image_samples.feature +0 -0
  115. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_mixed_corpus.feature +0 -0
  116. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_mixed_extraction.feature +0 -0
  117. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_ocr_image_extraction.feature +0 -0
  118. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_pdf_retrieval.feature +0 -0
  119. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_pdf_samples.feature +0 -0
  120. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_text_annotate.feature +0 -0
  121. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_text_extract.feature +0 -0
  122. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_text_link.feature +0 -0
  123. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_text_redact.feature +0 -0
  124. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_text_slice.feature +0 -0
  125. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_unstructured_extraction.feature +0 -0
  126. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_use_cases.feature +0 -0
  127. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_use_cases_sequence_markov.feature +0 -0
  128. {biblicus-0.15.0 → biblicus-0.15.1}/features/integration_wikipedia.feature +0 -0
  129. {biblicus-0.15.0 → biblicus-0.15.1}/features/knowledge_base.feature +0 -0
  130. {biblicus-0.15.0 → biblicus-0.15.1}/features/lifecycle_hooks.feature +0 -0
  131. {biblicus-0.15.0 → biblicus-0.15.1}/features/markitdown_extractor.feature +0 -0
  132. {biblicus-0.15.0 → biblicus-0.15.1}/features/markov_analysis.feature +0 -0
  133. {biblicus-0.15.0 → biblicus-0.15.1}/features/markov_analysis_categorical.feature +0 -0
  134. {biblicus-0.15.0 → biblicus-0.15.1}/features/markov_analysis_llm.feature +0 -0
  135. {biblicus-0.15.0 → biblicus-0.15.1}/features/markov_analysis_topic_modeling.feature +0 -0
  136. {biblicus-0.15.0 → biblicus-0.15.1}/features/markov_analysis_variants.feature +0 -0
  137. {biblicus-0.15.0 → biblicus-0.15.1}/features/markov_internal_branches.feature +0 -0
  138. {biblicus-0.15.0 → biblicus-0.15.1}/features/markov_schema.feature +0 -0
  139. {biblicus-0.15.0 → biblicus-0.15.1}/features/markov_start_end_labels.feature +0 -0
  140. {biblicus-0.15.0 → biblicus-0.15.1}/features/model_validation.feature +0 -0
  141. {biblicus-0.15.0 → biblicus-0.15.1}/features/ocr_extractor.feature +0 -0
  142. {biblicus-0.15.0 → biblicus-0.15.1}/features/paddleocr_vl_extractor.feature +0 -0
  143. {biblicus-0.15.0 → biblicus-0.15.1}/features/paddleocr_vl_parse_api_response.feature +0 -0
  144. {biblicus-0.15.0 → biblicus-0.15.1}/features/pdf_text_extraction.feature +0 -0
  145. {biblicus-0.15.0 → biblicus-0.15.1}/features/profiling.feature +0 -0
  146. {biblicus-0.15.0 → biblicus-0.15.1}/features/profiling_config_overrides.feature +0 -0
  147. {biblicus-0.15.0 → biblicus-0.15.1}/features/python_api.feature +0 -0
  148. {biblicus-0.15.0 → biblicus-0.15.1}/features/python_hook_logging.feature +0 -0
  149. {biblicus-0.15.0 → biblicus-0.15.1}/features/query_processing.feature +0 -0
  150. {biblicus-0.15.0 → biblicus-0.15.1}/features/recipe_cascading.feature +0 -0
  151. {biblicus-0.15.0 → biblicus-0.15.1}/features/recipe_file_extraction.feature +0 -0
  152. {biblicus-0.15.0 → biblicus-0.15.1}/features/recipe_utilities.feature +0 -0
  153. {biblicus-0.15.0 → biblicus-0.15.1}/features/retrieval_budget.feature +0 -0
  154. {biblicus-0.15.0 → biblicus-0.15.1}/features/retrieval_evaluation_lab.feature +0 -0
  155. {biblicus-0.15.0 → biblicus-0.15.1}/features/retrieval_quality.feature +0 -0
  156. {biblicus-0.15.0 → biblicus-0.15.1}/features/retrieval_scan.feature +0 -0
  157. {biblicus-0.15.0 → biblicus-0.15.1}/features/retrieval_sqlite_full_text_search.feature +0 -0
  158. {biblicus-0.15.0 → biblicus-0.15.1}/features/retrieval_uses_extraction_run.feature +0 -0
  159. {biblicus-0.15.0 → biblicus-0.15.1}/features/retrieval_utilities.feature +0 -0
  160. {biblicus-0.15.0 → biblicus-0.15.1}/features/select_override.feature +0 -0
  161. {biblicus-0.15.0 → biblicus-0.15.1}/features/smart_override_selection.feature +0 -0
  162. {biblicus-0.15.0 → biblicus-0.15.1}/features/source_loading.feature +0 -0
  163. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/ai_llm_steps.py +0 -0
  164. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/ai_models_steps.py +0 -0
  165. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/analysis_steps.py +0 -0
  166. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/backend_steps.py +0 -0
  167. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/cli_parsing_steps.py +0 -0
  168. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/cli_steps.py +0 -0
  169. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/context_pack_steps.py +0 -0
  170. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/crawl_steps.py +0 -0
  171. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/deepgram_steps.py +0 -0
  172. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/docling_steps.py +0 -0
  173. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/embeddings_steps.py +0 -0
  174. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/evidence_processing_steps.py +0 -0
  175. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/extraction_evaluation_lab_steps.py +0 -0
  176. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/extraction_evaluation_steps.py +0 -0
  177. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/extraction_run_lifecycle_steps.py +0 -0
  178. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/extraction_steps.py +0 -0
  179. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/extractor_steps.py +0 -0
  180. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/frontmatter_steps.py +0 -0
  181. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/inference_steps.py +0 -0
  182. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/knowledge_base_steps.py +0 -0
  183. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/markitdown_steps.py +0 -0
  184. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/markov_internal_steps.py +0 -0
  185. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/markov_schema_steps.py +0 -0
  186. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/markov_start_end_steps.py +0 -0
  187. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/markov_steps.py +0 -0
  188. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/model_steps.py +0 -0
  189. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/openai_steps.py +0 -0
  190. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/paddleocr_mock_steps.py +0 -0
  191. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/paddleocr_vl_steps.py +0 -0
  192. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/paddleocr_vl_unit_steps.py +0 -0
  193. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/pdf_steps.py +0 -0
  194. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/profiling_steps.py +0 -0
  195. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/python_api_steps.py +0 -0
  196. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/rapidocr_steps.py +0 -0
  197. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/recipe_steps.py +0 -0
  198. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/requests_mock_steps.py +0 -0
  199. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/retrieval_evaluation_lab_steps.py +0 -0
  200. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/retrieval_quality_steps.py +0 -0
  201. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/retrieval_steps.py +0 -0
  202. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/stt_deepgram_steps.py +0 -0
  203. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/stt_steps.py +0 -0
  204. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/text_annotate_steps.py +0 -0
  205. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/text_extract_steps.py +0 -0
  206. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/text_internal_steps.py +0 -0
  207. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/text_link_internal_steps.py +0 -0
  208. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/text_link_steps.py +0 -0
  209. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/text_mock_steps.py +0 -0
  210. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/text_redact_steps.py +0 -0
  211. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/text_slice_steps.py +0 -0
  212. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/text_tool_loop_steps.py +0 -0
  213. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/topic_modeling_steps.py +0 -0
  214. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/unstructured_steps.py +0 -0
  215. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/use_cases_steps.py +0 -0
  216. {biblicus-0.15.0 → biblicus-0.15.1}/features/steps/user_config_steps.py +0 -0
  217. {biblicus-0.15.0 → biblicus-0.15.1}/features/streaming_ingest.feature +0 -0
  218. {biblicus-0.15.0 → biblicus-0.15.1}/features/stt_deepgram_extractor.feature +0 -0
  219. {biblicus-0.15.0 → biblicus-0.15.1}/features/stt_extractor.feature +0 -0
  220. {biblicus-0.15.0 → biblicus-0.15.1}/features/text_annotate.feature +0 -0
  221. {biblicus-0.15.0 → biblicus-0.15.1}/features/text_extract.feature +0 -0
  222. {biblicus-0.15.0 → biblicus-0.15.1}/features/text_extraction_runs.feature +0 -0
  223. {biblicus-0.15.0 → biblicus-0.15.1}/features/text_internal_branches.feature +0 -0
  224. {biblicus-0.15.0 → biblicus-0.15.1}/features/text_link.feature +0 -0
  225. {biblicus-0.15.0 → biblicus-0.15.1}/features/text_link_internal_branches.feature +0 -0
  226. {biblicus-0.15.0 → biblicus-0.15.1}/features/text_mock.feature +0 -0
  227. {biblicus-0.15.0 → biblicus-0.15.1}/features/text_redact.feature +0 -0
  228. {biblicus-0.15.0 → biblicus-0.15.1}/features/text_slice.feature +0 -0
  229. {biblicus-0.15.0 → biblicus-0.15.1}/features/text_utilities.feature +0 -0
  230. {biblicus-0.15.0 → biblicus-0.15.1}/features/token_budget.feature +0 -0
  231. {biblicus-0.15.0 → biblicus-0.15.1}/features/topic_modeling.feature +0 -0
  232. {biblicus-0.15.0 → biblicus-0.15.1}/features/unstructured_extractor.feature +0 -0
  233. {biblicus-0.15.0 → biblicus-0.15.1}/features/use_cases.feature +0 -0
  234. {biblicus-0.15.0 → biblicus-0.15.1}/features/user_config.feature +0 -0
  235. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/download_ag_news.py +0 -0
  236. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/download_audio_samples.py +0 -0
  237. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/download_image_samples.py +0 -0
  238. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/download_mixed_samples.py +0 -0
  239. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/download_pdf_samples.py +0 -0
  240. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/download_wikipedia.py +0 -0
  241. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/extraction_evaluation_demo.py +0 -0
  242. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/extraction_evaluation_lab.py +0 -0
  243. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/markov_analysis_demo.py +0 -0
  244. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/markov_cached_segments_demo.py +0 -0
  245. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/markov_run_report.py +0 -0
  246. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/profiling_demo.py +0 -0
  247. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/readme_end_to_end_demo.py +0 -0
  248. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/retrieval_evaluation_lab.py +0 -0
  249. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/test.py +0 -0
  250. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/topic_modeling_integration.py +0 -0
  251. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/use_cases/notes_to_context_pack_demo.py +0 -0
  252. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/use_cases/sequence_markov_demo.py +0 -0
  253. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/use_cases/text_folder_search_demo.py +0 -0
  254. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/use_cases/text_redact_demo.py +0 -0
  255. {biblicus-0.15.0 → biblicus-0.15.1}/scripts/wikipedia_rag_demo.py +0 -0
  256. {biblicus-0.15.0 → biblicus-0.15.1}/setup.cfg +0 -0
  257. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/__main__.py +0 -0
  258. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/_vendor/dotyaml/__init__.py +0 -0
  259. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/_vendor/dotyaml/interpolation.py +0 -0
  260. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/_vendor/dotyaml/loader.py +0 -0
  261. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/_vendor/dotyaml/transformer.py +0 -0
  262. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/ai/__init__.py +0 -0
  263. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/ai/embeddings.py +0 -0
  264. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/ai/llm.py +0 -0
  265. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/ai/models.py +0 -0
  266. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/analysis/__init__.py +0 -0
  267. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/analysis/base.py +0 -0
  268. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/analysis/markov.py +0 -0
  269. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/analysis/models.py +0 -0
  270. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/analysis/profiling.py +0 -0
  271. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/analysis/schema.py +0 -0
  272. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/analysis/topic_modeling.py +0 -0
  273. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/backends/__init__.py +0 -0
  274. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/backends/base.py +0 -0
  275. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/backends/hybrid.py +0 -0
  276. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/backends/scan.py +0 -0
  277. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/backends/sqlite_full_text_search.py +0 -0
  278. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/backends/vector.py +0 -0
  279. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/cli.py +0 -0
  280. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/constants.py +0 -0
  281. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/context.py +0 -0
  282. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/corpus.py +0 -0
  283. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/crawl.py +0 -0
  284. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/errors.py +0 -0
  285. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/evaluation.py +0 -0
  286. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/evidence_processing.py +0 -0
  287. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extraction.py +0 -0
  288. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extraction_evaluation.py +0 -0
  289. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/__init__.py +0 -0
  290. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/base.py +0 -0
  291. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/deepgram_stt.py +0 -0
  292. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/docling_granite_text.py +0 -0
  293. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/docling_smol_text.py +0 -0
  294. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/markitdown_text.py +0 -0
  295. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/metadata_text.py +0 -0
  296. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/openai_stt.py +0 -0
  297. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/paddleocr_vl_text.py +0 -0
  298. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/pass_through_text.py +0 -0
  299. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/pdf_text.py +0 -0
  300. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/pipeline.py +0 -0
  301. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/rapidocr_text.py +0 -0
  302. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/select_longest_text.py +0 -0
  303. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/select_override.py +0 -0
  304. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/select_smart_override.py +0 -0
  305. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/select_text.py +0 -0
  306. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/extractors/unstructured_text.py +0 -0
  307. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/frontmatter.py +0 -0
  308. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/hook_logging.py +0 -0
  309. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/hook_manager.py +0 -0
  310. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/hooks.py +0 -0
  311. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/ignore.py +0 -0
  312. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/inference.py +0 -0
  313. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/knowledge_base.py +0 -0
  314. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/models.py +0 -0
  315. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/recipes.py +0 -0
  316. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/retrieval.py +0 -0
  317. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/sources.py +0 -0
  318. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/text/__init__.py +0 -0
  319. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/text/annotate.py +0 -0
  320. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/text/extract.py +0 -0
  321. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/text/link.py +0 -0
  322. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/text/markup.py +0 -0
  323. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/text/models.py +0 -0
  324. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/text/prompts.py +0 -0
  325. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/text/redact.py +0 -0
  326. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/text/slice.py +0 -0
  327. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/text/tool_loop.py +0 -0
  328. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/time.py +0 -0
  329. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/uris.py +0 -0
  330. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus/user_config.py +0 -0
  331. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus.egg-info/SOURCES.txt +0 -0
  332. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus.egg-info/dependency_links.txt +0 -0
  333. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus.egg-info/entry_points.txt +0 -0
  334. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus.egg-info/requires.txt +0 -0
  335. {biblicus-0.15.0 → biblicus-0.15.1}/src/biblicus.egg-info/top_level.txt +0 -0
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: biblicus
3
- Version: 0.15.0
3
+ Version: 0.15.1
4
4
  Summary: Command line interface and Python library for corpus ingestion, retrieval, and evaluation.
5
5
  License: MIT
6
6
  Requires-Python: >=3.9
@@ -57,8 +57,15 @@ Dynamic: license-file
57
57
  ![Coverage][coverage-badge]
58
58
  ![Documentation][documentation-badge]
59
59
 
60
- Make your documents usable by your assistant, then decide later how you will search and retrieve them.
61
-
60
+ <p>
61
+ <img
62
+ src="docs/_static/Biblicus-logo.png"
63
+ alt="Biblicus logo"
64
+ align="right"
65
+ width="216"
66
+ />
67
+ Make your documents usable by your assistant, then decide later how you will search and retrieve them.
68
+ </p>
62
69
  If you are building an assistant in Python, you probably have material you want it to use: notes, documents, web pages, and reference files. A common approach is retrieval augmented generation, where a system retrieves relevant material and uses it as evidence when generating a response.
63
70
 
64
71
  The first practical problem is not retrieval. It is collection and care. You need a stable place to put raw items, you need a small amount of metadata so you can find them again, and you need a way to evolve your retrieval approach over time without rewriting ingestion.
@@ -538,7 +545,7 @@ Three backends are included.
538
545
 
539
546
  - `scan` is a minimal baseline that scans raw items directly.
540
547
  - `sqlite-full-text-search` is a practical baseline that builds a full text search index in SQLite.
541
- - `vector` is a deterministic term-frequency vector baseline with cosine similarity scoring.
548
+ - `tf-vector` is a deterministic term-frequency vector baseline with cosine similarity scoring.
542
549
 
543
550
  For detailed documentation including configuration options, performance characteristics, and usage examples, see the [Backend Reference][backend-reference].
544
551
 
@@ -4,8 +4,15 @@
4
4
  ![Coverage][coverage-badge]
5
5
  ![Documentation][documentation-badge]
6
6
 
7
- Make your documents usable by your assistant, then decide later how you will search and retrieve them.
8
-
7
+ <p>
8
+ <img
9
+ src="docs/_static/Biblicus-logo.png"
10
+ alt="Biblicus logo"
11
+ align="right"
12
+ width="216"
13
+ />
14
+ Make your documents usable by your assistant, then decide later how you will search and retrieve them.
15
+ </p>
9
16
  If you are building an assistant in Python, you probably have material you want it to use: notes, documents, web pages, and reference files. A common approach is retrieval augmented generation, where a system retrieves relevant material and uses it as evidence when generating a response.
10
17
 
11
18
  The first practical problem is not retrieval. It is collection and care. You need a stable place to put raw items, you need a small amount of metadata so you can find them again, and you need a way to evolve your retrieval approach over time without rewriting ingestion.
@@ -485,7 +492,7 @@ Three backends are included.
485
492
 
486
493
  - `scan` is a minimal baseline that scans raw items directly.
487
494
  - `sqlite-full-text-search` is a practical baseline that builds a full text search index in SQLite.
488
- - `vector` is a deterministic term-frequency vector baseline with cosine similarity scoring.
495
+ - `tf-vector` is a deterministic term-frequency vector baseline with cosine similarity scoring.
489
496
 
490
497
  For detailed documentation including configuration options, performance characteristics, and usage examples, see the [Backend Reference][backend-reference].
491
498
 
@@ -52,13 +52,13 @@ Lightweight analysis utilities summarize corpus themes and guide curation:
52
52
  - Topic modeling with BERTopic and optional LLM-assisted labeling.
53
53
  - Side-by-side analysis outputs stored under the corpus for reproducible comparison.
54
54
 
55
- ## Next: sequence analysis (hidden Markov models)
55
+ ### Sequence analysis (Markov analysis)
56
56
 
57
57
  Goal: provide a sequence-oriented analysis backend for corpora where order matters (conversations, timelines, logs).
58
58
 
59
59
  Deliverables:
60
60
 
61
- - Hidden Markov modeling analysis for sequence-driven corpora.
61
+ - Markov analysis for sequence-driven corpora (including hidden Markov models where appropriate).
62
62
  - A report format that explains state transitions and emissions with evidence.
63
63
  - Evaluation guidance for comparing HMM outputs across corpora or snapshots.
64
64
 
@@ -67,6 +67,36 @@ Acceptance checks:
67
67
  - HMM analysis is reproducible for the same corpus state and extraction run.
68
68
  - Reports are exportable and readable without custom tooling.
69
69
 
70
+ ### Text utilities
71
+
72
+ Small, reusable building blocks for transforming text in ways that are hard to do reliably with one-shot generation.
73
+
74
+ Deliverables:
75
+
76
+ - Text extraction and slicing utilities that operate via a virtual file editing tool loop.
77
+ - Optional higher-level utilities built on the same pattern (annotation, linking, redaction).
78
+ - Documentation and runnable demos that show the mechanism and how to use each utility.
79
+
80
+ Acceptance checks:
81
+
82
+ - Utilities have end-to-end behavior specifications and are fully covered by tests.
83
+ - Integration tests can be run against real model APIs when configured.
84
+
85
+ ## Next: Tactus integration
86
+
87
+ Goal: make Biblicus usable from durable agent workflows without baking assistant logic into Biblicus itself.
88
+
89
+ Deliverables:
90
+
91
+ - A Model Context Protocol (MCP) toolset surface for Biblicus (ingest, query, stats, and evidence retrieval).
92
+ - Clear dependency wiring for secrets and network access (in-sandbox vs brokered).
93
+ - One reference procedure demonstrating retrieval-augmented generation built on Biblicus evidence outputs.
94
+
95
+ Acceptance checks:
96
+
97
+ - Tools expose evidence-first outputs with stable schemas.
98
+ - Procedures remain in control of prompting and context budgeting policy.
99
+
70
100
  ## Later: alternate backends and hosting modes
71
101
 
72
102
  Goal: broaden the backend surface while keeping the core predictable.
@@ -1,6 +1,12 @@
1
1
  Biblicus
2
2
  ========
3
3
 
4
+ .. image:: _static/Biblicus-logo.png
5
+ :alt: Biblicus logo
6
+ :align: right
7
+ :width: 216
8
+ :class: docs-logo
9
+
4
10
  You have a folder full of files. Nobody knows what is in it. Someone wants answers anyway.
5
11
 
6
12
  Biblicus is a Python toolkit that turns unstructured data into something you can manage, search,
@@ -146,6 +152,8 @@ cover baseline retrieval, hybrid strategies, and how to evaluate retrieval quali
146
152
  RETRIEVAL
147
153
  RETRIEVAL_QUALITY
148
154
  RETRIEVAL_EVALUATION
155
+ EMBEDDING_RETRIEVAL
156
+ CHUNKING
149
157
 
150
158
  Analysis and Modeling
151
159
  ---------------------
@@ -210,4 +218,5 @@ implementation details or when you want a catalog of features.
210
218
  ARCHITECTURE
211
219
  ARCHITECTURE_DETAIL
212
220
  PR_FAQ_TEXT_ANNOTATE
221
+ PR_FAQ_EMBEDDING_RETRIEVAL
213
222
  api
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
4
4
 
5
5
  [project]
6
6
  name = "biblicus"
7
- version = "0.15.0"
7
+ version = "0.15.1"
8
8
  description = "Command line interface and Python library for corpus ingestion, retrieval, and evaluation."
9
9
  readme = "README.md"
10
10
  requires-python = ">=3.9"
@@ -27,4 +27,4 @@ __all__ = [
27
27
  "RetrievalRun",
28
28
  ]
29
29
 
30
- __version__ = "0.15.0"
30
+ __version__ = "0.15.1"
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: biblicus
3
- Version: 0.15.0
3
+ Version: 0.15.1
4
4
  Summary: Command line interface and Python library for corpus ingestion, retrieval, and evaluation.
5
5
  License: MIT
6
6
  Requires-Python: >=3.9
@@ -57,8 +57,15 @@ Dynamic: license-file
57
57
  ![Coverage][coverage-badge]
58
58
  ![Documentation][documentation-badge]
59
59
 
60
- Make your documents usable by your assistant, then decide later how you will search and retrieve them.
61
-
60
+ <p>
61
+ <img
62
+ src="docs/_static/Biblicus-logo.png"
63
+ alt="Biblicus logo"
64
+ align="right"
65
+ width="216"
66
+ />
67
+ Make your documents usable by your assistant, then decide later how you will search and retrieve them.
68
+ </p>
62
69
  If you are building an assistant in Python, you probably have material you want it to use: notes, documents, web pages, and reference files. A common approach is retrieval augmented generation, where a system retrieves relevant material and uses it as evidence when generating a response.
63
70
 
64
71
  The first practical problem is not retrieval. It is collection and care. You need a stable place to put raw items, you need a small amount of metadata so you can find them again, and you need a way to evolve your retrieval approach over time without rewriting ingestion.
@@ -538,7 +545,7 @@ Three backends are included.
538
545
 
539
546
  - `scan` is a minimal baseline that scans raw items directly.
540
547
  - `sqlite-full-text-search` is a practical baseline that builds a full text search index in SQLite.
541
- - `vector` is a deterministic term-frequency vector baseline with cosine similarity scoring.
548
+ - `tf-vector` is a deterministic term-frequency vector baseline with cosine similarity scoring.
542
549
 
543
550
  For detailed documentation including configuration options, performance characteristics, and usage examples, see the [Backend Reference][backend-reference].
544
551
 
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes