biblicus 1.1.0__py3-none-any.whl → 1.1.1__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- biblicus/__init__.py +1 -1
- {biblicus-1.1.0.dist-info → biblicus-1.1.1.dist-info}/METADATA +24 -24
- {biblicus-1.1.0.dist-info → biblicus-1.1.1.dist-info}/RECORD +7 -7
- {biblicus-1.1.0.dist-info → biblicus-1.1.1.dist-info}/WHEEL +0 -0
- {biblicus-1.1.0.dist-info → biblicus-1.1.1.dist-info}/entry_points.txt +0 -0
- {biblicus-1.1.0.dist-info → biblicus-1.1.1.dist-info}/licenses/LICENSE +0 -0
- {biblicus-1.1.0.dist-info → biblicus-1.1.1.dist-info}/top_level.txt +0 -0
biblicus/__init__.py
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.4
|
|
2
2
|
Name: biblicus
|
|
3
|
-
Version: 1.1.
|
|
3
|
+
Version: 1.1.1
|
|
4
4
|
Summary: Command line interface and Python library for corpus ingestion, retrieval, and evaluation.
|
|
5
5
|
License: MIT
|
|
6
6
|
Requires-Python: >=3.9
|
|
@@ -82,8 +82,8 @@ See [retrieval augmented generation overview] for a short introduction to the id
|
|
|
82
82
|
- `biblicus analyze markov` learns a directed, weighted state transition graph over segmented text.
|
|
83
83
|
- YAML configurations support cascading composition plus dotted `--config key=value` overrides.
|
|
84
84
|
- Text extract splits long texts with an LLM by inserting XML tags in-place for structured spans.
|
|
85
|
-
- See `docs/
|
|
86
|
-
- See `docs/
|
|
85
|
+
- See `docs/markov-analysis.md` for Markov analysis details and runnable demos.
|
|
86
|
+
- See `docs/text-extract.md` for the text extract utility and examples.
|
|
87
87
|
|
|
88
88
|
## Start with a knowledge base
|
|
89
89
|
|
|
@@ -552,9 +552,9 @@ For detailed documentation including configuration options, performance characte
|
|
|
552
552
|
|
|
553
553
|
## Retrieval documentation
|
|
554
554
|
|
|
555
|
-
For the retrieval pipeline overview and snapshot artifacts, see `docs/
|
|
556
|
-
(tuned lexical baseline, reranking, hybrid retrieval), see `docs/
|
|
557
|
-
and dataset formats, see `docs/
|
|
555
|
+
For the retrieval pipeline overview and snapshot artifacts, see `docs/retrieval.md`. For retrieval quality upgrades
|
|
556
|
+
(tuned lexical baseline, reranking, hybrid retrieval), see `docs/retrieval-quality.md`. For evaluation workflows
|
|
557
|
+
and dataset formats, see `docs/retrieval-evaluation.md`. For a runnable walkthrough, use the retrieval evaluation lab
|
|
558
558
|
script (`scripts/retrieval_evaluation_lab.py`).
|
|
559
559
|
|
|
560
560
|
## Extraction backends
|
|
@@ -594,7 +594,7 @@ These extractors are built in. Optional ones require extra dependencies. See [te
|
|
|
594
594
|
For detailed documentation on all extractors, see the [Extractor Reference][extractor-reference].
|
|
595
595
|
|
|
596
596
|
For extraction evaluation workflows, dataset formats, and report interpretation, see
|
|
597
|
-
`docs/
|
|
597
|
+
`docs/extraction-evaluation.md`.
|
|
598
598
|
|
|
599
599
|
## Text extract utility
|
|
600
600
|
|
|
@@ -602,14 +602,14 @@ Text extract is a reusable analysis utility that lets a model insert XML tags in
|
|
|
602
602
|
entire document. It returns structured spans and the marked-up text, and it is used as a segmentation option in Markov
|
|
603
603
|
analysis.
|
|
604
604
|
|
|
605
|
-
See `docs/
|
|
605
|
+
See `docs/text-extract.md` for the utility API and examples, and `docs/markov-analysis.md` for the Markov integration.
|
|
606
606
|
|
|
607
607
|
## Text slice utility
|
|
608
608
|
|
|
609
609
|
Text slice is a reusable analysis utility that lets a model insert `<slice/>` markers into a long text without
|
|
610
610
|
re-emitting the entire document. It returns ordered slices and the marked-up text for auditing and reuse.
|
|
611
611
|
|
|
612
|
-
See `docs/
|
|
612
|
+
See `docs/text-slice.md` for the utility API and examples.
|
|
613
613
|
|
|
614
614
|
## Topic modeling analysis
|
|
615
615
|
|
|
@@ -618,8 +618,8 @@ are the first analysis backends. Profiling summarizes corpus composition and ext
|
|
|
618
618
|
an extraction snapshot, optionally applies an LLM-driven extraction pass, applies lexical processing, runs BERTopic, and
|
|
619
619
|
optionally applies an LLM fine-tuning pass to label topics. The output is structured JavaScript Object Notation.
|
|
620
620
|
|
|
621
|
-
See `docs/
|
|
622
|
-
`docs/
|
|
621
|
+
See `docs/analysis.md` for the analysis pipeline overview, `docs/profiling.md` for profiling, and
|
|
622
|
+
`docs/topic-modeling.md` for topic modeling details.
|
|
623
623
|
|
|
624
624
|
Run a topic analysis using a configuration file:
|
|
625
625
|
|
|
@@ -668,7 +668,7 @@ For a repeatable, real-world integration run that downloads AG News and executes
|
|
|
668
668
|
python scripts/topic_modeling_integration.py --corpus corpora/ag_news_demo --force
|
|
669
669
|
```
|
|
670
670
|
|
|
671
|
-
See `docs/
|
|
671
|
+
See `docs/topic-modeling.md` for parameter examples and per-topic output behavior.
|
|
672
672
|
|
|
673
673
|
## Integration corpus and evaluation dataset
|
|
674
674
|
|
|
@@ -726,20 +726,20 @@ Open `http://localhost:8000` in your browser.
|
|
|
726
726
|
License terms are in `LICENSE`.
|
|
727
727
|
|
|
728
728
|
[retrieval augmented generation overview]: https://en.wikipedia.org/wiki/Retrieval-augmented_generation
|
|
729
|
-
[architecture]: docs/
|
|
730
|
-
[roadmap]: docs/
|
|
731
|
-
[feature-index]: docs/
|
|
732
|
-
[corpus]: docs/
|
|
733
|
-
[knowledge-base]: docs/
|
|
734
|
-
[text-extraction]: docs/
|
|
729
|
+
[architecture]: docs/architecture.md
|
|
730
|
+
[roadmap]: docs/roadmap.md
|
|
731
|
+
[feature-index]: docs/feature-index.md
|
|
732
|
+
[corpus]: docs/corpus.md
|
|
733
|
+
[knowledge-base]: docs/knowledge-base.md
|
|
734
|
+
[text-extraction]: docs/extraction.md
|
|
735
735
|
[extractor-reference]: docs/extractors/index.md
|
|
736
736
|
[backend-reference]: docs/backends/index.md
|
|
737
|
-
[speech-to-text]: docs/
|
|
738
|
-
[user-configuration]: docs/
|
|
739
|
-
[backends]: docs/
|
|
740
|
-
[context-packs]: docs/
|
|
741
|
-
[demos]: docs/
|
|
742
|
-
[testing]: docs/
|
|
737
|
+
[speech-to-text]: docs/stt.md
|
|
738
|
+
[user-configuration]: docs/user-configuration.md
|
|
739
|
+
[backends]: docs/backends.md
|
|
740
|
+
[context-packs]: docs/context-pack.md
|
|
741
|
+
[demos]: docs/demos.md
|
|
742
|
+
[testing]: docs/testing.md
|
|
743
743
|
|
|
744
744
|
[continuous-integration-badge]: https://github.com/AnthusAI/Biblicus/actions/workflows/ci.yml/badge.svg?branch=main
|
|
745
745
|
[coverage-badge]: https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/AnthusAI/Biblicus/main/coverage_badge.json
|
|
@@ -1,4 +1,4 @@
|
|
|
1
|
-
biblicus/__init__.py,sha256=
|
|
1
|
+
biblicus/__init__.py,sha256=KGQ2mjQRe9i8OyE25LZHJAG5jg_fDKOiWMTOprp-NPc,1013
|
|
2
2
|
biblicus/__main__.py,sha256=ipfkUoTlocVnrQDM69C7TeBqQxmHVeiWMRaT3G9rtnk,117
|
|
3
3
|
biblicus/chunking.py,sha256=GdJr0skAAI0Su99mr7dXqCgR7eJ0sJu8n2XesVGyddY,13206
|
|
4
4
|
biblicus/cli.py,sha256=GN7L0-s0k9tAj_lthvBrJlfo_DG9y53vYc6k_IhSea0,45797
|
|
@@ -83,9 +83,9 @@ biblicus/text/prompts.py,sha256=9dx1cWpJb6oBY4AhDHxlkRUYs7DfbySH0gb-uBTNvtk,7567
|
|
|
83
83
|
biblicus/text/redact.py,sha256=tkDRmA0VvOZwMryEmBPLEHf3Z6VHJkkaWjBaNIMyGZ0,8415
|
|
84
84
|
biblicus/text/slice.py,sha256=dlHxGO8c5P8BszXGwlNQoQ-cyWjJf6PfS1LUBJXXGEE,5762
|
|
85
85
|
biblicus/text/tool_loop.py,sha256=dFeIEcCUA-yR8GMqsJ_n4007fHVmn9zK2hhlm6NlWyg,14161
|
|
86
|
-
biblicus-1.1.
|
|
87
|
-
biblicus-1.1.
|
|
88
|
-
biblicus-1.1.
|
|
89
|
-
biblicus-1.1.
|
|
90
|
-
biblicus-1.1.
|
|
91
|
-
biblicus-1.1.
|
|
86
|
+
biblicus-1.1.1.dist-info/licenses/LICENSE,sha256=lw44GXFG_Q0fS8m5VoEvv_xtdBXK26pBcbSPUCXee_Q,1078
|
|
87
|
+
biblicus-1.1.1.dist-info/METADATA,sha256=WtfRn0ON4GjFGAHca6TFcjt-tpdMUGz5uOR4-ePJ_ys,31202
|
|
88
|
+
biblicus-1.1.1.dist-info/WHEEL,sha256=wUyA8OaulRlbfwMtmQsvNngGrxQHAvkKcvRmdizlJi0,92
|
|
89
|
+
biblicus-1.1.1.dist-info/entry_points.txt,sha256=BZmO4H8Uz00fyi1RAFryOCGfZgX7eHWkY2NE-G54U5A,47
|
|
90
|
+
biblicus-1.1.1.dist-info/top_level.txt,sha256=sUD_XVZwDxZ29-FBv1MknTGh4mgDXznGuP28KJY_WKc,9
|
|
91
|
+
biblicus-1.1.1.dist-info/RECORD,,
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|