deepresearch-flow 0.1.1__py3-none-any.whl → 0.2.0__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: deepresearch-flow
3
- Version: 0.1.1
3
+ Version: 0.2.0
4
4
  Summary: Workflow tools for paper extraction, review, and research automation.
5
5
  Author-email: DengQi <dengqi935@gmail.com>
6
6
  License: MIT License
@@ -49,6 +49,7 @@ Requires-Dist: jinja2>=3.1.3
49
49
  Requires-Dist: json-repair>=0.31.0
50
50
  Requires-Dist: jsonschema>=4.21.1
51
51
  Requires-Dist: markdown-it-py>=3.0.0
52
+ Requires-Dist: pypdf>=3.0.0
52
53
  Requires-Dist: pybtex>=0.24.0
53
54
  Requires-Dist: rich>=13.7.1
54
55
  Requires-Dist: starlette>=0.37.2
@@ -152,6 +153,7 @@ Incremental behavior:
152
153
  - Use `--retry-failed` to retry only failed documents listed in `paper_errors.json`.
153
154
  - Use `--verbose` for detailed logs alongside progress bars.
154
155
  - Extract-time rendering defaults to the same built-in template as `--prompt-template`.
156
+ - Output JSON is written as `{"template_tag": "...", "papers": [...]}`.
155
157
  - A summary table prints input/prompt/output character totals, token estimates, and throughput after each run.
156
158
  - Progress bars include a live prompt/completion/total token ticker.
157
159
 
@@ -207,18 +209,23 @@ Render outputs, compute stats, and serve a local web UI over paper JSON.
207
209
 
208
210
  JSON input formats:
209
211
 
210
- - For `db render-md`, `db statistics`, `db filter`, and `db generate-tags`, the input is the aggregated JSON list.
212
+ - For `db render-md`, `db statistics`, `db filter`, and `db generate-tags`, the input can be either an aggregated JSON list or `{"template_tag": "...", "papers": [...]}` (the commands operate on `papers`).
211
213
  - For `db serve`, each input JSON must be an object: `{"template_tag": "simple", "papers": [...]}`.
212
- When `template_tag` is missing, the server attempts to infer it as a fallback.
214
+ When `template_tag` is missing, the server attempts to infer it as a fallback (legacy list-only inputs are rejected).
213
215
 
214
216
  Web UI highlights:
215
217
 
216
218
  - Summary/Source/PDF/PDF Viewer views with tab navigation.
217
219
  - Split view: choose left/right panes independently (summary/source/pdf/pdf viewer) via URL params.
218
- - Summary view includes a collapsible outline panel (top-left) and a back-to-top control (bottom-left).
220
+ - Summary/Source views include a collapsible outline panel (top-left) and a back-to-top control (bottom-left).
219
221
  - Summary template dropdown shows only available templates per paper.
220
- - Source view renders Markdown and supports embedded HTML tables plus `data:image/...;base64` `<img>` tags.
222
+ - Homepage filters: PDF/Source/Summary availability and template tags, plus a filter syntax input (`tmpl:...`, `has:pdf`, `no:source`).
223
+ - Homepage stats: total and filtered counts for PDF/Source/Summary plus per-template totals.
224
+ - Stats page includes keyword frequency charts.
225
+ - Source view renders Markdown and supports embedded HTML tables plus `data:image/...;base64` `<img>` tags (images are constrained to the content width).
221
226
  - PDF Viewer is served locally (PDF.js viewer assets) to avoid cross-origin issues with local PDFs.
227
+ - PDF-only entries are surfaced for unmatched PDFs under `--pdf-root` (metadata title if available, otherwise filename), with badges and detail warnings.
228
+ - PDF-only entries are excluded from stats counts.
222
229
  - Merge behavior for multi-input serve: title similarity (>= 0.95), preferring `bibtex.fields.title` and falling back to `paper_title`.
223
230
  - Cache merged inputs with `--cache-dir`; bypass with `--no-cache`.
224
231
 
@@ -250,6 +257,7 @@ deepresearch-flow paper db filter \
250
257
  deepresearch-flow paper db statistics \
251
258
  --input paper_infos.json \
252
259
  --top-n 20
260
+ # Statistics also include keyword frequency (normalized to lowercase)
253
261
 
254
262
  # Serve a local read-only web UI (loads charts/libs via CDN)
255
263
  deepresearch-flow paper db serve \
@@ -278,7 +286,7 @@ Web search syntax (Scholar-style):
278
286
  - Quoted phrases: `title:"nearest neighbor"`
279
287
  - OR: `fpga OR asic`
280
288
  - Negation: `-survey` or `-tag:survey`
281
- - Fields: `title:`, `author:`, `tag:`, `venue:`, `year:`, `month:`
289
+ - Fields: `title:`, `author:`, `tag:`, `venue:`, `year:`, `month:` (content tags only)
282
290
  - Year range: `year:2020..2024`
283
291
 
284
292
  Other database helpers:
@@ -1,10 +1,11 @@
1
1
  deepresearch_flow/__init__.py,sha256=rjP9ES4zJCfEN_MCDYAYPL1mNJZGjojdmbRwnZ9FlEk,83
2
+ deepresearch_flow/__main__.py,sha256=Ceo0rMTOhHhwFPD-HyDDagenNsmWEzPmsdYLI7kwKVA,115
2
3
  deepresearch_flow/cli.py,sha256=WhPhs-Cg4kHow0h0KTVaGTjQVXCPrlNvMyvgxCD8qgI,371
3
4
  deepresearch_flow/paper/__init__.py,sha256=sunaOkcgAJBrfmcaJTumcWbPGVUSGWvOv2a2Yidzy0A,43
4
5
  deepresearch_flow/paper/cli.py,sha256=4UY3KHi6BUGztL1vB4w0cCMiIAo9KNxrfQn1GBHt6fA,11153
5
6
  deepresearch_flow/paper/config.py,sha256=5uGTWfAfzpv4w_JxC0w6GF2teaxF5b3rD8LaDqPVshU,8611
6
- deepresearch_flow/paper/db.py,sha256=6d8bid9PiXJEHk79nqcAXoi6ZwpLlenKsbrkBIcTG1A,31648
7
- deepresearch_flow/paper/extract.py,sha256=88H1QOpzqpzlYukQ45BkuurNzxoDDWUBM0RF7Zt8IDI,31955
7
+ deepresearch_flow/paper/db.py,sha256=uX-gblqh-ltoMO6mv0KPAm-sgNaRz46jaN0kxtzvP8s,33242
8
+ deepresearch_flow/paper/extract.py,sha256=ID1dd2r6LTB0kRF4qBSH6bGtBGv0znw--g_mXYBcoeU,32314
8
9
  deepresearch_flow/paper/llm.py,sha256=R4rmFoYnGq_JiQODr4Jzk5j8U-j2NSYUXex6eR-WHXg,3929
9
10
  deepresearch_flow/paper/prompts.py,sha256=mV7cEXw8pwukBUE4Trah0SjEPSSDgg5-RGaNaUdo4EU,519
10
11
  deepresearch_flow/paper/render.py,sha256=KeccrRGf1_sxoaiT6SUDkFRj9sStReoEwNvlw1ir7qw,2181
@@ -39,7 +40,7 @@ deepresearch_flow/paper/templates/default_paper.md.j2,sha256=3azu48534QtLtHrCwI1
39
40
  deepresearch_flow/paper/templates/eight_questions.md.j2,sha256=Ecz4CD3nd7jZ4Dg8himZkTwF4WDkk0ILWk8V728uOPI,3038
40
41
  deepresearch_flow/paper/templates/three_pass.md.j2,sha256=ZRj-NkpZePnqp0gSE8OT1dN5Lr5RW4vdOYdeVejYJW0,1576
41
42
  deepresearch_flow/paper/web/__init__.py,sha256=eQBtBjvOYsNEdivHTI0aO286SCG2c86xI02tf-0jz5I,39
42
- deepresearch_flow/paper/web/app.py,sha256=VO1fa-fJrI8mtT0FwPQceeRHU6Ky4meABh-nj4HA4HE,78761
43
+ deepresearch_flow/paper/web/app.py,sha256=OB0iHU5pa7zJmP4IQAHPg4S-ucfcWBRBfHfaSDNJDTE,118325
43
44
  deepresearch_flow/paper/web/query.py,sha256=vTegfm5zGVkYCd6_K3yNrXJEmKMccUUFKG9DePPcKMw,1938
44
45
  deepresearch_flow/paper/web/pdfjs/LICENSE,sha256=DVQuDIgE45qn836wDaWnYhSdxoLXgpRRKH4RuTjpRZQ,10174
45
46
  deepresearch_flow/paper/web/pdfjs/web/compressed.tracemonkey-pldi-09.pdf,sha256=NmL_UZ5IWBBSBVK_MB2MOyuRf9L4MwP0ll16vtNn4RM,1016315
@@ -409,9 +410,9 @@ deepresearch_flow/recognize/__init__.py,sha256=yMAqbdCzpdRSiwFhq9j7yx9ZWxqz_Zq3v
409
410
  deepresearch_flow/recognize/cli.py,sha256=zhJi6f0Ha6UvX-Q4mdPdM9uz0SoBuCEnRwzDslMN2Eg,16276
410
411
  deepresearch_flow/recognize/markdown.py,sha256=y-PMJbGqrfWCNBVGanXK1M4OuMP9e1eqh7HDYye5a7Q,8757
411
412
  deepresearch_flow/recognize/organize.py,sha256=GSLmo037rpARSecaPxNCuIlLBbbilx8msWFJDqYJ4hc,3561
412
- deepresearch_flow-0.1.1.dist-info/licenses/LICENSE,sha256=hT8F2Py1pe6flxq3Ufdm2UKFk0B8CBm0aAQfsLXfvjw,1063
413
- deepresearch_flow-0.1.1.dist-info/METADATA,sha256=zSEBh8cYcPC9yEUjjRGb8LsBzHKSTe-PMJWdTK_SsFU,14395
414
- deepresearch_flow-0.1.1.dist-info/WHEEL,sha256=_zCd3N1l69ArxyTb8rzEoP9TpbYXkqRFSNOD5OuxnTs,91
415
- deepresearch_flow-0.1.1.dist-info/entry_points.txt,sha256=1uIKscs0YRMg_mFsg9NjsaTt4CvQqQ_-zGERUKhhL_Y,65
416
- deepresearch_flow-0.1.1.dist-info/top_level.txt,sha256=qBl4RvPJNJUbL8CFfMNWxY0HpQLx5RlF_ko-z_aKpm0,18
417
- deepresearch_flow-0.1.1.dist-info/RECORD,,
413
+ deepresearch_flow-0.2.0.dist-info/licenses/LICENSE,sha256=hT8F2Py1pe6flxq3Ufdm2UKFk0B8CBm0aAQfsLXfvjw,1063
414
+ deepresearch_flow-0.2.0.dist-info/METADATA,sha256=jtphayCMBS9Z1c-TkEkaMNaGa7HB8qRC_Wf0MbfzUr0,15250
415
+ deepresearch_flow-0.2.0.dist-info/WHEEL,sha256=_zCd3N1l69ArxyTb8rzEoP9TpbYXkqRFSNOD5OuxnTs,91
416
+ deepresearch_flow-0.2.0.dist-info/entry_points.txt,sha256=1uIKscs0YRMg_mFsg9NjsaTt4CvQqQ_-zGERUKhhL_Y,65
417
+ deepresearch_flow-0.2.0.dist-info/top_level.txt,sha256=qBl4RvPJNJUbL8CFfMNWxY0HpQLx5RlF_ko-z_aKpm0,18
418
+ deepresearch_flow-0.2.0.dist-info/RECORD,,