scientific-writer 2.2.1__py3-none-any.whl → 2.2.3__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Potentially problematic release.
This version of scientific-writer might be problematic.
- scientific_writer/.claude/WRITER.md +748 -0
- scientific_writer/.claude/settings.local.json +30 -0
- scientific_writer/.claude/skills/citation-management/SKILL.md +1046 -0
- scientific_writer/.claude/skills/citation-management/assets/bibtex_template.bib +264 -0
- scientific_writer/.claude/skills/citation-management/assets/citation_checklist.md +386 -0
- scientific_writer/.claude/skills/citation-management/references/bibtex_formatting.md +908 -0
- scientific_writer/.claude/skills/citation-management/references/citation_validation.md +794 -0
- scientific_writer/.claude/skills/citation-management/references/google_scholar_search.md +725 -0
- scientific_writer/.claude/skills/citation-management/references/metadata_extraction.md +870 -0
- scientific_writer/.claude/skills/citation-management/references/pubmed_search.md +839 -0
- scientific_writer/.claude/skills/citation-management/scripts/doi_to_bibtex.py +204 -0
- scientific_writer/.claude/skills/citation-management/scripts/extract_metadata.py +569 -0
- scientific_writer/.claude/skills/citation-management/scripts/format_bibtex.py +349 -0
- scientific_writer/.claude/skills/citation-management/scripts/search_google_scholar.py +282 -0
- scientific_writer/.claude/skills/citation-management/scripts/search_pubmed.py +398 -0
- scientific_writer/.claude/skills/citation-management/scripts/validate_citations.py +497 -0
- scientific_writer/.claude/skills/clinical-reports/IMPLEMENTATION_SUMMARY.md +641 -0
- scientific_writer/.claude/skills/clinical-reports/README.md +236 -0
- scientific_writer/.claude/skills/clinical-reports/SKILL.md +1088 -0
- scientific_writer/.claude/skills/clinical-reports/assets/case_report_template.md +352 -0
- scientific_writer/.claude/skills/clinical-reports/assets/clinical_trial_csr_template.md +353 -0
- scientific_writer/.claude/skills/clinical-reports/assets/clinical_trial_sae_template.md +359 -0
- scientific_writer/.claude/skills/clinical-reports/assets/consult_note_template.md +305 -0
- scientific_writer/.claude/skills/clinical-reports/assets/discharge_summary_template.md +453 -0
- scientific_writer/.claude/skills/clinical-reports/assets/hipaa_compliance_checklist.md +395 -0
- scientific_writer/.claude/skills/clinical-reports/assets/history_physical_template.md +305 -0
- scientific_writer/.claude/skills/clinical-reports/assets/lab_report_template.md +309 -0
- scientific_writer/.claude/skills/clinical-reports/assets/pathology_report_template.md +249 -0
- scientific_writer/.claude/skills/clinical-reports/assets/quality_checklist.md +338 -0
- scientific_writer/.claude/skills/clinical-reports/assets/radiology_report_template.md +318 -0
- scientific_writer/.claude/skills/clinical-reports/assets/soap_note_template.md +253 -0
- scientific_writer/.claude/skills/clinical-reports/references/case_report_guidelines.md +570 -0
- scientific_writer/.claude/skills/clinical-reports/references/clinical_trial_reporting.md +693 -0
- scientific_writer/.claude/skills/clinical-reports/references/data_presentation.md +530 -0
- scientific_writer/.claude/skills/clinical-reports/references/diagnostic_reports_standards.md +629 -0
- scientific_writer/.claude/skills/clinical-reports/references/medical_terminology.md +588 -0
- scientific_writer/.claude/skills/clinical-reports/references/patient_documentation.md +744 -0
- scientific_writer/.claude/skills/clinical-reports/references/peer_review_standards.md +585 -0
- scientific_writer/.claude/skills/clinical-reports/references/regulatory_compliance.md +577 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/check_deidentification.py +346 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/compliance_checker.py +78 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/extract_clinical_data.py +102 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/format_adverse_events.py +103 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/generate_report_template.py +163 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/terminology_validator.py +133 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/validate_case_report.py +334 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/validate_trial_report.py +89 -0
- scientific_writer/.claude/skills/document-skills/docx/LICENSE.txt +30 -0
- scientific_writer/.claude/skills/document-skills/docx/SKILL.md +197 -0
- scientific_writer/.claude/skills/document-skills/docx/docx-js.md +350 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-chart.xsd +1499 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-chartDrawing.xsd +146 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-diagram.xsd +1085 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-lockedCanvas.xsd +11 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-main.xsd +3081 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-picture.xsd +23 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-spreadsheetDrawing.xsd +185 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-wordprocessingDrawing.xsd +287 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/pml.xsd +1676 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-additionalCharacteristics.xsd +28 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-bibliography.xsd +144 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-commonSimpleTypes.xsd +174 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-customXmlDataProperties.xsd +25 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-customXmlSchemaProperties.xsd +18 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesCustom.xsd +59 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesExtended.xsd +56 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesVariantTypes.xsd +195 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-math.xsd +582 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-relationshipReference.xsd +25 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/sml.xsd +4439 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/vml-main.xsd +570 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/vml-officeDrawing.xsd +509 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/vml-presentationDrawing.xsd +12 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/vml-spreadsheetDrawing.xsd +108 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/vml-wordprocessingDrawing.xsd +96 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/wml.xsd +3646 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/xml.xsd +116 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ecma/fouth-edition/opc-contentTypes.xsd +42 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ecma/fouth-edition/opc-coreProperties.xsd +50 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ecma/fouth-edition/opc-digSig.xsd +49 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ecma/fouth-edition/opc-relationships.xsd +33 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/mce/mc.xsd +75 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-2010.xsd +560 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-2012.xsd +67 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-2018.xsd +14 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-cex-2018.xsd +20 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-cid-2016.xsd +13 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-sdtdatahash-2020.xsd +4 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-symex-2015.xsd +8 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/pack.py +159 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/unpack.py +29 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validate.py +69 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validation/__init__.py +15 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validation/base.py +951 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validation/docx.py +274 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validation/pptx.py +315 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validation/redlining.py +279 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml.md +610 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/__init__.py +1 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/document.py +1276 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/templates/comments.xml +3 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/templates/commentsExtended.xml +3 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/templates/commentsExtensible.xml +3 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/templates/commentsIds.xml +3 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/templates/people.xml +3 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/utilities.py +374 -0
- scientific_writer/.claude/skills/document-skills/pdf/LICENSE.txt +30 -0
- scientific_writer/.claude/skills/document-skills/pdf/SKILL.md +294 -0
- scientific_writer/.claude/skills/document-skills/pdf/forms.md +205 -0
- scientific_writer/.claude/skills/document-skills/pdf/reference.md +612 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/check_bounding_boxes.py +70 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/check_bounding_boxes_test.py +226 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/check_fillable_fields.py +12 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/convert_pdf_to_images.py +35 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/create_validation_image.py +41 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/extract_form_field_info.py +152 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/fill_fillable_fields.py +114 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/fill_pdf_form_with_annotations.py +108 -0
- scientific_writer/.claude/skills/document-skills/pptx/LICENSE.txt +30 -0
- scientific_writer/.claude/skills/document-skills/pptx/SKILL.md +484 -0
- scientific_writer/.claude/skills/document-skills/pptx/html2pptx.md +625 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-chart.xsd +1499 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-chartDrawing.xsd +146 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-diagram.xsd +1085 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-lockedCanvas.xsd +11 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-main.xsd +3081 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-picture.xsd +23 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-spreadsheetDrawing.xsd +185 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-wordprocessingDrawing.xsd +287 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/pml.xsd +1676 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-additionalCharacteristics.xsd +28 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-bibliography.xsd +144 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-commonSimpleTypes.xsd +174 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-customXmlDataProperties.xsd +25 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-customXmlSchemaProperties.xsd +18 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesCustom.xsd +59 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesExtended.xsd +56 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesVariantTypes.xsd +195 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-math.xsd +582 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-relationshipReference.xsd +25 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/sml.xsd +4439 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/vml-main.xsd +570 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/vml-officeDrawing.xsd +509 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/vml-presentationDrawing.xsd +12 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/vml-spreadsheetDrawing.xsd +108 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/vml-wordprocessingDrawing.xsd +96 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/wml.xsd +3646 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/xml.xsd +116 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ecma/fouth-edition/opc-contentTypes.xsd +42 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ecma/fouth-edition/opc-coreProperties.xsd +50 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ecma/fouth-edition/opc-digSig.xsd +49 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ecma/fouth-edition/opc-relationships.xsd +33 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/mce/mc.xsd +75 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-2010.xsd +560 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-2012.xsd +67 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-2018.xsd +14 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-cex-2018.xsd +20 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-cid-2016.xsd +13 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-sdtdatahash-2020.xsd +4 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-symex-2015.xsd +8 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/pack.py +159 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/unpack.py +29 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validate.py +69 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validation/__init__.py +15 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validation/base.py +951 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validation/docx.py +274 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validation/pptx.py +315 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validation/redlining.py +279 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml.md +427 -0
- scientific_writer/.claude/skills/document-skills/pptx/scripts/html2pptx.js +979 -0
- scientific_writer/.claude/skills/document-skills/pptx/scripts/inventory.py +1020 -0
- scientific_writer/.claude/skills/document-skills/pptx/scripts/rearrange.py +231 -0
- scientific_writer/.claude/skills/document-skills/pptx/scripts/replace.py +385 -0
- scientific_writer/.claude/skills/document-skills/pptx/scripts/thumbnail.py +450 -0
- scientific_writer/.claude/skills/document-skills/xlsx/LICENSE.txt +30 -0
- scientific_writer/.claude/skills/document-skills/xlsx/SKILL.md +289 -0
- scientific_writer/.claude/skills/document-skills/xlsx/recalc.py +178 -0
- scientific_writer/.claude/skills/hypothesis-generation/SKILL.md +155 -0
- scientific_writer/.claude/skills/hypothesis-generation/assets/hypothesis_output_template.md +302 -0
- scientific_writer/.claude/skills/hypothesis-generation/references/experimental_design_patterns.md +327 -0
- scientific_writer/.claude/skills/hypothesis-generation/references/hypothesis_quality_criteria.md +196 -0
- scientific_writer/.claude/skills/hypothesis-generation/references/literature_search_strategies.md +505 -0
- scientific_writer/.claude/skills/latex-posters/README.md +417 -0
- scientific_writer/.claude/skills/latex-posters/SKILL.md +919 -0
- scientific_writer/.claude/skills/latex-posters/assets/baposter_template.tex +257 -0
- scientific_writer/.claude/skills/latex-posters/assets/beamerposter_template.tex +244 -0
- scientific_writer/.claude/skills/latex-posters/assets/poster_quality_checklist.md +358 -0
- scientific_writer/.claude/skills/latex-posters/assets/tikzposter_template.tex +251 -0
- scientific_writer/.claude/skills/latex-posters/references/latex_poster_packages.md +745 -0
- scientific_writer/.claude/skills/latex-posters/references/poster_content_guide.md +748 -0
- scientific_writer/.claude/skills/latex-posters/references/poster_design_principles.md +806 -0
- scientific_writer/.claude/skills/latex-posters/references/poster_layout_design.md +900 -0
- scientific_writer/.claude/skills/latex-posters/scripts/review_poster.sh +214 -0
- scientific_writer/.claude/skills/literature-review/SKILL.md +546 -0
- scientific_writer/.claude/skills/literature-review/assets/review_template.md +412 -0
- scientific_writer/.claude/skills/literature-review/references/citation_styles.md +166 -0
- scientific_writer/.claude/skills/literature-review/references/database_strategies.md +381 -0
- scientific_writer/.claude/skills/literature-review/scripts/generate_pdf.py +176 -0
- scientific_writer/.claude/skills/literature-review/scripts/search_databases.py +303 -0
- scientific_writer/.claude/skills/literature-review/scripts/verify_citations.py +222 -0
- scientific_writer/.claude/skills/markitdown/INSTALLATION_GUIDE.md +318 -0
- scientific_writer/.claude/skills/markitdown/LICENSE.txt +22 -0
- scientific_writer/.claude/skills/markitdown/OPENROUTER_INTEGRATION.md +359 -0
- scientific_writer/.claude/skills/markitdown/QUICK_REFERENCE.md +309 -0
- scientific_writer/.claude/skills/markitdown/README.md +184 -0
- scientific_writer/.claude/skills/markitdown/SKILL.md +450 -0
- scientific_writer/.claude/skills/markitdown/SKILL_SUMMARY.md +307 -0
- scientific_writer/.claude/skills/markitdown/assets/example_usage.md +463 -0
- scientific_writer/.claude/skills/markitdown/references/api_reference.md +399 -0
- scientific_writer/.claude/skills/markitdown/references/file_formats.md +542 -0
- scientific_writer/.claude/skills/markitdown/scripts/batch_convert.py +228 -0
- scientific_writer/.claude/skills/markitdown/scripts/convert_literature.py +283 -0
- scientific_writer/.claude/skills/markitdown/scripts/convert_with_ai.py +243 -0
- scientific_writer/.claude/skills/paper-2-web/SKILL.md +455 -0
- scientific_writer/.claude/skills/paper-2-web/references/installation.md +141 -0
- scientific_writer/.claude/skills/paper-2-web/references/paper2poster.md +346 -0
- scientific_writer/.claude/skills/paper-2-web/references/paper2video.md +305 -0
- scientific_writer/.claude/skills/paper-2-web/references/paper2web.md +187 -0
- scientific_writer/.claude/skills/paper-2-web/references/usage_examples.md +436 -0
- scientific_writer/.claude/skills/peer-review/SKILL.md +375 -0
- scientific_writer/.claude/skills/peer-review/references/common_issues.md +552 -0
- scientific_writer/.claude/skills/peer-review/references/reporting_standards.md +290 -0
- scientific_writer/.claude/skills/research-grants/README.md +285 -0
- scientific_writer/.claude/skills/research-grants/SKILL.md +896 -0
- scientific_writer/.claude/skills/research-grants/assets/budget_justification_template.md +453 -0
- scientific_writer/.claude/skills/research-grants/assets/nih_specific_aims_template.md +166 -0
- scientific_writer/.claude/skills/research-grants/assets/nsf_project_summary_template.md +92 -0
- scientific_writer/.claude/skills/research-grants/references/broader_impacts.md +392 -0
- scientific_writer/.claude/skills/research-grants/references/darpa_guidelines.md +636 -0
- scientific_writer/.claude/skills/research-grants/references/doe_guidelines.md +586 -0
- scientific_writer/.claude/skills/research-grants/references/nih_guidelines.md +851 -0
- scientific_writer/.claude/skills/research-grants/references/nsf_guidelines.md +570 -0
- scientific_writer/.claude/skills/research-grants/references/specific_aims_guide.md +458 -0
- scientific_writer/.claude/skills/research-lookup/README.md +116 -0
- scientific_writer/.claude/skills/research-lookup/SKILL.md +443 -0
- scientific_writer/.claude/skills/research-lookup/examples.py +174 -0
- scientific_writer/.claude/skills/research-lookup/lookup.py +93 -0
- scientific_writer/.claude/skills/research-lookup/research_lookup.py +335 -0
- scientific_writer/.claude/skills/research-lookup/scripts/research_lookup.py +261 -0
- scientific_writer/.claude/skills/scholar-evaluation/SKILL.md +254 -0
- scientific_writer/.claude/skills/scholar-evaluation/references/evaluation_framework.md +663 -0
- scientific_writer/.claude/skills/scholar-evaluation/scripts/calculate_scores.py +378 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/SKILL.md +530 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/common_biases.md +364 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/evidence_hierarchy.md +484 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/experimental_design.md +496 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/logical_fallacies.md +478 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/scientific_method.md +169 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/statistical_pitfalls.md +506 -0
- scientific_writer/.claude/skills/scientific-schematics/SKILL.md +2035 -0
- scientific_writer/.claude/skills/scientific-schematics/assets/block_diagram_template.tex +199 -0
- scientific_writer/.claude/skills/scientific-schematics/assets/circuit_template.tex +159 -0
- scientific_writer/.claude/skills/scientific-schematics/assets/flowchart_template.tex +161 -0
- scientific_writer/.claude/skills/scientific-schematics/assets/pathway_template.tex +162 -0
- scientific_writer/.claude/skills/scientific-schematics/assets/tikz_styles.tex +422 -0
- scientific_writer/.claude/skills/scientific-schematics/references/best_practices.md +562 -0
- scientific_writer/.claude/skills/scientific-schematics/references/diagram_types.md +637 -0
- scientific_writer/.claude/skills/scientific-schematics/references/python_libraries.md +791 -0
- scientific_writer/.claude/skills/scientific-schematics/references/tikz_guide.md +734 -0
- scientific_writer/.claude/skills/scientific-schematics/scripts/circuit_generator.py +307 -0
- scientific_writer/.claude/skills/scientific-schematics/scripts/compile_tikz.py +292 -0
- scientific_writer/.claude/skills/scientific-schematics/scripts/generate_flowchart.py +281 -0
- scientific_writer/.claude/skills/scientific-schematics/scripts/pathway_diagram.py +406 -0
- scientific_writer/.claude/skills/scientific-writing/SKILL.md +443 -0
- scientific_writer/.claude/skills/scientific-writing/references/citation_styles.md +720 -0
- scientific_writer/.claude/skills/scientific-writing/references/figures_tables.md +806 -0
- scientific_writer/.claude/skills/scientific-writing/references/imrad_structure.md +658 -0
- scientific_writer/.claude/skills/scientific-writing/references/reporting_guidelines.md +748 -0
- scientific_writer/.claude/skills/scientific-writing/references/writing_principles.md +824 -0
- scientific_writer/.claude/skills/treatment-plans/README.md +483 -0
- scientific_writer/.claude/skills/treatment-plans/SKILL.md +817 -0
- scientific_writer/.claude/skills/treatment-plans/assets/chronic_disease_management_plan.tex +636 -0
- scientific_writer/.claude/skills/treatment-plans/assets/general_medical_treatment_plan.tex +616 -0
- scientific_writer/.claude/skills/treatment-plans/assets/mental_health_treatment_plan.tex +745 -0
- scientific_writer/.claude/skills/treatment-plans/assets/pain_management_plan.tex +770 -0
- scientific_writer/.claude/skills/treatment-plans/assets/perioperative_care_plan.tex +724 -0
- scientific_writer/.claude/skills/treatment-plans/assets/quality_checklist.md +471 -0
- scientific_writer/.claude/skills/treatment-plans/assets/rehabilitation_treatment_plan.tex +727 -0
- scientific_writer/.claude/skills/treatment-plans/references/goal_setting_frameworks.md +411 -0
- scientific_writer/.claude/skills/treatment-plans/references/intervention_guidelines.md +507 -0
- scientific_writer/.claude/skills/treatment-plans/references/regulatory_compliance.md +476 -0
- scientific_writer/.claude/skills/treatment-plans/references/specialty_specific_guidelines.md +607 -0
- scientific_writer/.claude/skills/treatment-plans/references/treatment_plan_standards.md +456 -0
- scientific_writer/.claude/skills/treatment-plans/scripts/check_completeness.py +318 -0
- scientific_writer/.claude/skills/treatment-plans/scripts/generate_template.py +244 -0
- scientific_writer/.claude/skills/treatment-plans/scripts/timeline_generator.py +369 -0
- scientific_writer/.claude/skills/treatment-plans/scripts/validate_treatment_plan.py +367 -0
- scientific_writer/.claude/skills/venue-templates/SKILL.md +590 -0
- scientific_writer/.claude/skills/venue-templates/assets/grants/nih_specific_aims.tex +235 -0
- scientific_writer/.claude/skills/venue-templates/assets/grants/nsf_proposal_template.tex +375 -0
- scientific_writer/.claude/skills/venue-templates/assets/journals/nature_article.tex +171 -0
- scientific_writer/.claude/skills/venue-templates/assets/journals/neurips_article.tex +283 -0
- scientific_writer/.claude/skills/venue-templates/assets/journals/plos_one.tex +317 -0
- scientific_writer/.claude/skills/venue-templates/assets/posters/beamerposter_academic.tex +311 -0
- scientific_writer/.claude/skills/venue-templates/references/conferences_formatting.md +564 -0
- scientific_writer/.claude/skills/venue-templates/references/grants_requirements.md +787 -0
- scientific_writer/.claude/skills/venue-templates/references/journals_formatting.md +486 -0
- scientific_writer/.claude/skills/venue-templates/references/posters_guidelines.md +628 -0
- scientific_writer/.claude/skills/venue-templates/scripts/customize_template.py +206 -0
- scientific_writer/.claude/skills/venue-templates/scripts/query_template.py +260 -0
- scientific_writer/.claude/skills/venue-templates/scripts/validate_format.py +255 -0
- scientific_writer/__init__.py +1 -1
- scientific_writer/api.py +9 -5
- scientific_writer/cli.py +9 -5
- scientific_writer/core.py +28 -5
- {scientific_writer-2.2.1.dist-info → scientific_writer-2.2.3.dist-info}/METADATA +1 -1
- scientific_writer-2.2.3.dist-info/RECORD +312 -0
- scientific_writer-2.2.1.dist-info/RECORD +0 -11
- {scientific_writer-2.2.1.dist-info → scientific_writer-2.2.3.dist-info}/WHEEL +0 -0
- {scientific_writer-2.2.1.dist-info → scientific_writer-2.2.3.dist-info}/entry_points.txt +0 -0
- {scientific_writer-2.2.1.dist-info → scientific_writer-2.2.3.dist-info}/licenses/LICENSE +0 -0
@@ -0,0 +1,309 @@
# MarkItDown Quick Reference

## Installation

```bash
# All features
pip install 'markitdown[all]'

# Specific formats
pip install 'markitdown[pdf,docx,pptx,xlsx]'
```

## Basic Usage

```python
from markitdown import MarkItDown

md = MarkItDown()
result = md.convert("file.pdf")
print(result.text_content)
```

## Command Line

```bash
# Simple conversion
markitdown input.pdf > output.md
markitdown input.pdf -o output.md

# With plugins
markitdown --use-plugins file.pdf -o output.md
```
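When the CLI needs to be driven from a script, the commands above can be assembled as an argument list. This is a minimal sketch, assuming the `markitdown` executable is on `PATH`. The helper names `build_markitdown_command` and `run_markitdown` are hypothetical, and only the `-o` and `--use-plugins` flags shown above are used.

```python
import subprocess
from pathlib import Path


def build_markitdown_command(src, dst, use_plugins=False):
    """Return the argv list for one conversion (hypothetical helper)."""
    cmd = ["markitdown"]
    if use_plugins:
        cmd.append("--use-plugins")
    cmd += [str(Path(src)), "-o", str(Path(dst))]
    return cmd


def run_markitdown(src, dst, use_plugins=False):
    """Run the CLI and raise CalledProcessError on a non-zero exit."""
    subprocess.run(build_markitdown_command(src, dst, use_plugins), check=True)


if __name__ == "__main__":
    print(build_markitdown_command("input.pdf", "output.md", use_plugins=True))
```

Passing a list rather than a shell string avoids quoting problems with file names that contain spaces.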

## Common Tasks

### Convert PDF
```python
md = MarkItDown()
result = md.convert("paper.pdf")
```

### Convert with AI
```python
from markitdown import MarkItDown
from openai import OpenAI

# Use OpenRouter for multiple model access
client = OpenAI(
    api_key="your-openrouter-api-key",
    base_url="https://openrouter.ai/api/v1"
)

md = MarkItDown(
    llm_client=client,
    llm_model="anthropic/claude-sonnet-4.5"  # recommended for vision
)
result = md.convert("slides.pptx")
```
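Hard-coding the key, as the placeholder above does, makes it easy to leak into version control. The sketch below reads it from the environment instead. The variable name `OPENROUTER_API_KEY` and the helper `openrouter_client_kwargs` are assumptions for illustration, not part of MarkItDown or the OpenAI SDK.

```python
import os

OPENROUTER_BASE_URL = "https://openrouter.ai/api/v1"


def openrouter_client_kwargs(env=None):
    """Build keyword arguments for OpenAI(...) from the environment.

    OPENROUTER_API_KEY is an assumed variable name; use whatever your
    deployment defines.
    """
    env = os.environ if env is None else env
    api_key = env.get("OPENROUTER_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("OPENROUTER_API_KEY is not set")
    return {"api_key": api_key, "base_url": OPENROUTER_BASE_URL}
```

The result can then be passed along as `OpenAI(**openrouter_client_kwargs())`.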

### Batch Convert
```bash
python scripts/batch_convert.py input/ output/ --extensions .pdf .docx
```

### Literature Conversion
```bash
python scripts/convert_literature.py papers/ markdown/ --create-index
```
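The packaged scripts are not reproduced in this reference. As a rough illustration of what a batch pass involves, and not the actual `scripts/batch_convert.py`, the sketch below walks a directory and writes one `.md` file per matching input. The function `batch_convert` and its `convert` parameter are hypothetical; `convert` stands for any callable that returns Markdown text, for example `lambda p: MarkItDown().convert(str(p)).text_content`.

```python
from pathlib import Path


def batch_convert(input_dir, output_dir, convert, extensions=(".pdf", ".docx")):
    """Convert every matching file under input_dir; return the written paths."""
    input_dir, output_dir = Path(input_dir), Path(output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)
    wanted = {ext.lower() for ext in extensions}
    written = []
    for src in sorted(input_dir.rglob("*")):
        # Skip directories and files whose extension was not requested.
        if not src.is_file() or src.suffix.lower() not in wanted:
            continue
        dst = output_dir / (src.stem + ".md")
        dst.write_text(convert(src), encoding="utf-8")
        written.append(dst)
    return written
```

Outputs are named by file stem only, so two inputs with the same stem in different subdirectories would overwrite each other; a real script would need to handle that case.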

## Supported Formats

| Format | Extension | Notes |
|--------|-----------|-------|
| PDF | `.pdf` | Full text + OCR |
| Word | `.docx` | Tables, formatting |
| PowerPoint | `.pptx` | Slides + notes |
| Excel | `.xlsx`, `.xls` | Tables |
| Images | `.jpg`, `.png`, `.gif`, `.webp` | EXIF + OCR |
| Audio | `.wav`, `.mp3` | Transcription |
| HTML | `.html`, `.htm` | Clean conversion |
| Data | `.csv`, `.json`, `.xml` | Structured |
| Archives | `.zip` | Iterates contents |
| E-books | `.epub` | Full text |
| YouTube | URLs | Transcripts |
|
|
84
|
+
|
|
85
|
+
## Optional Dependencies
|
|
86
|
+
|
|
87
|
+
```bash
|
|
88
|
+
[all] # All features
|
|
89
|
+
[pdf] # PDF support
|
|
90
|
+
[docx] # Word documents
|
|
91
|
+
[pptx] # PowerPoint
|
|
92
|
+
[xlsx] # Excel
|
|
93
|
+
[xls] # Old Excel
|
|
94
|
+
[outlook] # Outlook messages
|
|
95
|
+
[az-doc-intel] # Azure Document Intelligence
|
|
96
|
+
[audio-transcription] # Audio files
|
|
97
|
+
[youtube-transcription] # YouTube videos
|
|
98
|
+
```
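The extras listed above can be matched to the files you actually need to convert. The following is a minimal sketch, not part of MarkItDown: `EXTRA_BY_SUFFIX` and `pip_command` are hypothetical names, and the suffix-to-extra mapping (including the assumption that `.wav` and `.mp3` need `audio-transcription`) is inferred from the format table and the list above.

```python
from pathlib import Path
from typing import Iterable

# Suffix -> optional-dependency group, inferred from the list above (an assumption).
# Formats with no group listed above are deliberately left out.
EXTRA_BY_SUFFIX = {
    ".pdf": "pdf",
    ".docx": "docx",
    ".pptx": "pptx",
    ".xlsx": "xlsx",
    ".xls": "xls",
    ".wav": "audio-transcription",
    ".mp3": "audio-transcription",
}


def pip_command(paths: Iterable[str]) -> str:
    """Build a pip command covering only the extras these files need."""
    suffixes = (Path(p).suffix.lower() for p in paths)
    extras = sorted({EXTRA_BY_SUFFIX[s] for s in suffixes if s in EXTRA_BY_SUFFIX})
    if not extras:
        # None of the files map to a known extra: the base package is enough here.
        return "pip install markitdown"
    return "pip install 'markitdown[{}]'".format(",".join(extras))


print(pip_command(["paper.pdf", "slides.pptx", "notes.txt"]))
```

Files with suffixes outside the mapping are ignored rather than rejected, so the command stays minimal; extend the dictionary if you rely on other extras.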
|
|
99
|
+
|
|
100
|
+
## AI-Enhanced Conversion
|
|
101
|
+
|
|
102
|
+
### Scientific Papers
|
|
103
|
+
```python
|
|
104
|
+
from markitdown import MarkItDown
from openai import OpenAI

# Initialize OpenRouter client
client = OpenAI(
    api_key="your-openrouter-api-key",
    base_url="https://openrouter.ai/api/v1"
)

md = MarkItDown(
    llm_client=client,
    llm_model="anthropic/claude-sonnet-4.5",  # recommended for scientific vision
    llm_prompt="Describe scientific figures with technical precision"
)
result = md.convert("paper.pdf")
|
|
118
|
+
```
|
|
119
|
+
|
|
120
|
+
### Custom Prompts
|
|
121
|
+
```python
|
|
122
|
+
prompt = """
Analyze this data visualization. Describe:
- Type of chart/graph
- Key trends and patterns
- Notable data points
"""

# `client` is the OpenRouter client created in the previous example
md = MarkItDown(
    llm_client=client,
    llm_model="anthropic/claude-sonnet-4.5",
    llm_prompt=prompt
)
|
|
134
|
+
```
|
|
135
|
+
|
|
136
|
+
### Available Models via OpenRouter
|
|
137
|
+
- `anthropic/claude-sonnet-4.5` - **Claude Sonnet 4.5 (recommended for scientific vision)**
|
|
138
|
+
- `anthropic/claude-3.5-sonnet` - Claude 3.5 Sonnet (vision)
|
|
139
|
+
- `openai/gpt-4o` - GPT-4 Omni (vision)
|
|
140
|
+
- `openai/gpt-4-vision` - GPT-4 Vision
|
|
141
|
+
- `google/gemini-pro-vision` - Gemini Pro Vision
|
|
142
|
+
|
|
143
|
+
See https://openrouter.ai/models for the full list.
|
|
144
|
+
|
|
145
|
+
## Azure Document Intelligence
|
|
146
|
+
|
|
147
|
+
```python
|
|
148
|
+
md = MarkItDown(docintel_endpoint="https://YOUR-ENDPOINT.cognitiveservices.azure.com/")
|
|
149
|
+
result = md.convert("complex_layout.pdf")
|
|
150
|
+
```
|
|
151
|
+
|
|
152
|
+
## Batch Processing
|
|
153
|
+
|
|
154
|
+
### Python
|
|
155
|
+
```python
|
|
156
|
+
from markitdown import MarkItDown
from pathlib import Path

md = MarkItDown()
output_dir = Path("output")
output_dir.mkdir(parents=True, exist_ok=True)  # create the output directory if missing

for file in Path("input/").glob("*.pdf"):
    result = md.convert(str(file))
    output = output_dir / f"{file.stem}.md"
    output.write_text(result.text_content, encoding="utf-8")
|
|
165
|
+
```
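The loop above writes every result into a single flat directory. When the input tree is nested, it can help to mirror the subdirectories on the output side. The sketch below uses only `pathlib`; `output_path_for` and `plan_outputs` are hypothetical helper names and do not describe how `scripts/batch_convert.py` is implemented.

```python
from pathlib import Path
from typing import Iterator, Tuple


def output_path_for(src: Path, input_root: Path, output_root: Path) -> Path:
    """Map input_root/sub/name.ext to output_root/sub/name.md."""
    relative = src.relative_to(input_root)
    return output_root / relative.with_suffix(".md")


def plan_outputs(
    input_root: Path, output_root: Path, pattern: str = "*.pdf"
) -> Iterator[Tuple[Path, Path]]:
    """Yield (source, destination) pairs for every match below input_root."""
    for src in sorted(input_root.rglob(pattern)):
        yield src, output_path_for(src, input_root, output_root)


# Example mapping for a nested input file
print(output_path_for(Path("input/2023/a.pdf"), Path("input"), Path("output")))
```

Before writing each destination, create its parent with `dest.parent.mkdir(parents=True, exist_ok=True)` so nested folders exist.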
|
|
166
|
+
|
|
167
|
+
### Script
|
|
168
|
+
```bash
|
|
169
|
+
# Parallel conversion
|
|
170
|
+
python scripts/batch_convert.py input/ output/ --workers 8
|
|
171
|
+
|
|
172
|
+
# Recursive
|
|
173
|
+
python scripts/batch_convert.py input/ output/ -r
|
|
174
|
+
```
|
|
175
|
+
|
|
176
|
+
## Error Handling
|
|
177
|
+
|
|
178
|
+
```python
|
|
179
|
+
try:
    result = md.convert("file.pdf")
except FileNotFoundError:
    print("File not found")
except Exception as e:
    print(f"Error: {e}")
|
|
185
|
+
```
|
|
186
|
+
|
|
187
|
+
## Streaming
|
|
188
|
+
|
|
189
|
+
```python
|
|
190
|
+
with open("large_file.pdf", "rb") as f:
    result = md.convert_stream(f, file_extension=".pdf")
|
|
192
|
+
```
|
|
193
|
+
|
|
194
|
+
## Common Prompts
|
|
195
|
+
|
|
196
|
+
### Scientific
|
|
197
|
+
```
|
|
198
|
+
Analyze this scientific figure. Describe:
|
|
199
|
+
- Type of visualization
|
|
200
|
+
- Key data points and trends
|
|
201
|
+
- Axes, labels, and legends
|
|
202
|
+
- Scientific significance
|
|
203
|
+
```
|
|
204
|
+
|
|
205
|
+
### Medical
|
|
206
|
+
```
|
|
207
|
+
Describe this medical image. Include:
|
|
208
|
+
- Type of imaging (X-ray, MRI, CT, etc.)
|
|
209
|
+
- Anatomical structures visible
|
|
210
|
+
- Notable findings
|
|
211
|
+
- Clinical relevance
|
|
212
|
+
```
|
|
213
|
+
|
|
214
|
+
### Data Visualization
|
|
215
|
+
```
|
|
216
|
+
Analyze this data visualization:
|
|
217
|
+
- Chart type
|
|
218
|
+
- Variables and axes
|
|
219
|
+
- Data ranges
|
|
220
|
+
- Key patterns and outliers
|
|
221
|
+
```
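The three prompts above can be kept in one lookup table so that a preset name or a custom text resolves to a single string for `llm_prompt`. This is a sketch under stated assumptions: `PROMPTS` and `get_prompt` are hypothetical names, and the key `data_viz` is an illustrative choice rather than a preset name confirmed by `convert_with_ai.py`.

```python
from typing import Optional

# Preset prompts copied from the sections above; the dictionary keys are assumptions.
PROMPTS = {
    "scientific": (
        "Analyze this scientific figure. Describe:\n"
        "- Type of visualization\n"
        "- Key data points and trends\n"
        "- Axes, labels, and legends\n"
        "- Scientific significance"
    ),
    "medical": (
        "Describe this medical image. Include:\n"
        "- Type of imaging (X-ray, MRI, CT, etc.)\n"
        "- Anatomical structures visible\n"
        "- Notable findings\n"
        "- Clinical relevance"
    ),
    "data_viz": (
        "Analyze this data visualization:\n"
        "- Chart type\n"
        "- Variables and axes\n"
        "- Data ranges\n"
        "- Key patterns and outliers"
    ),
}


def get_prompt(prompt_type: str = "scientific", custom: Optional[str] = None) -> str:
    """Return the custom prompt if one is given, otherwise the named preset."""
    if custom and custom.strip():
        return custom.strip()
    if prompt_type not in PROMPTS:
        raise ValueError(
            f"unknown prompt type {prompt_type!r}; choose from {sorted(PROMPTS)}"
        )
    return PROMPTS[prompt_type]
```

The result can then be passed as `llm_prompt=get_prompt("medical")` when constructing the converter.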
|
|
222
|
+
|
|
223
|
+
## Performance Tips
|
|
224
|
+
|
|
225
|
+
1. **Reuse instance**: Create once, use many times
|
|
226
|
+
2. **Parallel processing**: Use ThreadPoolExecutor for multiple files
|
|
227
|
+
3. **Stream large files**: Use `convert_stream()` for big files
|
|
228
|
+
4. **Choose right format**: Install only needed dependencies
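
Tips 1 and 2 can be combined roughly as follows. This is a minimal sketch, not the implementation of `scripts/batch_convert.py`: `convert_many` is a hypothetical helper that takes any callable returning an object with a `text_content` attribute, and it assumes that sharing one converter instance across threads is safe, which you should verify for your MarkItDown version.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from pathlib import Path
from typing import Callable, Dict, Iterable, Tuple


def convert_many(
    paths: Iterable[Path],
    convert: Callable[[str], object],
    max_workers: int = 4,
) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Convert files concurrently; return (markdown_by_path, error_by_path)."""
    converted: Dict[str, str] = {}
    failed: Dict[str, str] = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit every file up front and remember which future belongs to which path.
        futures = {pool.submit(convert, str(p)): str(p) for p in paths}
        for future in as_completed(futures):
            path = futures[future]
            try:
                converted[path] = future.result().text_content
            except Exception as exc:  # one bad file should not stop the batch
                failed[path] = str(exc)
    return converted, failed
```

With MarkItDown installed, a call might look like `convert_many(Path("input").glob("*.pdf"), MarkItDown().convert)`, which creates the converter once and reuses it for every file.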
|
|
229
|
+
|
|
230
|
+
## Environment Variables
|
|
231
|
+
|
|
232
|
+
```bash
|
|
233
|
+
# OpenRouter for AI-enhanced conversions
|
|
234
|
+
export OPENROUTER_API_KEY="sk-or-v1-..."
|
|
235
|
+
|
|
236
|
+
# Azure Document Intelligence (optional)
|
|
237
|
+
export AZURE_DOCUMENT_INTELLIGENCE_KEY="key..."
|
|
238
|
+
export AZURE_DOCUMENT_INTELLIGENCE_ENDPOINT="https://..."
|
|
239
|
+
```
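Reading the key from the environment avoids hard-coding it in scripts. The sketch below builds the keyword arguments for the `OpenAI` client used in the earlier examples; `openrouter_client_kwargs` is a hypothetical helper name, and the base URL is the one shown in those examples.

```python
import os
from typing import Dict, Mapping, Optional

OPENROUTER_BASE_URL = "https://openrouter.ai/api/v1"  # same base URL as the examples above


def openrouter_client_kwargs(env: Optional[Mapping[str, str]] = None) -> Dict[str, str]:
    """Return api_key/base_url kwargs, failing early if the key is missing."""
    source = os.environ if env is None else env
    key = source.get("OPENROUTER_API_KEY", "").strip()
    if not key:
        raise RuntimeError("OPENROUTER_API_KEY is not set")
    return {"api_key": key, "base_url": OPENROUTER_BASE_URL}
```

A client can then be created with `OpenAI(**openrouter_client_kwargs())`, so a missing key is reported before any conversion starts.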
|
|
240
|
+
|
|
241
|
+
## Scripts Quick Reference
|
|
242
|
+
|
|
243
|
+
### batch_convert.py
|
|
244
|
+
```bash
|
|
245
|
+
python scripts/batch_convert.py INPUT OUTPUT [OPTIONS]
|
|
246
|
+
|
|
247
|
+
Options:
  --extensions .pdf .docx    File types to convert
  --recursive, -r            Search subdirectories
  --workers 4                Parallel workers
  --verbose, -v              Detailed output
  --plugins, -p              Enable plugins
|
|
253
|
+
```
|
|
254
|
+
|
|
255
|
+
### convert_with_ai.py
|
|
256
|
+
```bash
|
|
257
|
+
python scripts/convert_with_ai.py INPUT OUTPUT [OPTIONS]
|
|
258
|
+
|
|
259
|
+
Options:
  --api-key KEY          OpenRouter API key
  --model MODEL          Model name (default: anthropic/claude-sonnet-4.5)
  --prompt-type TYPE     Preset prompt (scientific, medical, etc.)
  --custom-prompt TEXT   Custom prompt
  --list-prompts         Show available prompts
|
|
265
|
+
```
|
|
266
|
+
|
|
267
|
+
### convert_literature.py
|
|
268
|
+
```bash
|
|
269
|
+
python scripts/convert_literature.py INPUT OUTPUT [OPTIONS]
|
|
270
|
+
|
|
271
|
+
Options:
  --organize-by-year, -y   Organize by year
  --create-index, -i       Create index file
  --recursive, -r          Search subdirectories
|
|
275
|
+
```
|
|
276
|
+
|
|
277
|
+
## Troubleshooting
|
|
278
|
+
|
|
279
|
+
### Missing Dependencies
|
|
280
|
+
```bash
|
|
281
|
+
pip install 'markitdown[pdf]' # Install PDF support
|
|
282
|
+
```
|
|
283
|
+
|
|
284
|
+
### Binary File Error
|
|
285
|
+
```python
|
|
286
|
+
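# Wrong: text mode cannot be used for binary documents
with open("file.pdf", "r") as f:
    result = md.convert_stream(f, file_extension=".pdf")

# Correct: open the file in binary mode
with open("file.pdf", "rb") as f:
    result = md.convert_stream(f, file_extension=".pdf")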
|
|
291
|
+
```
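A small guard can catch the text-mode mistake before the stream reaches the converter. This is a sketch using only the standard library; `require_binary` is a hypothetical helper, not a MarkItDown function.

```python
import io


def require_binary(stream):
    """Return the stream unchanged, or raise if it was opened in text mode."""
    # Files opened with mode "r" are TextIOWrapper objects, a subclass of TextIOBase.
    if isinstance(stream, io.TextIOBase):
        raise TypeError("open the file with mode 'rb'; text-mode streams are not supported")
    return stream


# In-memory bytes behave like a file opened with "rb"
checked = require_binary(io.BytesIO(b"%PDF-1.7"))
```

Wrapping the stream as `md.convert_stream(require_binary(f), file_extension=".pdf")` turns a confusing decode failure into an explicit error message.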
|
|
292
|
+
|
|
293
|
+
### OCR Not Working
|
|
294
|
+
```bash
|
|
295
|
+
# macOS
|
|
296
|
+
brew install tesseract
|
|
297
|
+
|
|
298
|
+
# Ubuntu
|
|
299
|
+
sudo apt-get install tesseract-ocr
|
|
300
|
+
```
|
|
301
|
+
|
|
302
|
+
## More Information
|
|
303
|
+
|
|
304
|
+
- **Full Documentation**: See `SKILL.md`
|
|
305
|
+
- **API Reference**: See `references/api_reference.md`
|
|
306
|
+
- **Format Details**: See `references/file_formats.md`
|
|
307
|
+
- **Examples**: See `assets/example_usage.md`
|
|
308
|
+
- **GitHub**: https://github.com/microsoft/markitdown
|
|
309
|
+
|
|
@@ -0,0 +1,184 @@
|
|
|
1
|
+
# MarkItDown Skill
|
|
2
|
+
|
|
3
|
+
This skill provides comprehensive support for converting various file formats to Markdown using Microsoft's MarkItDown tool.
|
|
4
|
+
|
|
5
|
+
## Overview
|
|
6
|
+
|
|
7
|
+
MarkItDown is a Python tool that converts files and office documents to Markdown format. This skill includes:
|
|
8
|
+
|
|
9
|
+
- Complete API documentation
|
|
10
|
+
- Format-specific conversion guides
|
|
11
|
+
- Utility scripts for batch processing
|
|
12
|
+
- AI-enhanced conversion examples
|
|
13
|
+
- Integration with scientific workflows
|
|
14
|
+
|
|
15
|
+
## Contents
|
|
16
|
+
|
|
17
|
+
### Main Skill File
|
|
18
|
+
- **SKILL.md** - Complete guide to using MarkItDown with quick start, examples, and best practices
|
|
19
|
+
|
|
20
|
+
### References
|
|
21
|
+
- **api_reference.md** - Detailed API documentation, class references, and method signatures
|
|
22
|
+
- **file_formats.md** - Format-specific details for all supported file types
|
|
23
|
+
|
|
24
|
+
### Scripts
|
|
25
|
+
- **batch_convert.py** - Batch convert multiple files with parallel processing
|
|
26
|
+
- **convert_with_ai.py** - AI-enhanced conversion with custom prompts
|
|
27
|
+
- **convert_literature.py** - Scientific literature conversion with metadata extraction
|
|
28
|
+
|
|
29
|
+
### Assets
|
|
30
|
+
- **example_usage.md** - Practical examples for common use cases
|
|
31
|
+
|
|
32
|
+
## Installation
|
|
33
|
+
|
|
34
|
+
```bash
|
|
35
|
+
# Install with all features
|
|
36
|
+
pip install 'markitdown[all]'
|
|
37
|
+
|
|
38
|
+
# Or install specific features
|
|
39
|
+
pip install 'markitdown[pdf,docx,pptx,xlsx]'
|
|
40
|
+
```
|
|
41
|
+
|
|
42
|
+
## Quick Start
|
|
43
|
+
|
|
44
|
+
```python
|
|
45
|
+
from markitdown import MarkItDown
|
|
46
|
+
|
|
47
|
+
md = MarkItDown()
|
|
48
|
+
result = md.convert("document.pdf")
|
|
49
|
+
print(result.text_content)
|
|
50
|
+
```
|
|
51
|
+
|
|
52
|
+
## Supported Formats
|
|
53
|
+
|
|
54
|
+
- **Documents**: PDF, DOCX, PPTX, XLSX, EPUB
|
|
55
|
+
- **Images**: JPEG, PNG, GIF, WebP (with OCR)
|
|
56
|
+
- **Audio**: WAV, MP3 (with transcription)
|
|
57
|
+
- **Web**: HTML, YouTube URLs
|
|
58
|
+
- **Data**: CSV, JSON, XML
|
|
59
|
+
- **Archives**: ZIP files
|
|
60
|
+
|
|
61
|
+
## Key Features
|
|
62
|
+
|
|
63
|
+
### 1. AI-Enhanced Conversions
|
|
64
|
+
Use AI models via OpenRouter to generate detailed image descriptions:
|
|
65
|
+
|
|
66
|
+
```python
|
|
67
|
+
from markitdown import MarkItDown
from openai import OpenAI

# OpenRouter provides access to 100+ AI models
client = OpenAI(
    api_key="your-openrouter-api-key",
    base_url="https://openrouter.ai/api/v1"
)

md = MarkItDown(
    llm_client=client,
    llm_model="anthropic/claude-sonnet-4.5"  # recommended for vision
)
result = md.convert("presentation.pptx")
|
|
80
|
+
```
|
|
81
|
+
|
|
82
|
+
### 2. Batch Processing
|
|
83
|
+
Convert multiple files efficiently:
|
|
84
|
+
|
|
85
|
+
```bash
|
|
86
|
+
python scripts/batch_convert.py papers/ output/ --extensions .pdf .docx
|
|
87
|
+
```
|
|
88
|
+
|
|
89
|
+
### 3. Scientific Literature
|
|
90
|
+
Convert and organize research papers:
|
|
91
|
+
|
|
92
|
+
```bash
|
|
93
|
+
python scripts/convert_literature.py papers/ output/ --organize-by-year --create-index
|
|
94
|
+
```
|
|
95
|
+
|
|
96
|
+
### 4. Azure Document Intelligence
|
|
97
|
+
Enhanced PDF conversion with Azure Document Intelligence:
|
|
98
|
+
|
|
99
|
+
```python
|
|
100
|
+
md = MarkItDown(docintel_endpoint="https://YOUR-ENDPOINT.cognitiveservices.azure.com/")
|
|
101
|
+
result = md.convert("complex_document.pdf")
|
|
102
|
+
```
|
|
103
|
+
|
|
104
|
+
## Use Cases
|
|
105
|
+
|
|
106
|
+
### Literature Review
|
|
107
|
+
Convert research papers to Markdown for easier analysis and note-taking.
|
|
108
|
+
|
|
109
|
+
### Data Extraction
|
|
110
|
+
Extract tables from Excel files into Markdown format.
|
|
111
|
+
|
|
112
|
+
### Presentation Processing
|
|
113
|
+
Convert PowerPoint slides with AI-generated descriptions.
|
|
114
|
+
|
|
115
|
+
### Document Analysis
|
|
116
|
+
Process documents for LLM consumption with token-efficient Markdown.
|
|
117
|
+
|
|
118
|
+
### YouTube Transcripts
|
|
119
|
+
Fetch and convert YouTube video transcriptions.
|
|
120
|
+
|
|
121
|
+
## Scripts Usage
|
|
122
|
+
|
|
123
|
+
### Batch Convert
|
|
124
|
+
```bash
|
|
125
|
+
# Convert all PDFs in a directory
|
|
126
|
+
python scripts/batch_convert.py input_dir/ output_dir/ --extensions .pdf
|
|
127
|
+
|
|
128
|
+
# Recursive with multiple formats
|
|
129
|
+
python scripts/batch_convert.py docs/ markdown/ --extensions .pdf .docx .pptx -r
|
|
130
|
+
```
|
|
131
|
+
|
|
132
|
+
### AI-Enhanced Conversion
|
|
133
|
+
```bash
|
|
134
|
+
# Convert with AI descriptions via OpenRouter
|
|
135
|
+
export OPENROUTER_API_KEY="sk-or-v1-..."
|
|
136
|
+
python scripts/convert_with_ai.py paper.pdf output.md --prompt-type scientific
|
|
137
|
+
|
|
138
|
+
# Use different models
|
|
139
|
+
python scripts/convert_with_ai.py image.png output.md --model anthropic/claude-sonnet-4.5
|
|
140
|
+
|
|
141
|
+
# Use custom prompt
|
|
142
|
+
python scripts/convert_with_ai.py image.png output.md --custom-prompt "Describe this diagram"
|
|
143
|
+
```
|
|
144
|
+
|
|
145
|
+
### Literature Conversion
|
|
146
|
+
```bash
|
|
147
|
+
# Convert papers with metadata extraction
|
|
148
|
+
python scripts/convert_literature.py papers/ markdown/ --organize-by-year --create-index
|
|
149
|
+
```
|
|
150
|
+
|
|
151
|
+
## Integration with Scientific Writer
|
|
152
|
+
|
|
153
|
+
This skill integrates with the Scientific Writer CLI for:
|
|
154
|
+
- Converting source materials for paper writing
|
|
155
|
+
- Processing literature for reviews
|
|
156
|
+
- Extracting data from various document formats
|
|
157
|
+
- Preparing documents for LLM analysis
|
|
158
|
+
|
|
159
|
+
## Resources
|
|
160
|
+
|
|
161
|
+
- **MarkItDown GitHub**: https://github.com/microsoft/markitdown
|
|
162
|
+
- **PyPI**: https://pypi.org/project/markitdown/
|
|
163
|
+
- **OpenRouter**: https://openrouter.ai (AI model access)
|
|
164
|
+
- **OpenRouter API Keys**: https://openrouter.ai/keys
|
|
165
|
+
- **OpenRouter Models**: https://openrouter.ai/models
|
|
166
|
+
- **License**: MIT
|
|
167
|
+
|
|
168
|
+
## Requirements
|
|
169
|
+
|
|
170
|
+
- Python 3.10+
|
|
171
|
+
- Optional dependencies based on formats needed
|
|
172
|
+
- OpenRouter API key (for AI-enhanced conversions) - Get at https://openrouter.ai/keys
|
|
173
|
+
- Azure subscription (optional, for Document Intelligence)
|
|
174
|
+
|
|
175
|
+
## Examples
|
|
176
|
+
|
|
177
|
+
See `assets/example_usage.md` for comprehensive examples covering:
|
|
178
|
+
- Basic conversions
|
|
179
|
+
- Scientific workflows
|
|
180
|
+
- AI-enhanced processing
|
|
181
|
+
- Batch operations
|
|
182
|
+
- Error handling
|
|
183
|
+
- Integration patterns
|
|
184
|
+
|