scientific-writer 2.2.1__py3-none-any.whl → 2.2.3__py3-none-any.whl
Potentially problematic release.
This version of scientific-writer might be problematic.
- scientific_writer/.claude/WRITER.md +748 -0
- scientific_writer/.claude/settings.local.json +30 -0
- scientific_writer/.claude/skills/citation-management/SKILL.md +1046 -0
- scientific_writer/.claude/skills/citation-management/assets/bibtex_template.bib +264 -0
- scientific_writer/.claude/skills/citation-management/assets/citation_checklist.md +386 -0
- scientific_writer/.claude/skills/citation-management/references/bibtex_formatting.md +908 -0
- scientific_writer/.claude/skills/citation-management/references/citation_validation.md +794 -0
- scientific_writer/.claude/skills/citation-management/references/google_scholar_search.md +725 -0
- scientific_writer/.claude/skills/citation-management/references/metadata_extraction.md +870 -0
- scientific_writer/.claude/skills/citation-management/references/pubmed_search.md +839 -0
- scientific_writer/.claude/skills/citation-management/scripts/doi_to_bibtex.py +204 -0
- scientific_writer/.claude/skills/citation-management/scripts/extract_metadata.py +569 -0
- scientific_writer/.claude/skills/citation-management/scripts/format_bibtex.py +349 -0
- scientific_writer/.claude/skills/citation-management/scripts/search_google_scholar.py +282 -0
- scientific_writer/.claude/skills/citation-management/scripts/search_pubmed.py +398 -0
- scientific_writer/.claude/skills/citation-management/scripts/validate_citations.py +497 -0
- scientific_writer/.claude/skills/clinical-reports/IMPLEMENTATION_SUMMARY.md +641 -0
- scientific_writer/.claude/skills/clinical-reports/README.md +236 -0
- scientific_writer/.claude/skills/clinical-reports/SKILL.md +1088 -0
- scientific_writer/.claude/skills/clinical-reports/assets/case_report_template.md +352 -0
- scientific_writer/.claude/skills/clinical-reports/assets/clinical_trial_csr_template.md +353 -0
- scientific_writer/.claude/skills/clinical-reports/assets/clinical_trial_sae_template.md +359 -0
- scientific_writer/.claude/skills/clinical-reports/assets/consult_note_template.md +305 -0
- scientific_writer/.claude/skills/clinical-reports/assets/discharge_summary_template.md +453 -0
- scientific_writer/.claude/skills/clinical-reports/assets/hipaa_compliance_checklist.md +395 -0
- scientific_writer/.claude/skills/clinical-reports/assets/history_physical_template.md +305 -0
- scientific_writer/.claude/skills/clinical-reports/assets/lab_report_template.md +309 -0
- scientific_writer/.claude/skills/clinical-reports/assets/pathology_report_template.md +249 -0
- scientific_writer/.claude/skills/clinical-reports/assets/quality_checklist.md +338 -0
- scientific_writer/.claude/skills/clinical-reports/assets/radiology_report_template.md +318 -0
- scientific_writer/.claude/skills/clinical-reports/assets/soap_note_template.md +253 -0
- scientific_writer/.claude/skills/clinical-reports/references/case_report_guidelines.md +570 -0
- scientific_writer/.claude/skills/clinical-reports/references/clinical_trial_reporting.md +693 -0
- scientific_writer/.claude/skills/clinical-reports/references/data_presentation.md +530 -0
- scientific_writer/.claude/skills/clinical-reports/references/diagnostic_reports_standards.md +629 -0
- scientific_writer/.claude/skills/clinical-reports/references/medical_terminology.md +588 -0
- scientific_writer/.claude/skills/clinical-reports/references/patient_documentation.md +744 -0
- scientific_writer/.claude/skills/clinical-reports/references/peer_review_standards.md +585 -0
- scientific_writer/.claude/skills/clinical-reports/references/regulatory_compliance.md +577 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/check_deidentification.py +346 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/compliance_checker.py +78 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/extract_clinical_data.py +102 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/format_adverse_events.py +103 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/generate_report_template.py +163 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/terminology_validator.py +133 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/validate_case_report.py +334 -0
- scientific_writer/.claude/skills/clinical-reports/scripts/validate_trial_report.py +89 -0
- scientific_writer/.claude/skills/document-skills/docx/LICENSE.txt +30 -0
- scientific_writer/.claude/skills/document-skills/docx/SKILL.md +197 -0
- scientific_writer/.claude/skills/document-skills/docx/docx-js.md +350 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-chart.xsd +1499 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-chartDrawing.xsd +146 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-diagram.xsd +1085 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-lockedCanvas.xsd +11 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-main.xsd +3081 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-picture.xsd +23 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-spreadsheetDrawing.xsd +185 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/dml-wordprocessingDrawing.xsd +287 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/pml.xsd +1676 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-additionalCharacteristics.xsd +28 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-bibliography.xsd +144 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-commonSimpleTypes.xsd +174 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-customXmlDataProperties.xsd +25 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-customXmlSchemaProperties.xsd +18 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesCustom.xsd +59 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesExtended.xsd +56 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesVariantTypes.xsd +195 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-math.xsd +582 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/shared-relationshipReference.xsd +25 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/sml.xsd +4439 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/vml-main.xsd +570 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/vml-officeDrawing.xsd +509 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/vml-presentationDrawing.xsd +12 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/vml-spreadsheetDrawing.xsd +108 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/vml-wordprocessingDrawing.xsd +96 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/wml.xsd +3646 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ISO-IEC29500-4_2016/xml.xsd +116 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ecma/fouth-edition/opc-contentTypes.xsd +42 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ecma/fouth-edition/opc-coreProperties.xsd +50 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ecma/fouth-edition/opc-digSig.xsd +49 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/ecma/fouth-edition/opc-relationships.xsd +33 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/mce/mc.xsd +75 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-2010.xsd +560 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-2012.xsd +67 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-2018.xsd +14 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-cex-2018.xsd +20 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-cid-2016.xsd +13 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-sdtdatahash-2020.xsd +4 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/schemas/microsoft/wml-symex-2015.xsd +8 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/pack.py +159 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/unpack.py +29 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validate.py +69 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validation/__init__.py +15 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validation/base.py +951 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validation/docx.py +274 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validation/pptx.py +315 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml/scripts/validation/redlining.py +279 -0
- scientific_writer/.claude/skills/document-skills/docx/ooxml.md +610 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/__init__.py +1 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/document.py +1276 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/templates/comments.xml +3 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/templates/commentsExtended.xml +3 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/templates/commentsExtensible.xml +3 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/templates/commentsIds.xml +3 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/templates/people.xml +3 -0
- scientific_writer/.claude/skills/document-skills/docx/scripts/utilities.py +374 -0
- scientific_writer/.claude/skills/document-skills/pdf/LICENSE.txt +30 -0
- scientific_writer/.claude/skills/document-skills/pdf/SKILL.md +294 -0
- scientific_writer/.claude/skills/document-skills/pdf/forms.md +205 -0
- scientific_writer/.claude/skills/document-skills/pdf/reference.md +612 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/check_bounding_boxes.py +70 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/check_bounding_boxes_test.py +226 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/check_fillable_fields.py +12 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/convert_pdf_to_images.py +35 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/create_validation_image.py +41 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/extract_form_field_info.py +152 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/fill_fillable_fields.py +114 -0
- scientific_writer/.claude/skills/document-skills/pdf/scripts/fill_pdf_form_with_annotations.py +108 -0
- scientific_writer/.claude/skills/document-skills/pptx/LICENSE.txt +30 -0
- scientific_writer/.claude/skills/document-skills/pptx/SKILL.md +484 -0
- scientific_writer/.claude/skills/document-skills/pptx/html2pptx.md +625 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-chart.xsd +1499 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-chartDrawing.xsd +146 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-diagram.xsd +1085 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-lockedCanvas.xsd +11 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-main.xsd +3081 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-picture.xsd +23 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-spreadsheetDrawing.xsd +185 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/dml-wordprocessingDrawing.xsd +287 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/pml.xsd +1676 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-additionalCharacteristics.xsd +28 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-bibliography.xsd +144 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-commonSimpleTypes.xsd +174 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-customXmlDataProperties.xsd +25 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-customXmlSchemaProperties.xsd +18 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesCustom.xsd +59 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesExtended.xsd +56 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-documentPropertiesVariantTypes.xsd +195 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-math.xsd +582 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/shared-relationshipReference.xsd +25 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/sml.xsd +4439 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/vml-main.xsd +570 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/vml-officeDrawing.xsd +509 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/vml-presentationDrawing.xsd +12 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/vml-spreadsheetDrawing.xsd +108 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/vml-wordprocessingDrawing.xsd +96 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/wml.xsd +3646 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ISO-IEC29500-4_2016/xml.xsd +116 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ecma/fouth-edition/opc-contentTypes.xsd +42 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ecma/fouth-edition/opc-coreProperties.xsd +50 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ecma/fouth-edition/opc-digSig.xsd +49 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/ecma/fouth-edition/opc-relationships.xsd +33 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/mce/mc.xsd +75 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-2010.xsd +560 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-2012.xsd +67 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-2018.xsd +14 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-cex-2018.xsd +20 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-cid-2016.xsd +13 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-sdtdatahash-2020.xsd +4 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/schemas/microsoft/wml-symex-2015.xsd +8 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/pack.py +159 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/unpack.py +29 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validate.py +69 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validation/__init__.py +15 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validation/base.py +951 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validation/docx.py +274 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validation/pptx.py +315 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml/scripts/validation/redlining.py +279 -0
- scientific_writer/.claude/skills/document-skills/pptx/ooxml.md +427 -0
- scientific_writer/.claude/skills/document-skills/pptx/scripts/html2pptx.js +979 -0
- scientific_writer/.claude/skills/document-skills/pptx/scripts/inventory.py +1020 -0
- scientific_writer/.claude/skills/document-skills/pptx/scripts/rearrange.py +231 -0
- scientific_writer/.claude/skills/document-skills/pptx/scripts/replace.py +385 -0
- scientific_writer/.claude/skills/document-skills/pptx/scripts/thumbnail.py +450 -0
- scientific_writer/.claude/skills/document-skills/xlsx/LICENSE.txt +30 -0
- scientific_writer/.claude/skills/document-skills/xlsx/SKILL.md +289 -0
- scientific_writer/.claude/skills/document-skills/xlsx/recalc.py +178 -0
- scientific_writer/.claude/skills/hypothesis-generation/SKILL.md +155 -0
- scientific_writer/.claude/skills/hypothesis-generation/assets/hypothesis_output_template.md +302 -0
- scientific_writer/.claude/skills/hypothesis-generation/references/experimental_design_patterns.md +327 -0
- scientific_writer/.claude/skills/hypothesis-generation/references/hypothesis_quality_criteria.md +196 -0
- scientific_writer/.claude/skills/hypothesis-generation/references/literature_search_strategies.md +505 -0
- scientific_writer/.claude/skills/latex-posters/README.md +417 -0
- scientific_writer/.claude/skills/latex-posters/SKILL.md +919 -0
- scientific_writer/.claude/skills/latex-posters/assets/baposter_template.tex +257 -0
- scientific_writer/.claude/skills/latex-posters/assets/beamerposter_template.tex +244 -0
- scientific_writer/.claude/skills/latex-posters/assets/poster_quality_checklist.md +358 -0
- scientific_writer/.claude/skills/latex-posters/assets/tikzposter_template.tex +251 -0
- scientific_writer/.claude/skills/latex-posters/references/latex_poster_packages.md +745 -0
- scientific_writer/.claude/skills/latex-posters/references/poster_content_guide.md +748 -0
- scientific_writer/.claude/skills/latex-posters/references/poster_design_principles.md +806 -0
- scientific_writer/.claude/skills/latex-posters/references/poster_layout_design.md +900 -0
- scientific_writer/.claude/skills/latex-posters/scripts/review_poster.sh +214 -0
- scientific_writer/.claude/skills/literature-review/SKILL.md +546 -0
- scientific_writer/.claude/skills/literature-review/assets/review_template.md +412 -0
- scientific_writer/.claude/skills/literature-review/references/citation_styles.md +166 -0
- scientific_writer/.claude/skills/literature-review/references/database_strategies.md +381 -0
- scientific_writer/.claude/skills/literature-review/scripts/generate_pdf.py +176 -0
- scientific_writer/.claude/skills/literature-review/scripts/search_databases.py +303 -0
- scientific_writer/.claude/skills/literature-review/scripts/verify_citations.py +222 -0
- scientific_writer/.claude/skills/markitdown/INSTALLATION_GUIDE.md +318 -0
- scientific_writer/.claude/skills/markitdown/LICENSE.txt +22 -0
- scientific_writer/.claude/skills/markitdown/OPENROUTER_INTEGRATION.md +359 -0
- scientific_writer/.claude/skills/markitdown/QUICK_REFERENCE.md +309 -0
- scientific_writer/.claude/skills/markitdown/README.md +184 -0
- scientific_writer/.claude/skills/markitdown/SKILL.md +450 -0
- scientific_writer/.claude/skills/markitdown/SKILL_SUMMARY.md +307 -0
- scientific_writer/.claude/skills/markitdown/assets/example_usage.md +463 -0
- scientific_writer/.claude/skills/markitdown/references/api_reference.md +399 -0
- scientific_writer/.claude/skills/markitdown/references/file_formats.md +542 -0
- scientific_writer/.claude/skills/markitdown/scripts/batch_convert.py +228 -0
- scientific_writer/.claude/skills/markitdown/scripts/convert_literature.py +283 -0
- scientific_writer/.claude/skills/markitdown/scripts/convert_with_ai.py +243 -0
- scientific_writer/.claude/skills/paper-2-web/SKILL.md +455 -0
- scientific_writer/.claude/skills/paper-2-web/references/installation.md +141 -0
- scientific_writer/.claude/skills/paper-2-web/references/paper2poster.md +346 -0
- scientific_writer/.claude/skills/paper-2-web/references/paper2video.md +305 -0
- scientific_writer/.claude/skills/paper-2-web/references/paper2web.md +187 -0
- scientific_writer/.claude/skills/paper-2-web/references/usage_examples.md +436 -0
- scientific_writer/.claude/skills/peer-review/SKILL.md +375 -0
- scientific_writer/.claude/skills/peer-review/references/common_issues.md +552 -0
- scientific_writer/.claude/skills/peer-review/references/reporting_standards.md +290 -0
- scientific_writer/.claude/skills/research-grants/README.md +285 -0
- scientific_writer/.claude/skills/research-grants/SKILL.md +896 -0
- scientific_writer/.claude/skills/research-grants/assets/budget_justification_template.md +453 -0
- scientific_writer/.claude/skills/research-grants/assets/nih_specific_aims_template.md +166 -0
- scientific_writer/.claude/skills/research-grants/assets/nsf_project_summary_template.md +92 -0
- scientific_writer/.claude/skills/research-grants/references/broader_impacts.md +392 -0
- scientific_writer/.claude/skills/research-grants/references/darpa_guidelines.md +636 -0
- scientific_writer/.claude/skills/research-grants/references/doe_guidelines.md +586 -0
- scientific_writer/.claude/skills/research-grants/references/nih_guidelines.md +851 -0
- scientific_writer/.claude/skills/research-grants/references/nsf_guidelines.md +570 -0
- scientific_writer/.claude/skills/research-grants/references/specific_aims_guide.md +458 -0
- scientific_writer/.claude/skills/research-lookup/README.md +116 -0
- scientific_writer/.claude/skills/research-lookup/SKILL.md +443 -0
- scientific_writer/.claude/skills/research-lookup/examples.py +174 -0
- scientific_writer/.claude/skills/research-lookup/lookup.py +93 -0
- scientific_writer/.claude/skills/research-lookup/research_lookup.py +335 -0
- scientific_writer/.claude/skills/research-lookup/scripts/research_lookup.py +261 -0
- scientific_writer/.claude/skills/scholar-evaluation/SKILL.md +254 -0
- scientific_writer/.claude/skills/scholar-evaluation/references/evaluation_framework.md +663 -0
- scientific_writer/.claude/skills/scholar-evaluation/scripts/calculate_scores.py +378 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/SKILL.md +530 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/common_biases.md +364 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/evidence_hierarchy.md +484 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/experimental_design.md +496 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/logical_fallacies.md +478 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/scientific_method.md +169 -0
- scientific_writer/.claude/skills/scientific-critical-thinking/references/statistical_pitfalls.md +506 -0
- scientific_writer/.claude/skills/scientific-schematics/SKILL.md +2035 -0
- scientific_writer/.claude/skills/scientific-schematics/assets/block_diagram_template.tex +199 -0
- scientific_writer/.claude/skills/scientific-schematics/assets/circuit_template.tex +159 -0
- scientific_writer/.claude/skills/scientific-schematics/assets/flowchart_template.tex +161 -0
- scientific_writer/.claude/skills/scientific-schematics/assets/pathway_template.tex +162 -0
- scientific_writer/.claude/skills/scientific-schematics/assets/tikz_styles.tex +422 -0
- scientific_writer/.claude/skills/scientific-schematics/references/best_practices.md +562 -0
- scientific_writer/.claude/skills/scientific-schematics/references/diagram_types.md +637 -0
- scientific_writer/.claude/skills/scientific-schematics/references/python_libraries.md +791 -0
- scientific_writer/.claude/skills/scientific-schematics/references/tikz_guide.md +734 -0
- scientific_writer/.claude/skills/scientific-schematics/scripts/circuit_generator.py +307 -0
- scientific_writer/.claude/skills/scientific-schematics/scripts/compile_tikz.py +292 -0
- scientific_writer/.claude/skills/scientific-schematics/scripts/generate_flowchart.py +281 -0
- scientific_writer/.claude/skills/scientific-schematics/scripts/pathway_diagram.py +406 -0
- scientific_writer/.claude/skills/scientific-writing/SKILL.md +443 -0
- scientific_writer/.claude/skills/scientific-writing/references/citation_styles.md +720 -0
- scientific_writer/.claude/skills/scientific-writing/references/figures_tables.md +806 -0
- scientific_writer/.claude/skills/scientific-writing/references/imrad_structure.md +658 -0
- scientific_writer/.claude/skills/scientific-writing/references/reporting_guidelines.md +748 -0
- scientific_writer/.claude/skills/scientific-writing/references/writing_principles.md +824 -0
- scientific_writer/.claude/skills/treatment-plans/README.md +483 -0
- scientific_writer/.claude/skills/treatment-plans/SKILL.md +817 -0
- scientific_writer/.claude/skills/treatment-plans/assets/chronic_disease_management_plan.tex +636 -0
- scientific_writer/.claude/skills/treatment-plans/assets/general_medical_treatment_plan.tex +616 -0
- scientific_writer/.claude/skills/treatment-plans/assets/mental_health_treatment_plan.tex +745 -0
- scientific_writer/.claude/skills/treatment-plans/assets/pain_management_plan.tex +770 -0
- scientific_writer/.claude/skills/treatment-plans/assets/perioperative_care_plan.tex +724 -0
- scientific_writer/.claude/skills/treatment-plans/assets/quality_checklist.md +471 -0
- scientific_writer/.claude/skills/treatment-plans/assets/rehabilitation_treatment_plan.tex +727 -0
- scientific_writer/.claude/skills/treatment-plans/references/goal_setting_frameworks.md +411 -0
- scientific_writer/.claude/skills/treatment-plans/references/intervention_guidelines.md +507 -0
- scientific_writer/.claude/skills/treatment-plans/references/regulatory_compliance.md +476 -0
- scientific_writer/.claude/skills/treatment-plans/references/specialty_specific_guidelines.md +607 -0
- scientific_writer/.claude/skills/treatment-plans/references/treatment_plan_standards.md +456 -0
- scientific_writer/.claude/skills/treatment-plans/scripts/check_completeness.py +318 -0
- scientific_writer/.claude/skills/treatment-plans/scripts/generate_template.py +244 -0
- scientific_writer/.claude/skills/treatment-plans/scripts/timeline_generator.py +369 -0
- scientific_writer/.claude/skills/treatment-plans/scripts/validate_treatment_plan.py +367 -0
- scientific_writer/.claude/skills/venue-templates/SKILL.md +590 -0
- scientific_writer/.claude/skills/venue-templates/assets/grants/nih_specific_aims.tex +235 -0
- scientific_writer/.claude/skills/venue-templates/assets/grants/nsf_proposal_template.tex +375 -0
- scientific_writer/.claude/skills/venue-templates/assets/journals/nature_article.tex +171 -0
- scientific_writer/.claude/skills/venue-templates/assets/journals/neurips_article.tex +283 -0
- scientific_writer/.claude/skills/venue-templates/assets/journals/plos_one.tex +317 -0
- scientific_writer/.claude/skills/venue-templates/assets/posters/beamerposter_academic.tex +311 -0
- scientific_writer/.claude/skills/venue-templates/references/conferences_formatting.md +564 -0
- scientific_writer/.claude/skills/venue-templates/references/grants_requirements.md +787 -0
- scientific_writer/.claude/skills/venue-templates/references/journals_formatting.md +486 -0
- scientific_writer/.claude/skills/venue-templates/references/posters_guidelines.md +628 -0
- scientific_writer/.claude/skills/venue-templates/scripts/customize_template.py +206 -0
- scientific_writer/.claude/skills/venue-templates/scripts/query_template.py +260 -0
- scientific_writer/.claude/skills/venue-templates/scripts/validate_format.py +255 -0
- scientific_writer/__init__.py +1 -1
- scientific_writer/api.py +9 -5
- scientific_writer/cli.py +9 -5
- scientific_writer/core.py +28 -5
- {scientific_writer-2.2.1.dist-info → scientific_writer-2.2.3.dist-info}/METADATA +1 -1
- scientific_writer-2.2.3.dist-info/RECORD +312 -0
- scientific_writer-2.2.1.dist-info/RECORD +0 -11
- {scientific_writer-2.2.1.dist-info → scientific_writer-2.2.3.dist-info}/WHEEL +0 -0
- {scientific_writer-2.2.1.dist-info → scientific_writer-2.2.3.dist-info}/entry_points.txt +0 -0
- {scientific_writer-2.2.1.dist-info → scientific_writer-2.2.3.dist-info}/licenses/LICENSE +0 -0

@@ -0,0 +1,725 @@

# Google Scholar Search Guide

Comprehensive guide to searching Google Scholar for academic papers, including advanced search operators, filtering strategies, and metadata extraction.

## Overview

Google Scholar offers broad coverage of academic literature across disciplines:
- **Coverage**: An estimated 100+ million scholarly documents
- **Scope**: All academic disciplines
- **Content types**: Journal articles, books, theses, conference papers, preprints, patents, court opinions
- **Citation tracking**: "Cited by" links for forward citation tracking
- **Accessibility**: Free to use, no account required

## Basic Search

### Simple Keyword Search

Search for papers containing specific terms anywhere in the document (title, abstract, full text):

```
CRISPR gene editing
machine learning protein folding
climate change impact agriculture
quantum computing algorithms
```

**Tips**:
- Use specific technical terms
- Include key acronyms and abbreviations
- Start broad, then refine
- Check spelling of technical terms
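
A keyword query can also be turned into a results URL from a script. The following is a minimal sketch, assuming the public results page lives at `https://scholar.google.com/scholar` and takes the query in its `q` parameter; `scholar_url` is a hypothetical helper name used only for illustration.

```python
from urllib.parse import urlencode

# Assumed base URL of the Google Scholar results page.
SCHOLAR_BASE = "https://scholar.google.com/scholar"


def scholar_url(query: str) -> str:
    """Build a results URL for a plain keyword query (hypothetical helper)."""
    # Collapse runs of whitespace so copy-pasted queries are normalized.
    cleaned = " ".join(query.split())
    # urlencode percent-encodes the value and turns spaces into '+'.
    return f"{SCHOLAR_BASE}?{urlencode({'q': cleaned})}"


print(scholar_url("CRISPR  gene editing"))
```

Opening the printed URL in a browser should run the same search as typing the keywords into the search box.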

### Exact Phrase Search

Use quotation marks to search for exact phrases:

```
"deep learning"
"CRISPR-Cas9"
"systematic review"
"randomized controlled trial"
```

**When to use**:
- Technical terms that must appear together
- Proper names
- Specific methodologies
- Exact titles
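
When queries are assembled in code, the quoting is easy to get wrong (missing or doubled quotation marks). The sketch below shows one way to combine exact phrases with loose keywords; `quote_phrase` and `combine` are hypothetical names, and the example terms are only illustrative.

```python
def quote_phrase(phrase: str) -> str:
    """Wrap a phrase in double quotes so it is matched exactly."""
    # Remove surrounding whitespace and any quotes the caller already added.
    inner = phrase.strip().strip('"').strip()
    return f'"{inner}"'


def combine(phrases, keywords=()):
    """Join exact phrases and loose keywords into one query string."""
    parts = [quote_phrase(p) for p in phrases]
    parts.extend(keywords)
    return " ".join(parts)


print(combine(["randomized controlled trial", '"systematic review"'], ["statin"]))
```

Stripping existing quotes first keeps the helper safe to call on input that is already quoted.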

## Advanced Search Operators

### Author Search

Find papers by specific authors:

```
author:LeCun
author:"Geoffrey Hinton"
author:Church synthetic biology
```

**Variations**:
- Single last name: `author:Smith`
- Full name in quotes: `author:"Jane Smith"`
- Author + topic: `author:Doudna CRISPR`

**Tips**:
- Authors may publish under different name variations
- Try with and without middle initials
- Consider name changes (marriage, etc.)
- Use quotation marks for full names

### Title Search

Search only in article titles. `intitle:` applies to the word or quoted phrase that immediately follows it, so in the third example below only `review` is restricted to the title:

```
intitle:transformer
intitle:"attention mechanism"
intitle:review climate change
```

**Use cases**:
- Finding papers specifically about a topic
- More precise than full-text search
- Reduces irrelevant results
- Good for finding reviews or methods

### Source (Journal) Search

Search within specific journals or conferences:

```
source:Nature
source:"Nature Communications"
source:NeurIPS
source:"Journal of Machine Learning Research"
```

**Applications**:
- Track publications in top-tier venues
- Find papers in specialized journals
- Identify conference-specific work
- Verify publication venue

### Exclusion Operator

Exclude terms from results:

```
machine learning -survey
CRISPR -patent
climate change -news
deep learning -tutorial -review
```

**Common exclusions**:
- `-survey`: Exclude survey papers
- `-review`: Exclude review articles
- `-patent`: Exclude patents
- `-book`: Exclude books
- `-news`: Exclude news articles
- `-tutorial`: Exclude tutorials

### OR Operator

Search for papers containing any of multiple terms:

```
"machine learning" OR "deep learning"
CRISPR OR "gene editing"
"climate change" OR "global warming"
```

**Best practices**:
- OR must be uppercase
- Combine synonyms
- Include acronyms and spelled-out versions
- Use with exact phrases

### Wildcard Search

Use an asterisk (`*`) as a wildcard for unknown words:

```
"machine * learning"
"CRISPR * editing"
"* neural network"
```

**Note**: Google Scholar's wildcard support is limited compared to other databases.
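
The operators in this section can also be assembled programmatically. The following is a minimal sketch, not code from the package's scripts: `build_query` and its parameters are hypothetical names chosen for illustration, and the function only concatenates the `author:`, `intitle:`, `source:` and `-term` forms described above.

```python
def quote(term):
    """Wrap multi-word terms in double quotes so they match as a phrase."""
    return f'"{term}"' if " " in term else term


def build_query(keywords=(), author=None, title=None, source=None, exclude=()):
    """Assemble a Google Scholar query string from optional parts (hypothetical helper)."""
    parts = [quote(k) for k in keywords]
    if author:
        parts.append(f"author:{quote(author)}")
    if title:
        parts.append(f"intitle:{quote(title)}")
    if source:
        parts.append(f"source:{quote(source)}")
    # Each excluded term gets a leading minus sign
    parts.extend(f"-{quote(term)}" for term in exclude)
    return " ".join(parts)


print(build_query(keywords=["CRISPR"], author="Jane Smith", exclude=["patent"]))
# prints: CRISPR author:"Jane Smith" -patent
```

The resulting string can be pasted into the search box or passed as the query argument to a search script.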

## Advanced Filtering

### Year Range

Filter by publication year:

**Using interface**:
- Click "Since [year]" in the left sidebar
- Or select a custom range and enter the start and end years

**Using search operators**:
```
# Not available directly in the search query
# Use the interface or URL parameters instead
```

**In script**:
```bash
python scripts/search_google_scholar.py "quantum computing" \
  --year-start 2020 \
  --year-end 2024
```
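
For the URL-parameter route, the year filter set through the interface appears in the results URL as the `as_ylo` (start year) and `as_yhi` (end year) query parameters. The sketch below builds such a URL with the standard library. It is an illustration only, not the package's implementation: `scholar_url` is a hypothetical helper, and these parameters are observed behaviour rather than a documented API, so they may change.

```python
from urllib.parse import urlencode


def scholar_url(query, year_start=None, year_end=None):
    """Build a Google Scholar results URL with an optional year range (hypothetical helper)."""
    params = {"q": query}
    if year_start is not None:
        params["as_ylo"] = year_start  # lower bound on publication year
    if year_end is not None:
        params["as_yhi"] = year_end    # upper bound on publication year
    return "https://scholar.google.com/scholar?" + urlencode(params)


print(scholar_url("quantum computing", 2020, 2024))
# prints: https://scholar.google.com/scholar?q=quantum+computing&as_ylo=2020&as_yhi=2024
```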

### Sorting Options

**By relevance** (default):
- Google's algorithm determines relevance
- Considers citations, author reputation, publication venue
- Generally good for most searches

**By date**:
- Most recent papers first
- Good for fast-moving fields
- May miss highly cited older papers
- Click "Sort by date" in interface

**By citation count** (via script; the web interface offers no citation sort):
```bash
python scripts/search_google_scholar.py "transformers" \
  --sort-by citations \
  --limit 50
```
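
Results saved earlier can also be re-ranked by citation count without querying again. This is a minimal sketch under an assumption: the record layout (a list of dicts with `title` and `citations` keys) is hypothetical, since the script's actual output schema is not shown here, and `sort_by_citations` is an illustrative name.

```python
def sort_by_citations(papers, limit=None):
    """Return papers ordered by citation count, most cited first.

    Records with a missing or empty "citations" value are treated as 0.
    """
    ranked = sorted(papers, key=lambda p: p.get("citations") or 0, reverse=True)
    return ranked[:limit] if limit is not None else ranked


papers = [
    {"title": "Paper A", "citations": 12},
    {"title": "Paper B", "citations": 340},
    {"title": "Paper C"},  # citation count missing
]
for paper in sort_by_citations(papers, limit=2):
    print(paper["title"])
```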

### Language Filtering

**In interface**:
- Settings → Languages
- Select preferred languages

**Default**: English and papers with English abstracts

## Search Strategies

### Finding Seminal Papers

Identify highly influential papers in a field:

1. **Search by topic** with broad terms
2. **Sort by citations** (most cited first)
3. **Look for review articles** for comprehensive overviews
4. **Check publication dates** for foundational vs recent work

**Example**:
```
"generative adversarial networks"
# Sort by citations
# Top results: original GAN paper (Goodfellow et al., 2014), key variants
```

### Finding Recent Work

Stay current with the latest research:

1. **Search by topic**
2. **Filter to recent years** (last 1-2 years)
3. **Sort by date** for newest first
4. **Set up alerts** for ongoing tracking

**Example**:
```bash
python scripts/search_google_scholar.py "AlphaFold protein structure" \
  --year-start 2023 \
  --year-end 2024 \
  --limit 50
```

### Finding Review Articles

Get comprehensive overviews of a field:

```
intitle:review "machine learning"
"systematic review" CRISPR
intitle:survey "natural language processing"
```

**Indicators**:
- "review", "survey", "perspective" in title
- Often highly cited
- Published in review journals (Nature Reviews, Trends, etc.)
- Comprehensive reference lists

### Citation Chain Search

**Forward citations** (papers citing a key paper):
1. Find seminal paper
2. Click "Cited by X"
3. See all papers that cite it
4. Identify how the field has developed

**Backward citations** (references in a key paper):
1. Find recent review or important paper
2. Check its reference list
3. Identify foundational work
4. Trace development of ideas

**Example workflow**:
```
# Find original transformer paper
"Attention is all you need" author:Vaswani

# Check "Cited by 120,000+"
# See evolution: BERT, GPT, T5, etc.

# Check references in original paper
# Find RNN, LSTM, attention mechanism origins
```

### Comprehensive Literature Search

For thorough coverage (e.g., systematic reviews):

1. **Generate synonym list**:
   - Main terms + alternatives
   - Acronyms + spelled out
   - US vs UK spelling

2. **Use OR operators**:
   ```
   ("machine learning" OR "deep learning" OR "neural networks")
   ```

3. **Combine multiple concepts**:
   ```
   ("machine learning" OR "deep learning") ("drug discovery" OR "drug development")
   ```

4. **Search without date filters** initially:
   - Get total landscape
   - Filter later if too many results

5. **Export results** for systematic analysis:
   ```bash
   python scripts/search_google_scholar.py \
     '"machine learning" OR "deep learning" drug discovery' \
     --limit 500 \
     --output comprehensive_search.json
   ```
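
Steps 1-3 above (synonym lists, OR groups, combined concepts) are mechanical enough to script. The following is a minimal sketch with hypothetical helper names (`or_group`, `combine_concepts`); it assumes every synonym list is non-empty and only builds the query string.

```python
def or_group(terms):
    """Join synonyms with OR; multi-word terms are quoted as exact phrases."""
    quoted = [f'"{t}"' if " " in t else t for t in terms]
    if len(quoted) == 1:
        return quoted[0]
    return "(" + " OR ".join(quoted) + ")"


def combine_concepts(concepts):
    """Combine one OR group per concept; a space between groups acts as AND."""
    return " ".join(or_group(terms) for terms in concepts)


query = combine_concepts([
    ["machine learning", "deep learning"],
    ["drug discovery", "drug development"],
])
print(query)
# prints: ("machine learning" OR "deep learning") ("drug discovery" OR "drug development")
```

Keeping the synonym lists in code (or a small data file) also documents the search strategy, which systematic reviews usually need to report.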

## Extracting Citation Information

### From Google Scholar Results Page

Each result shows:
- **Title**: Paper title (linked to full text if available)
- **Authors**: Author list (often truncated)
- **Source**: Journal/conference, year, publisher
- **Cited by**: Number of citations + link to citing papers
- **Related articles**: Link to similar papers
- **All versions**: Different versions of the same paper

### Export Options

**Manual export**:
1. Click "Cite" under paper
2. Select BibTeX format
3. Copy citation

**Limitations**:
- One paper at a time
- Manual process
- Time-consuming for many papers

**Automated export** (using script):
```bash
# Search and export to BibTeX
python scripts/search_google_scholar.py "quantum computing" \
  --limit 50 \
  --format bibtex \
  --output quantum_papers.bib
```

### Metadata Available

From Google Scholar you can typically extract:
- Title
- Authors (may be incomplete)
- Year
- Source (journal/conference)
- Citation count
- Link to full text (when available)
- Link to PDF (when available)

**Note**: Metadata quality varies:
- Some fields may be missing
- Author names may be incomplete
- Verify against a DOI lookup for accuracy
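
One way to do the DOI check is to fetch the record from the Crossref REST API (`https://api.crossref.org/works/<DOI>`) and compare its title with the one Google Scholar returned. The sketch below covers only the offline parts, building the lookup URL and comparing titles, and leaves the HTTP request to whichever client you already use. `crossref_url`, `normalize_title` and `titles_match` are hypothetical helpers, and the DOI shown is a placeholder.

```python
import re
from urllib.parse import quote

CROSSREF_WORKS = "https://api.crossref.org/works/"


def crossref_url(doi):
    """Return the Crossref works URL for a DOI, percent-encoding unsafe characters."""
    return CROSSREF_WORKS + quote(doi.strip(), safe="/")


def normalize_title(title):
    """Lowercase and collapse punctuation/whitespace so near-identical titles compare equal."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def titles_match(scholar_title, crossref_title):
    """True when two titles are the same after normalization."""
    return normalize_title(scholar_title) == normalize_title(crossref_title)


print(crossref_url("10.1234/example.doi"))  # placeholder DOI, not a real record
print(titles_match("Deep Learning.", "deep  learning"))
```

An exact match after normalization is a deliberately strict test; a mismatch is a prompt to inspect the record by hand, not proof that it is wrong.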

## Rate Limiting and Access

### Rate Limits

Google Scholar rate-limits requests to deter automated scraping:

**Symptoms of rate limiting**:
- CAPTCHA challenges
- Temporary IP blocks
- 429 "Too Many Requests" errors

**Best practices**:
1. **Add delays between requests**: 2-5 seconds minimum
2. **Limit query volume**: Don't run hundreds of queries in rapid succession
3. **Use the `scholarly` library**: A Python interface to Google Scholar with optional proxy support; it does not exempt you from rate limits
4. **Rotate User-Agents**: Appear as different browsers
5. **Consider proxies**: For large-scale searches (use ethically)

**In our scripts**:
```python
import random
import time
# Automatic rate limiting built in
time.sleep(random.uniform(3, 7))  # Random delay of 3-7 seconds
```
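
A fixed random delay spaces out normal requests, but once a request has already been rejected it usually helps to wait progressively longer before retrying. The following is a minimal sketch of exponential backoff with jitter. It is not taken from the package's scripts: `with_backoff` is a hypothetical helper, and `RuntimeError` stands in for whatever exception your fetch function raises on a 429 or CAPTCHA response.

```python
import random
import time


def with_backoff(fetch, max_attempts=4, base_delay=5.0, sleep=time.sleep):
    """Call fetch(); on failure wait base_delay * 2**attempt (plus jitter) and retry.

    The last failure is re-raised once max_attempts is exhausted.
    """
    for attempt in range(max_attempts):
        try:
            return fetch()
        except RuntimeError:  # placeholder for a rate-limit error
            if attempt == max_attempts - 1:
                raise
            delay = base_delay * (2 ** attempt) + random.uniform(0, 1)
            sleep(delay)


# Usage: wrap any zero-argument callable that performs one request
# result = with_backoff(lambda: run_search("quantum computing"))  # run_search is hypothetical
```

The `sleep` parameter exists so the waiting behaviour can be replaced in tests without actually pausing.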

### Ethical Considerations

**DO**:
- Respect rate limits
- Use reasonable delays
- Cache results (don't re-query)
- Use official APIs when available
- Attribute data properly

**DON'T**:
- Scrape aggressively
- Use multiple IPs to bypass limits
- Violate terms of service
- Burden servers unnecessarily
- Use data commercially without permission
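
The "cache results" advice can be implemented with a small on-disk cache keyed by the query text. This is a minimal sketch, not the package's own mechanism: `cached_search`, the `scholar_cache` directory name, and the `search_fn` callable are all assumptions made for illustration, and results are assumed to be JSON-serializable.

```python
import hashlib
import json
from pathlib import Path


def cached_search(query, search_fn, cache_dir="scholar_cache"):
    """Return cached results for query if present; otherwise call search_fn and store them."""
    cache = Path(cache_dir)
    cache.mkdir(parents=True, exist_ok=True)
    # A short hash of the query gives a filesystem-safe cache key
    key = hashlib.sha256(query.encode("utf-8")).hexdigest()[:16]
    path = cache / f"{key}.json"
    if path.exists():
        return json.loads(path.read_text(encoding="utf-8"))
    results = search_fn(query)
    path.write_text(json.dumps(results), encoding="utf-8")
    return results
```

Repeating the same query then reads from disk and sends no request; delete the cache file when fresh results are needed.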

### Institutional Access

**Benefits of institutional access**:
- Access to full-text PDFs through library subscriptions
- Better download capabilities
- Integration with library systems
- Link resolver to full text

**Setup**:
- Google Scholar → Settings → Library links
- Add your institution
- Links appear in search results

## Tips and Best Practices

### Search Optimization

1. **Start simple, then refine**:
   ```
   # Too specific initially (on top of a 2023-2024 year filter)
   intitle:"deep learning" intitle:review source:Nature

   # Better approach
   deep learning review
   # Review results
   # Add intitle:, source:, year filters as needed
   ```

2. **Use multiple search strategies**:
   - Keyword search
   - Author search for known experts
   - Citation chaining from key papers
   - Source search in top journals

3. **Check spelling and variations**:
   - Color vs colour
   - Optimization vs optimisation
   - Tumor vs tumour
   - Try common misspellings if few results

4. **Combine operators strategically**:
   ```
   # Good combination
   author:Church intitle:"synthetic biology"

   # Finds papers by a specific author with the topic in the title;
   # restrict to 2015-2024 with the sidebar or --year-start/--year-end
   ```

### Result Evaluation

1. **Check citation counts**:
   - High citations indicate influence
   - Recent papers may have low citations but be important
   - Citation counts vary by field

2. **Verify publication venue**:
   - Peer-reviewed journals vs preprints
   - Conference proceedings
   - Book chapters
   - Technical reports

3. **Check for full text access**:
   - [PDF] link on right side
   - "All X versions" may have open access version
   - Check institutional access
   - Try author's website or ResearchGate

4. **Look for review articles**:
   - Comprehensive overviews
   - Good starting point for new topics
   - Extensive reference lists
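
Because recent papers have had less time to accumulate citations, raw counts favour older work. One simple adjustment, offered here as an illustration and not as a standard metric used by Google Scholar, is to divide the count by the paper's age in years. `citations_per_year` is a hypothetical helper.

```python
def citations_per_year(citations, pub_year, current_year):
    """Average citations per year since publication.

    The publication year counts as year 1, so a paper from the current
    year is divided by 1 and never by 0.
    """
    age = max(current_year - pub_year + 1, 1)
    return citations / age


print(citations_per_year(100, 2015, 2024))  # 100 citations over 10 years
print(citations_per_year(30, 2024, 2024))   # 30 citations in the first year
```

In this example the 2024 paper has fewer total citations but a higher yearly rate, which is the pattern the second bullet under "Check citation counts" warns about.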

### Managing Results

1. **Use citation manager integration**:
   - Export to BibTeX
   - Import to Zotero, Mendeley, EndNote
   - Maintain organized library

2. **Set up alerts** for ongoing research:
   - Google Scholar → Alerts
   - Get emails for new papers matching query
   - Track specific authors or topics

3. **Create collections**:
   - Save papers to Google Scholar Library
   - Organize by project or topic
   - Add labels and notes

4. **Export systematically**:
   ```bash
   # Save search results for later analysis
   python scripts/search_google_scholar.py "your topic" \
     --output topic_papers.json

   # Can re-process later without re-searching
   python scripts/extract_metadata.py \
     --input topic_papers.json \
     --output topic_refs.bib
   ```

## Advanced Techniques

### Boolean Logic Combinations

Combine multiple operators for precise searches:

```
# Reviews on a specific topic by known authors
# (add a 2020-2024 year filter and sort by citations separately)
intitle:review "machine learning" ("drug discovery" OR "drug development") author:Horvath OR author:Bengio

# Method papers excluding reviews
intitle:method "protein folding" -review -survey

# Papers in top journals only (add a 2022-2024 year filter separately)
CRISPR source:Nature OR source:Science OR source:Cell
```

### Finding Open Access Papers

```
# Search with generic terms
machine learning

# Open "All versions", which often includes preprints
# Look for [PDF] links (often open access)
# Check arXiv, bioRxiv versions
```

**In script**:
```bash
python scripts/search_google_scholar.py "topic" \
  --open-access-only \
  --output open_access_papers.json
```

### Tracking Research Impact

**For a specific paper**:
1. Find the paper
2. Click "Cited by X"
3. Analyze citing papers:
   - How is it being used?
   - What fields cite it?
   - Recent vs older citations?

**For an author**:
1. Search `author:LastName`
2. Open the author's Google Scholar profile, if one exists, and check the h-index and i10-index
3. View citation history graph
4. Identify most influential papers

**For a topic**:
1. Search topic
2. Sort by citations
3. Identify seminal papers (highly cited, older)
4. Check recent highly-cited papers (emerging important work)
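
When an author has no profile page, the two indices can be computed from a list of per-paper citation counts. The sketch below uses the standard definitions: the h-index is the largest h such that h papers have at least h citations each, and the i10-index is the number of papers with at least 10 citations. The function names are illustrative.

```python
def h_index(citation_counts):
    """Largest h such that h papers have at least h citations each."""
    ranked = sorted(citation_counts, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def i10_index(citation_counts):
    """Number of papers with at least 10 citations."""
    return sum(1 for count in citation_counts if count >= 10)


counts = [25, 8, 5, 3, 3]
print(h_index(counts), i10_index(counts))
# prints: 3 1
```

Values computed this way depend on which papers and counts you collected, so they may differ from the figures shown on a Google Scholar profile.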

### Finding Preprints and Early Work

```
# arXiv papers
source:arxiv "deep learning"

# bioRxiv papers
source:biorxiv CRISPR

# All preprint servers
("arxiv" OR "biorxiv" OR "medrxiv") your topic
```

**Note**: Preprints are not peer-reviewed. Always check whether a published version exists.

## Common Issues and Solutions

### Too Many Results

**Problem**: Search returns 100,000+ results, which is too many to review.

**Solutions**:
1. Add more specific terms
2. Use `intitle:` to search only titles
3. Filter by recent years
4. Add exclusions (e.g., `-review`)
5. Search within specific journals

### Too Few Results

**Problem**: Search returns 0-10 results, suspiciously few.

**Solutions**:
1. Remove restrictive operators
2. Try synonyms and related terms
3. Check spelling
4. Broaden year range
5. Use OR for alternative terms

### Irrelevant Results

**Problem**: Results don't match intent.

**Solutions**:
1. Use exact phrases with quotes
2. Add more specific context terms
3. Use `intitle:` for title-only search
4. Exclude common irrelevant terms
5. Combine multiple specific terms

### CAPTCHA or Rate Limiting

**Problem**: Google Scholar shows a CAPTCHA or blocks access.

**Solutions**:
1. Wait several minutes before continuing
2. Reduce query frequency
3. Use longer delays in scripts (5-10 seconds)
4. Switch to a different IP/network
5. Consider using institutional access

### Missing Metadata

**Problem**: Author names, year, or venue missing from results.

**Solutions**:
1. Click through to see full details
2. Check "All versions" for better metadata
3. Look up by DOI if available
4. Extract metadata from CrossRef/PubMed instead
5. Manually verify from paper PDF
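
Before working through these steps by hand, it can help to list which saved records are actually incomplete. This is a minimal sketch under an assumption: the field names in `REQUIRED_FIELDS` are hypothetical and should be adjusted to match whatever keys your exported records use.

```python
REQUIRED_FIELDS = ("title", "authors", "year", "venue")  # assumed key names


def missing_fields(record, required=REQUIRED_FIELDS):
    """Return the required fields that are absent or empty in a record."""
    return [field for field in required if not record.get(field)]


record = {"title": "Example paper", "authors": [], "year": 2021}
print(missing_fields(record))
# prints: ['authors', 'venue']
```

Records with a non-empty result are the ones worth sending to a DOI, CrossRef, or PubMed lookup.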

### Duplicate Results

**Problem**: Same paper appears multiple times.

**Solutions**:
1. Click "All X versions" to see consolidated view
2. Choose version with best metadata
3. Use deduplication in post-processing:
   ```bash
   python scripts/format_bibtex.py results.bib \
     --deduplicate \
     --output clean_results.bib
   ```
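
For JSON results that have not yet been converted to BibTeX, duplicates can be collapsed by comparing normalized titles and keeping the record with the most filled-in fields. This sketch illustrates that idea only; it is not a description of how `format_bibtex.py --deduplicate` works, and the helper names and the `title` key are assumptions.

```python
import re


def normalize_title(title):
    """Lowercase and collapse punctuation/whitespace for comparison."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def filled(paper):
    """Count the fields in a record that have a non-empty value."""
    return sum(1 for value in paper.values() if value)


def deduplicate(papers):
    """Keep one record per normalized title, preferring the most complete one."""
    best = {}
    for paper in papers:
        key = normalize_title(paper.get("title", ""))
        kept = best.get(key)
        if kept is None or filled(paper) > filled(kept):
            best[key] = paper
    return list(best.values())


papers = [
    {"title": "Attention Is All You Need", "year": None},
    {"title": "Attention is all you need.", "year": 2017, "venue": "NeurIPS"},
]
print(len(deduplicate(papers)))
# prints: 1
```

Title matching will merge distinct papers that share a title and miss duplicates whose titles differ, so comparing DOIs is safer whenever they are available.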

## Integration with Scripts

### search_google_scholar.py Usage

**Basic search**:
```bash
python scripts/search_google_scholar.py "machine learning drug discovery"
```

**With year filter**:
```bash
python scripts/search_google_scholar.py "CRISPR" \
  --year-start 2020 \
  --year-end 2024 \
  --limit 100
```

**Sort by citations**:
```bash
python scripts/search_google_scholar.py "transformers" \
  --sort-by citations \
  --limit 50
```

**Export to BibTeX**:
```bash
python scripts/search_google_scholar.py "quantum computing" \
  --format bibtex \
  --output quantum.bib
```

**Export to JSON for later processing**:
```bash
python scripts/search_google_scholar.py "topic" \
  --format json \
  --output results.json

# Later: extract full metadata
python scripts/extract_metadata.py \
  --input results.json \
  --output references.bib
```

### Batch Searching

For multiple topics:

```bash
# Create file with search queries (queries.txt)
# One query per line

# Search each query (IFS= and -r keep whitespace and backslashes intact)
while IFS= read -r query; do
  [ -z "$query" ] && continue  # Skip blank lines
  python scripts/search_google_scholar.py "$query" \
    --limit 50 \
    --output "${query// /_}.json"
  sleep 10  # Delay between queries
done < queries.txt
```
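
The `${query// /_}` substitution only replaces spaces, so queries containing quotes, colons, or slashes still produce awkward or invalid filenames. If the batch loop is driven from Python instead, a small slug function avoids that. This is a hedged sketch; `query_to_filename` is a hypothetical helper and not part of the package.

```python
import re


def query_to_filename(query, suffix=".json"):
    """Turn a search query into a lowercase, filesystem-safe filename."""
    # Replace every run of non-alphanumeric characters with one underscore
    slug = re.sub(r"[^A-Za-z0-9]+", "_", query).strip("_").lower()
    return (slug or "query") + suffix


print(query_to_filename('"machine learning" OR "deep learning"'))
# prints: machine_learning_or_deep_learning.json
```

Two different queries can map to the same slug (for example, queries that differ only in punctuation), so check for collisions before overwriting an existing file.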

## Summary

Google Scholar is one of the most comprehensive academic search engines, providing:

✓ **Broad coverage**: All disciplines, 100M+ documents
✓ **Free access**: No account or subscription required
✓ **Citation tracking**: "Cited by" for impact analysis
✓ **Multiple formats**: Articles, books, theses, patents
✓ **Full-text search**: Not just abstracts

Key strategies:
- Use advanced operators for precision
- Combine author, title, source searches
- Track citations for impact
- Export systematically to citation manager
- Respect rate limits and access policies
- Verify metadata with CrossRef/PubMed

For biomedical research, complement with PubMed for MeSH terms and curated metadata.