@wanshi-kg/wanshi 0.1.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/LICENSE +21 -0
- package/README.md +458 -0
- package/dist/__tests__/helpers.js +27 -0
- package/dist/__tests__/helpers.js.map +1 -0
- package/dist/cli/commands/export.command.js +99 -0
- package/dist/cli/commands/export.command.js.map +1 -0
- package/dist/cli/commands/index.js +22 -0
- package/dist/cli/commands/index.js.map +1 -0
- package/dist/cli/commands/inspectMerges.command.js +84 -0
- package/dist/cli/commands/inspectMerges.command.js.map +1 -0
- package/dist/cli/commands/metrics.command.js +196 -0
- package/dist/cli/commands/metrics.command.js.map +1 -0
- package/dist/cli/commands/process.command.js +82 -0
- package/dist/cli/commands/process.command.js.map +1 -0
- package/dist/cli/commands/watch.command.js +91 -0
- package/dist/cli/commands/watch.command.js.map +1 -0
- package/dist/cli/index.js +269 -0
- package/dist/cli/index.js.map +1 -0
- package/dist/cli/optionsToConfig.js +160 -0
- package/dist/cli/optionsToConfig.js.map +1 -0
- package/dist/config/index.js +59 -0
- package/dist/config/index.js.map +1 -0
- package/dist/config/legacyHints.js +113 -0
- package/dist/config/legacyHints.js.map +1 -0
- package/dist/config/schema.js +803 -0
- package/dist/config/schema.js.map +1 -0
- package/dist/config/ui.js +221 -0
- package/dist/config/ui.js.map +1 -0
- package/dist/core/DirectoryProcessor.js +725 -0
- package/dist/core/DirectoryProcessor.js.map +1 -0
- package/dist/core/adapters/IStructuredAdapter.js +3 -0
- package/dist/core/adapters/IStructuredAdapter.js.map +1 -0
- package/dist/core/adapters/SqliteAdapter.js +267 -0
- package/dist/core/adapters/SqliteAdapter.js.map +1 -0
- package/dist/core/adapters/StructuredAdapterRegistry.js +31 -0
- package/dist/core/adapters/StructuredAdapterRegistry.js.map +1 -0
- package/dist/core/adapters/index.js +20 -0
- package/dist/core/adapters/index.js.map +1 -0
- package/dist/core/checkpoint/CheckpointService.js +188 -0
- package/dist/core/checkpoint/CheckpointService.js.map +1 -0
- package/dist/core/checkpoint/index.js +18 -0
- package/dist/core/checkpoint/index.js.map +1 -0
- package/dist/core/corpus/CorpusAnalyzer.js +266 -0
- package/dist/core/corpus/CorpusAnalyzer.js.map +1 -0
- package/dist/core/corpus/CorpusProfileStore.js +92 -0
- package/dist/core/corpus/CorpusProfileStore.js.map +1 -0
- package/dist/core/corpus/index.js +21 -0
- package/dist/core/corpus/index.js.map +1 -0
- package/dist/core/corpus/normalizeGlossary.js +60 -0
- package/dist/core/corpus/normalizeGlossary.js.map +1 -0
- package/dist/core/corpus/relPath.js +52 -0
- package/dist/core/corpus/relPath.js.map +1 -0
- package/dist/core/corpus/termFrequency.js +86 -0
- package/dist/core/corpus/termFrequency.js.map +1 -0
- package/dist/core/cost/CostMeter.js +235 -0
- package/dist/core/cost/CostMeter.js.map +1 -0
- package/dist/core/cost/index.js +19 -0
- package/dist/core/cost/index.js.map +1 -0
- package/dist/core/cost/prices.js +38 -0
- package/dist/core/cost/prices.js.map +1 -0
- package/dist/core/cv/ObjectDetectionService.js +119 -0
- package/dist/core/cv/ObjectDetectionService.js.map +1 -0
- package/dist/core/di/ContainerFactory.js +670 -0
- package/dist/core/di/ContainerFactory.js.map +1 -0
- package/dist/core/di/DIContainer.js +103 -0
- package/dist/core/di/DIContainer.js.map +1 -0
- package/dist/core/di/index.js +19 -0
- package/dist/core/di/index.js.map +1 -0
- package/dist/core/errors/CustomErrors.js +342 -0
- package/dist/core/errors/CustomErrors.js.map +1 -0
- package/dist/core/errors/index.js +18 -0
- package/dist/core/errors/index.js.map +1 -0
- package/dist/core/export/KnowledgeGraphExportService.js +56 -0
- package/dist/core/export/KnowledgeGraphExportService.js.map +1 -0
- package/dist/core/export/index.js +19 -0
- package/dist/core/export/index.js.map +1 -0
- package/dist/core/export/strategies/GraphitiExportStrategy.js +115 -0
- package/dist/core/export/strategies/GraphitiExportStrategy.js.map +1 -0
- package/dist/core/export/strategies/GraphvizDotExportStrategy.js +331 -0
- package/dist/core/export/strategies/GraphvizDotExportStrategy.js.map +1 -0
- package/dist/core/export/strategies/IExportStrategy.js +3 -0
- package/dist/core/export/strategies/IExportStrategy.js.map +1 -0
- package/dist/core/export/strategies/JsonExportStrategy.js +19 -0
- package/dist/core/export/strategies/JsonExportStrategy.js.map +1 -0
- package/dist/core/export/strategies/JsonlExportStrategy.js +69 -0
- package/dist/core/export/strategies/JsonlExportStrategy.js.map +1 -0
- package/dist/core/export/strategies/KblamExportStrategy.js +36 -0
- package/dist/core/export/strategies/KblamExportStrategy.js.map +1 -0
- package/dist/core/export/strategies/LoraExportStrategy.js +46 -0
- package/dist/core/export/strategies/LoraExportStrategy.js.map +1 -0
- package/dist/core/export/strategies/McpExportStrategy.js +67 -0
- package/dist/core/export/strategies/McpExportStrategy.js.map +1 -0
- package/dist/core/export/strategies/index.js +25 -0
- package/dist/core/export/strategies/index.js.map +1 -0
- package/dist/core/export/strategies/kbTriples.js +60 -0
- package/dist/core/export/strategies/kbTriples.js.map +1 -0
- package/dist/core/index.js +22 -0
- package/dist/core/index.js.map +1 -0
- package/dist/core/knowledge/KnowledgeGraphBuilder.js +627 -0
- package/dist/core/knowledge/KnowledgeGraphBuilder.js.map +1 -0
- package/dist/core/knowledge/MergeRecord.js +3 -0
- package/dist/core/knowledge/MergeRecord.js.map +1 -0
- package/dist/core/knowledge/canon/Canonicalizer.js +414 -0
- package/dist/core/knowledge/canon/Canonicalizer.js.map +1 -0
- package/dist/core/knowledge/canon/index.js +18 -0
- package/dist/core/knowledge/canon/index.js.map +1 -0
- package/dist/core/knowledge/contradiction/HeuristicContradictionChecker.js +92 -0
- package/dist/core/knowledge/contradiction/HeuristicContradictionChecker.js.map +1 -0
- package/dist/core/knowledge/contradiction/LlmContradictionChecker.js +52 -0
- package/dist/core/knowledge/contradiction/LlmContradictionChecker.js.map +1 -0
- package/dist/core/knowledge/contradiction/index.js +19 -0
- package/dist/core/knowledge/contradiction/index.js.map +1 -0
- package/dist/core/knowledge/grounding/KeywordGroundingChecker.js +33 -0
- package/dist/core/knowledge/grounding/KeywordGroundingChecker.js.map +1 -0
- package/dist/core/knowledge/grounding/MiniCheckGroundingChecker.js +82 -0
- package/dist/core/knowledge/grounding/MiniCheckGroundingChecker.js.map +1 -0
- package/dist/core/knowledge/grounding/index.js +20 -0
- package/dist/core/knowledge/grounding/index.js.map +1 -0
- package/dist/core/knowledge/grounding/verbalize.js +38 -0
- package/dist/core/knowledge/grounding/verbalize.js.map +1 -0
- package/dist/core/knowledge/images/imageMetaGraph.js +136 -0
- package/dist/core/knowledge/images/imageMetaGraph.js.map +1 -0
- package/dist/core/knowledge/index.js +20 -0
- package/dist/core/knowledge/index.js.map +1 -0
- package/dist/core/knowledge/merging/KnowledgeMerger.js +624 -0
- package/dist/core/knowledge/merging/KnowledgeMerger.js.map +1 -0
- package/dist/core/knowledge/references/ReferenceResolver.js +184 -0
- package/dist/core/knowledge/references/ReferenceResolver.js.map +1 -0
- package/dist/core/knowledge/references/citations/CitationEvidenceProcessor.js +401 -0
- package/dist/core/knowledge/references/citations/CitationEvidenceProcessor.js.map +1 -0
- package/dist/core/knowledge/references/citations/CitationResolver.js +95 -0
- package/dist/core/knowledge/references/citations/CitationResolver.js.map +1 -0
- package/dist/core/knowledge/references/citations/GrobidClient.js +143 -0
- package/dist/core/knowledge/references/citations/GrobidClient.js.map +1 -0
- package/dist/core/knowledge/references/citations/TitleIdResolver.js +101 -0
- package/dist/core/knowledge/references/citations/TitleIdResolver.js.map +1 -0
- package/dist/core/knowledge/references/web/FetchCacheService.js +114 -0
- package/dist/core/knowledge/references/web/FetchCacheService.js.map +1 -0
- package/dist/core/knowledge/references/web/GatedFetcher.js +228 -0
- package/dist/core/knowledge/references/web/GatedFetcher.js.map +1 -0
- package/dist/core/knowledge/references/web/WebReferenceProcessor.js +164 -0
- package/dist/core/knowledge/references/web/WebReferenceProcessor.js.map +1 -0
- package/dist/core/knowledge/search/KnowledgeGraphSearch.js +261 -0
- package/dist/core/knowledge/search/KnowledgeGraphSearch.js.map +1 -0
- package/dist/core/knowledge/vocabulary.js +162 -0
- package/dist/core/knowledge/vocabulary.js.map +1 -0
- package/dist/core/llm/EmbeddingService.js +113 -0
- package/dist/core/llm/EmbeddingService.js.map +1 -0
- package/dist/core/llm/OllamaService.js +146 -0
- package/dist/core/llm/OllamaService.js.map +1 -0
- package/dist/core/llm/OpenAICompatibleService.js +190 -0
- package/dist/core/llm/OpenAICompatibleService.js.map +1 -0
- package/dist/core/llm/OpenAIEmbeddingService.js +129 -0
- package/dist/core/llm/OpenAIEmbeddingService.js.map +1 -0
- package/dist/core/llm/embeddingUtils.js +25 -0
- package/dist/core/llm/embeddingUtils.js.map +1 -0
- package/dist/core/llm/index.js +23 -0
- package/dist/core/llm/index.js.map +1 -0
- package/dist/core/llm/prompts/PromptManager.js +388 -0
- package/dist/core/llm/prompts/PromptManager.js.map +1 -0
- package/dist/core/llm/prompts/PromptTemplateEngine.js +257 -0
- package/dist/core/llm/prompts/PromptTemplateEngine.js.map +1 -0
- package/dist/core/llm/prompts/templates/partials/examples/EXAMPLE_STYLE_GUIDE.md +84 -0
- package/dist/core/llm/prompts/templates/partials/examples/article.md +187 -0
- package/dist/core/llm/prompts/templates/partials/examples/code.md +229 -0
- package/dist/core/llm/prompts/templates/partials/examples/communication.md +205 -0
- package/dist/core/llm/prompts/templates/partials/examples/documentation.md +262 -0
- package/dist/core/llm/prompts/templates/partials/examples/financial.md +157 -0
- package/dist/core/llm/prompts/templates/partials/examples/legal.md +153 -0
- package/dist/core/llm/prompts/templates/partials/examples/logs.md +127 -0
- package/dist/core/llm/prompts/templates/partials/examples/medical.md +218 -0
- package/dist/core/llm/prompts/templates/partials/examples/notes.md +201 -0
- package/dist/core/llm/prompts/templates/partials/examples/research.md +208 -0
- package/dist/core/llm/prompts/templates/partials/examples/tabular.md +178 -0
- package/dist/core/llm/prompts/templates/partials/examples/transcript.md +204 -0
- package/dist/core/llm/prompts/templates/partials/retrieved-context.hbs +18 -0
- package/dist/core/llm/prompts/templates/v1/system.hbs +371 -0
- package/dist/core/llm/prompts/templates/v1/user.hbs +20 -0
- package/dist/core/llm/prompts/templates/v2/system.hbs +573 -0
- package/dist/core/llm/prompts/templates/v2/user.hbs +20 -0
- package/dist/core/llm/prompts/templates/v3/system.hbs +861 -0
- package/dist/core/llm/prompts/templates/v3/user.hbs +16 -0
- package/dist/core/llm/prompts/templates/v4/system.hbs +800 -0
- package/dist/core/llm/prompts/templates/v4/user.hbs +40 -0
- package/dist/core/llm/prompts/templates/v4.5/system.hbs +71 -0
- package/dist/core/llm/prompts/templates/v4.5/user.hbs +46 -0
- package/dist/core/llm/prompts/templates/v5/glossary/system.hbs +40 -0
- package/dist/core/llm/prompts/templates/v5/glossary/user.hbs +11 -0
- package/dist/core/llm/prompts/templates/v5/system.hbs +163 -0
- package/dist/core/llm/prompts/templates/v5/user.hbs +55 -0
- package/dist/core/pipeline/GroundingTransform.js +52 -0
- package/dist/core/pipeline/GroundingTransform.js.map +1 -0
- package/dist/core/pipeline/PipelineRunner.js +51 -0
- package/dist/core/pipeline/PipelineRunner.js.map +1 -0
- package/dist/core/pipeline/RelationFilterTransform.js +72 -0
- package/dist/core/pipeline/RelationFilterTransform.js.map +1 -0
- package/dist/core/pipeline/index.js +20 -0
- package/dist/core/pipeline/index.js.map +1 -0
- package/dist/core/processor/FileProcessor.js +184 -0
- package/dist/core/processor/FileProcessor.js.map +1 -0
- package/dist/core/processor/ProcessedRegistry.js +38 -0
- package/dist/core/processor/ProcessedRegistry.js.map +1 -0
- package/dist/core/processor/ast/AstSeedService.js +0 -0
- package/dist/core/processor/ast/AstSeedService.js.map +1 -0
- package/dist/core/processor/ast/AstSymbolStore.js +110 -0
- package/dist/core/processor/ast/AstSymbolStore.js.map +1 -0
- package/dist/core/processor/ast/index.js +19 -0
- package/dist/core/processor/ast/index.js.map +1 -0
- package/dist/core/processor/chunking/TextChunker.js +98 -0
- package/dist/core/processor/chunking/TextChunker.js.map +1 -0
- package/dist/core/processor/chunking/index.js +18 -0
- package/dist/core/processor/chunking/index.js.map +1 -0
- package/dist/core/processor/classifier/CONTENT_CLASSES.js +294 -0
- package/dist/core/processor/classifier/CONTENT_CLASSES.js.map +1 -0
- package/dist/core/processor/classifier/CascadeContentClassifier.js +107 -0
- package/dist/core/processor/classifier/CascadeContentClassifier.js.map +1 -0
- package/dist/core/processor/classifier/HeuristicContentClassifier.js +113 -0
- package/dist/core/processor/classifier/HeuristicContentClassifier.js.map +1 -0
- package/dist/core/processor/classifier/IContentTypeClassifier.js +3 -0
- package/dist/core/processor/classifier/IContentTypeClassifier.js.map +1 -0
- package/dist/core/processor/classifier/LlmContentClassifier.js +107 -0
- package/dist/core/processor/classifier/LlmContentClassifier.js.map +1 -0
- package/dist/core/processor/classifier/NER_DOMAIN_EXAMPLES.js +498 -0
- package/dist/core/processor/classifier/NER_DOMAIN_EXAMPLES.js.map +1 -0
- package/dist/core/processor/classifier/index.js +21 -0
- package/dist/core/processor/classifier/index.js.map +1 -0
- package/dist/core/processor/classifier/mergeClassifications.js +32 -0
- package/dist/core/processor/classifier/mergeClassifications.js.map +1 -0
- package/dist/core/processor/index.js +20 -0
- package/dist/core/processor/index.js.map +1 -0
- package/dist/core/processor/readers/AudioReader.js +462 -0
- package/dist/core/processor/readers/AudioReader.js.map +1 -0
- package/dist/core/processor/readers/BinaryReader.js +90 -0
- package/dist/core/processor/readers/BinaryReader.js.map +1 -0
- package/dist/core/processor/readers/ChandraPdfReader.js +187 -0
- package/dist/core/processor/readers/ChandraPdfReader.js.map +1 -0
- package/dist/core/processor/readers/ChatExportReader.js +365 -0
- package/dist/core/processor/readers/ChatExportReader.js.map +1 -0
- package/dist/core/processor/readers/DoclingReader.js +445 -0
- package/dist/core/processor/readers/DoclingReader.js.map +1 -0
- package/dist/core/processor/readers/EmailReader.js +259 -0
- package/dist/core/processor/readers/EmailReader.js.map +1 -0
- package/dist/core/processor/readers/EpubReader.js +175 -0
- package/dist/core/processor/readers/EpubReader.js.map +1 -0
- package/dist/core/processor/readers/FileReader.js +90 -0
- package/dist/core/processor/readers/FileReader.js.map +1 -0
- package/dist/core/processor/readers/FileReaderFactory.js +49 -0
- package/dist/core/processor/readers/FileReaderFactory.js.map +1 -0
- package/dist/core/processor/readers/HtmlReader.js +371 -0
- package/dist/core/processor/readers/HtmlReader.js.map +1 -0
- package/dist/core/processor/readers/ImageReader.js +162 -0
- package/dist/core/processor/readers/ImageReader.js.map +1 -0
- package/dist/core/processor/readers/JsonFileReader.js +232 -0
- package/dist/core/processor/readers/JsonFileReader.js.map +1 -0
- package/dist/core/processor/readers/JupyterReader.js +178 -0
- package/dist/core/processor/readers/JupyterReader.js.map +1 -0
- package/dist/core/processor/readers/LatexReader.js +176 -0
- package/dist/core/processor/readers/LatexReader.js.map +1 -0
- package/dist/core/processor/readers/MarkdownReader.js +289 -0
- package/dist/core/processor/readers/MarkdownReader.js.map +1 -0
- package/dist/core/processor/readers/MarkerPdfReader.js +193 -0
- package/dist/core/processor/readers/MarkerPdfReader.js.map +1 -0
- package/dist/core/processor/readers/MistralOcrReader.js +198 -0
- package/dist/core/processor/readers/MistralOcrReader.js.map +1 -0
- package/dist/core/processor/readers/OfficeReader.js +174 -0
- package/dist/core/processor/readers/OfficeReader.js.map +1 -0
- package/dist/core/processor/readers/PdfReader.js +116 -0
- package/dist/core/processor/readers/PdfReader.js.map +1 -0
- package/dist/core/processor/readers/RtfReader.js +107 -0
- package/dist/core/processor/readers/RtfReader.js.map +1 -0
- package/dist/core/processor/readers/SubtitleReader.js +145 -0
- package/dist/core/processor/readers/SubtitleReader.js.map +1 -0
- package/dist/core/processor/readers/TesseractPdfReader.js +183 -0
- package/dist/core/processor/readers/TesseractPdfReader.js.map +1 -0
- package/dist/core/processor/readers/TextReader.js +129 -0
- package/dist/core/processor/readers/TextReader.js.map +1 -0
- package/dist/core/processor/readers/TranscriptReader.js +234 -0
- package/dist/core/processor/readers/TranscriptReader.js.map +1 -0
- package/dist/core/processor/readers/image/imageMetadata.js +155 -0
- package/dist/core/processor/readers/image/imageMetadata.js.map +1 -0
- package/dist/core/processor/readers/index.js +41 -0
- package/dist/core/processor/readers/index.js.map +1 -0
- package/dist/core/processor/readers/referenceExtraction.js +198 -0
- package/dist/core/processor/readers/referenceExtraction.js.map +1 -0
- package/dist/core/processor/readers/stripReferences.js +59 -0
- package/dist/core/processor/readers/stripReferences.js.map +1 -0
- package/dist/core/processor/readers/transcript/turnPacking.js +81 -0
- package/dist/core/processor/readers/transcript/turnPacking.js.map +1 -0
- package/dist/core/progress/NdjsonProgressEmitter.js +30 -0
- package/dist/core/progress/NdjsonProgressEmitter.js.map +1 -0
- package/dist/core/progress/NoopProgressEmitter.js +15 -0
- package/dist/core/progress/NoopProgressEmitter.js.map +1 -0
- package/dist/core/progress/index.js +19 -0
- package/dist/core/progress/index.js.map +1 -0
- package/dist/core/trace/TraceWriter.js +100 -0
- package/dist/core/trace/TraceWriter.js.map +1 -0
- package/dist/core/trace/events.js +13 -0
- package/dist/core/trace/events.js.map +1 -0
- package/dist/core/trace/index.js +20 -0
- package/dist/core/trace/index.js.map +1 -0
- package/dist/core/trace/lineage.js +97 -0
- package/dist/core/trace/lineage.js.map +1 -0
- package/dist/evaluation/BenchmarkRunner.js +171 -0
- package/dist/evaluation/BenchmarkRunner.js.map +1 -0
- package/dist/evaluation/classifier/ClassifierAccuracy.js +185 -0
- package/dist/evaluation/classifier/ClassifierAccuracy.js.map +1 -0
- package/dist/evaluation/classifier/labeledSamples.js +379 -0
- package/dist/evaluation/classifier/labeledSamples.js.map +1 -0
- package/dist/evaluation/compare/goldCompare.js +126 -0
- package/dist/evaluation/compare/goldCompare.js.map +1 -0
- package/dist/evaluation/crossre/compareScoring.js +30 -0
- package/dist/evaluation/crossre/compareScoring.js.map +1 -0
- package/dist/evaluation/datasets/CrossREDataset.js +170 -0
- package/dist/evaluation/datasets/CrossREDataset.js.map +1 -0
- package/dist/evaluation/datasets/IDataset.js +3 -0
- package/dist/evaluation/datasets/IDataset.js.map +1 -0
- package/dist/evaluation/datasets/RebelDataset.js +117 -0
- package/dist/evaluation/datasets/RebelDataset.js.map +1 -0
- package/dist/evaluation/datasets/RedocredDataset.js +218 -0
- package/dist/evaluation/datasets/RedocredDataset.js.map +1 -0
- package/dist/evaluation/datasets/SemEval2010Dataset.js +150 -0
- package/dist/evaluation/datasets/SemEval2010Dataset.js.map +1 -0
- package/dist/evaluation/index.js +33 -0
- package/dist/evaluation/index.js.map +1 -0
- package/dist/evaluation/matching/ExactMatcher.js +75 -0
- package/dist/evaluation/matching/ExactMatcher.js.map +1 -0
- package/dist/evaluation/matching/SemanticMatcher.js +143 -0
- package/dist/evaluation/matching/SemanticMatcher.js.map +1 -0
- package/dist/evaluation/metrics/TripleMetrics.js +64 -0
- package/dist/evaluation/metrics/TripleMetrics.js.map +1 -0
- package/dist/evaluation/mine/MineCheckpoint.js +114 -0
- package/dist/evaluation/mine/MineCheckpoint.js.map +1 -0
- package/dist/evaluation/mine/MineDataset.js +208 -0
- package/dist/evaluation/mine/MineDataset.js.map +1 -0
- package/dist/evaluation/mine/MineReporter.js +98 -0
- package/dist/evaluation/mine/MineReporter.js.map +1 -0
- package/dist/evaluation/mine/MineRunner.js +148 -0
- package/dist/evaluation/mine/MineRunner.js.map +1 -0
- package/dist/evaluation/mine/MineScorer.js +127 -0
- package/dist/evaluation/mine/MineScorer.js.map +1 -0
- package/dist/evaluation/mine/types.js +12 -0
- package/dist/evaluation/mine/types.js.map +1 -0
- package/dist/evaluation/reporters/ConsoleReporter.js +55 -0
- package/dist/evaluation/reporters/ConsoleReporter.js.map +1 -0
- package/dist/evaluation/reporters/JsonReporter.js +50 -0
- package/dist/evaluation/reporters/JsonReporter.js.map +1 -0
- package/dist/index.js +28 -0
- package/dist/index.js.map +1 -0
- package/dist/quality/CompositeScore.js +61 -0
- package/dist/quality/CompositeScore.js.map +1 -0
- package/dist/quality/ConsistencyMetrics.js +70 -0
- package/dist/quality/ConsistencyMetrics.js.map +1 -0
- package/dist/quality/FactualMetrics.js +76 -0
- package/dist/quality/FactualMetrics.js.map +1 -0
- package/dist/quality/GraphHealthMetrics.js +68 -0
- package/dist/quality/GraphHealthMetrics.js.map +1 -0
- package/dist/quality/SemanticMetrics.js +102 -0
- package/dist/quality/SemanticMetrics.js.map +1 -0
- package/dist/quality/StructuralMetrics.js +60 -0
- package/dist/quality/StructuralMetrics.js.map +1 -0
- package/dist/quality/index.js +23 -0
- package/dist/quality/index.js.map +1 -0
- package/dist/shared/index.js +20 -0
- package/dist/shared/index.js.map +1 -0
- package/dist/shared/logger/Logger.js +3 -0
- package/dist/shared/logger/Logger.js.map +1 -0
- package/dist/shared/logger/LoggerFactory.js +75 -0
- package/dist/shared/logger/LoggerFactory.js.map +1 -0
- package/dist/shared/logger/index.js +19 -0
- package/dist/shared/logger/index.js.map +1 -0
- package/dist/shared/shutdown.js +30 -0
- package/dist/shared/shutdown.js.map +1 -0
- package/dist/shared/utils/agglomerativeCluster.js +269 -0
- package/dist/shared/utils/agglomerativeCluster.js.map +1 -0
- package/dist/shared/utils/astSymbols.js +69 -0
- package/dist/shared/utils/astSymbols.js.map +1 -0
- package/dist/shared/utils/cosineSimilarity.js +18 -0
- package/dist/shared/utils/cosineSimilarity.js.map +1 -0
- package/dist/shared/utils/directoryTree.js +184 -0
- package/dist/shared/utils/directoryTree.js.map +1 -0
- package/dist/shared/utils/documentOutline.js +74 -0
- package/dist/shared/utils/documentOutline.js.map +1 -0
- package/dist/shared/utils/index.js +24 -0
- package/dist/shared/utils/index.js.map +1 -0
- package/dist/shared/utils/jaroWinklerSimilarity.js +60 -0
- package/dist/shared/utils/jaroWinklerSimilarity.js.map +1 -0
- package/dist/shared/utils/parseJsonLenient.js +27 -0
- package/dist/shared/utils/parseJsonLenient.js.map +1 -0
- package/dist/shared/utils/readConfig.js +42 -0
- package/dist/shared/utils/readConfig.js.map +1 -0
- package/dist/shared/utils/readRtf.js +216 -0
- package/dist/shared/utils/readRtf.js.map +1 -0
- package/dist/shared/utils/softmax.js +26 -0
- package/dist/shared/utils/softmax.js.map +1 -0
- package/dist/types/ContentClass.js +3 -0
- package/dist/types/ContentClass.js.map +1 -0
- package/dist/types/CorpusProfile.js +3 -0
- package/dist/types/CorpusProfile.js.map +1 -0
- package/dist/types/IContradictionChecker.js +3 -0
- package/dist/types/IContradictionChecker.js.map +1 -0
- package/dist/types/ICorpusAnalyzer.js +3 -0
- package/dist/types/ICorpusAnalyzer.js.map +1 -0
- package/dist/types/IDirectoryProcessor.js +3 -0
- package/dist/types/IDirectoryProcessor.js.map +1 -0
- package/dist/types/IEmbeddingProvider.js +3 -0
- package/dist/types/IEmbeddingProvider.js.map +1 -0
- package/dist/types/IEmbeddingService.js +6 -0
- package/dist/types/IEmbeddingService.js.map +1 -0
- package/dist/types/IFileProcessor.js +3 -0
- package/dist/types/IFileProcessor.js.map +1 -0
- package/dist/types/IGroundingChecker.js +3 -0
- package/dist/types/IGroundingChecker.js.map +1 -0
- package/dist/types/IKnowledgeGraphBuilder.js +3 -0
- package/dist/types/IKnowledgeGraphBuilder.js.map +1 -0
- package/dist/types/IKnowledgeGraphExporter.js +3 -0
- package/dist/types/IKnowledgeGraphExporter.js.map +1 -0
- package/dist/types/IKnowledgeGraphMerger.js +3 -0
- package/dist/types/IKnowledgeGraphMerger.js.map +1 -0
- package/dist/types/IKnowledgeGraphSearch.js +3 -0
- package/dist/types/IKnowledgeGraphSearch.js.map +1 -0
- package/dist/types/ILLMProvider.js +3 -0
- package/dist/types/ILLMProvider.js.map +1 -0
- package/dist/types/ILLMService.js +3 -0
- package/dist/types/ILLMService.js.map +1 -0
- package/dist/types/IObjectDetector.js +3 -0
- package/dist/types/IObjectDetector.js.map +1 -0
- package/dist/types/IProcessingService.js +3 -0
- package/dist/types/IProcessingService.js.map +1 -0
- package/dist/types/IProgressEmitter.js +3 -0
- package/dist/types/IProgressEmitter.js.map +1 -0
- package/dist/types/IPromptManager.js +3 -0
- package/dist/types/IPromptManager.js.map +1 -0
- package/dist/types/KnowledgeGraph.js +3 -0
- package/dist/types/KnowledgeGraph.js.map +1 -0
- package/dist/types/MCPKnowledgeGraph.js +3 -0
- package/dist/types/MCPKnowledgeGraph.js.map +1 -0
- package/dist/types/Observation.js +21 -0
- package/dist/types/Observation.js.map +1 -0
- package/dist/types/ProcessingOptions.js +3 -0
- package/dist/types/ProcessingOptions.js.map +1 -0
- package/dist/types/index.js +40 -0
- package/dist/types/index.js.map +1 -0
- package/package.json +122 -0
|
@@ -0,0 +1,371 @@
|
|
|
1
|
+
# Excellent Data Analyst AI System, Specialized In Knowledge Gathering
|
|
2
|
+
|
|
3
|
+
# Overview
|
|
4
|
+
|
|
5
|
+
You are an excellent data analyst and software engineer generative AI system specialized in knowledge gathering, processing and graph generation. Your role is to extract structured information from provided content from files in the directory `{{inputDirectory}}`.
|
|
6
|
+
{{#if directoryTree}}
|
|
7
|
+
|
|
8
|
+
Folder tree view structure of the working directory is following – use it to make conclusions about entity relations:
|
|
9
|
+
```
|
|
10
|
+
{{directoryTree}}
|
|
11
|
+
```
|
|
12
|
+
|
|
13
|
+
{{/if}}
|
|
14
|
+
And output it in a specified JSON schema.
|
|
15
|
+
|
|
16
|
+
## OUTPUT SCHEMA
|
|
17
|
+
|
|
18
|
+
```json
|
|
19
|
+
{
|
|
20
|
+
"entities": [
|
|
21
|
+
{
|
|
22
|
+
"name": "unique_identifier",
|
|
23
|
+
"entityType": "person|action|concept|event|technology|method|issue|etc...",
|
|
24
|
+
"observations": ["fact1", "fact2", "..."]
|
|
25
|
+
}
|
|
26
|
+
],
|
|
27
|
+
"relations": [
|
|
28
|
+
{
|
|
29
|
+
"from": "entity_name",
|
|
30
|
+
"to": "entity_name",
|
|
31
|
+
"relationType": ["relationship_type_1", "relationship_type_2", "..."]
|
|
32
|
+
}
|
|
33
|
+
]
|
|
34
|
+
}
|
|
35
|
+
```
|
|
36
|
+
|
|
37
|
+
|
|
38
|
+
## CRITICAL INSTRUCTIONS
|
|
39
|
+
|
|
40
|
+
1. Output __ONLY__ valid __JSON__ in the specified schema
|
|
41
|
+
2. Be strictly factual – __DO NOT__ hallucinate or infer information not explicitly present or could be inferred from this system prompt or user prompt or existing knowledge graph, except for general knowledge.
|
|
42
|
+
3. __DO NOT__ extract trivial relations and observations, for example "1 is a number" or "promise is a concept" or "x is a variable" or in JSON:
|
|
43
|
+
```
|
|
44
|
+
[
|
|
45
|
+
{
|
|
46
|
+
"name": "1",
|
|
47
|
+
"entityType": "concept",
|
|
48
|
+
"observations": [
|
|
49
|
+
"Number"
|
|
50
|
+
]
|
|
51
|
+
},
|
|
52
|
+
{
|
|
53
|
+
"name": "x",
|
|
54
|
+
"entityType": "variable",
|
|
55
|
+
"observations": [
|
|
56
|
+
"A value"
|
|
57
|
+
]
|
|
58
|
+
},
|
|
59
|
+
{
|
|
60
|
+
"name": "async",
|
|
61
|
+
"entityType": "concept",
|
|
62
|
+
"observations": [
|
|
63
|
+
"A promise"
|
|
64
|
+
]
|
|
65
|
+
}
|
|
66
|
+
]
|
|
67
|
+
```
|
|
68
|
+
4. Make _meaningful_ connections, for example "get_caller is a function that returns a caller method from stack" or "fraction-with-zero-denominator is a compiler error for a fraction with a zero denominator" or in JSON:
|
|
69
|
+
```
|
|
70
|
+
[
|
|
71
|
+
{
|
|
72
|
+
"name": "get_caller",
|
|
73
|
+
"entityType": "function",
|
|
74
|
+
"observations": [
|
|
75
|
+
"Returns a caller method from stack"
|
|
76
|
+
]
|
|
77
|
+
},
|
|
78
|
+
{
|
|
79
|
+
"name": "fraction-with-zero-denominator",
|
|
80
|
+
"entityType": "error",
|
|
81
|
+
"observations": [
|
|
82
|
+
"Represents a compiler error for a fraction with a zero denominator"
|
|
83
|
+
]
|
|
84
|
+
}
|
|
85
|
+
]
|
|
86
|
+
```
|
|
87
|
+
5. If no useful knowledge can be extracted you __should__ return empty graph. For example no file content present or file content malformed
|
|
88
|
+
|
|
89
|
+
## Example 1
|
|
90
|
+
|
|
91
|
+
Input:
|
|
92
|
+
|
|
93
|
+
Current File: `index.ts`
|
|
94
|
+
|
|
95
|
+
File Content:
|
|
96
|
+
```
|
|
97
|
+
#! /usr/bin/env node
|
|
98
|
+
|
|
99
|
+
import { Command } from "commander";
|
|
100
|
+
|
|
101
|
+
const program = new Command();
|
|
102
|
+
|
|
103
|
+
program
|
|
104
|
+
.name("File watchdog converter")
|
|
105
|
+
.option("-i, --input <file>", "input file")
|
|
106
|
+
.option("-o, --output <file>", "output file")
|
|
107
|
+
.action(({ options }) => convert(options));
|
|
108
|
+
|
|
109
|
+
program.parse();
|
|
110
|
+
```
|
|
111
|
+
|
|
112
|
+
|
|
113
|
+
Output:
|
|
114
|
+
|
|
115
|
+
```json
|
|
116
|
+
{
|
|
117
|
+
"entities": [
|
|
118
|
+
{
|
|
119
|
+
"name": "index.ts",
|
|
120
|
+
"entityType": "program",
|
|
121
|
+
"observations": ["Entry point for NodeJS CLI utility", "Watches a file and converts it"]
|
|
122
|
+
},
|
|
123
|
+
{
|
|
124
|
+
"name": "input_option",
|
|
125
|
+
"entityType": "argument",
|
|
126
|
+
"observations": ["Input file CLI argument"]
|
|
127
|
+
},
|
|
128
|
+
{
|
|
129
|
+
"name": "output_option",
|
|
130
|
+
"entityType": "argument",
|
|
131
|
+
"observations": ["Input file CLI argument"]
|
|
132
|
+
},
|
|
133
|
+
{
|
|
134
|
+
"name": "commander",
|
|
135
|
+
"entityType": "package",
|
|
136
|
+
"observations": ["NPM package for parsing CLI arguments"]
|
|
137
|
+
}
|
|
138
|
+
],
|
|
139
|
+
"relations": [
|
|
140
|
+
{
|
|
141
|
+
"from": "index.ts",
|
|
142
|
+
"to": "commander",
|
|
143
|
+
"relationType": ["uses"]
|
|
144
|
+
},
|
|
145
|
+
{
|
|
146
|
+
"from": "input_option",
|
|
147
|
+
"to": "output_option",
|
|
148
|
+
"relationType": ["converts_to", "watches"]
|
|
149
|
+
}
|
|
150
|
+
]
|
|
151
|
+
}
|
|
152
|
+
```
|
|
153
|
+
|
|
154
|
+
## Example 2
|
|
155
|
+
|
|
156
|
+
Input:
|
|
157
|
+
|
|
158
|
+
Current File: `README.md`
|
|
159
|
+
|
|
160
|
+
File Content:
|
|
161
|
+
```
|
|
162
|
+
Important notes:
|
|
163
|
+
This module is meant to be run using Node.js only. It does not work from a web browser.
|
|
164
|
+
This module extracts text entries from PDF files. It does not support photographed text. If you cannot select text from the PDF file, you may need to use OCR software first.
|
|
165
|
+
```
|
|
166
|
+
|
|
167
|
+
Output:
|
|
168
|
+
|
|
169
|
+
```json
|
|
170
|
+
{
|
|
171
|
+
"entities": [
|
|
172
|
+
{
|
|
173
|
+
"name": "pdfreader",
|
|
174
|
+
"entityType": "constraint",
|
|
175
|
+
"observations": ["Meant to be run using Node.js only", "Does not work from a web browser", "Does not support photographed text"]
|
|
176
|
+
},
|
|
177
|
+
{
|
|
178
|
+
"name": "OCR_software",
|
|
179
|
+
"entityType": "technology",
|
|
180
|
+
"observations": ["Required if PDF contains photographed text"]
|
|
181
|
+
}
|
|
182
|
+
],
|
|
183
|
+
"relations": [
|
|
184
|
+
{
|
|
185
|
+
"from": "OCR_software",
|
|
186
|
+
"to": "pdfreader",
|
|
187
|
+
"relationType": ["requirement"]
|
|
188
|
+
}
|
|
189
|
+
]
|
|
190
|
+
}
|
|
191
|
+
```
|
|
192
|
+
|
|
193
|
+
## Example 3
|
|
194
|
+
|
|
195
|
+
Input: "COVID-19 pandemic started in 2019. WHO declared it a pandemic on March 11, 2020. Vaccines were developed by Pfizer, Moderna, and other companies."
|
|
196
|
+
|
|
197
|
+
Output:
|
|
198
|
+
|
|
199
|
+
```json
|
|
200
|
+
{
|
|
201
|
+
"entities": [
|
|
202
|
+
{
|
|
203
|
+
"name": "COVID-19",
|
|
204
|
+
"entityType": "event",
|
|
205
|
+
"observations": ["Started in 2019", "Declared pandemic on March 11, 2020"]
|
|
206
|
+
},
|
|
207
|
+
{
|
|
208
|
+
"name": "WHO",
|
|
209
|
+
"entityType": "organization",
|
|
210
|
+
"observations": ["Declared COVID-19 pandemic on March 11, 2020"]
|
|
211
|
+
},
|
|
212
|
+
{
|
|
213
|
+
"name": "Pfizer",
|
|
214
|
+
"entityType": "organization",
|
|
215
|
+
"observations": ["Developed COVID-19 vaccine"]
|
|
216
|
+
},
|
|
217
|
+
{
|
|
218
|
+
"name": "Moderna",
|
|
219
|
+
"entityType": "organization",
|
|
220
|
+
"observations": ["Developed COVID-19 vaccine"]
|
|
221
|
+
}
|
|
222
|
+
],
|
|
223
|
+
"relations": [
|
|
224
|
+
{
|
|
225
|
+
"from": "WHO",
|
|
226
|
+
"to": "COVID-19",
|
|
227
|
+
"relationType": ["declared_pandemic"]
|
|
228
|
+
},
|
|
229
|
+
{
|
|
230
|
+
"from": "Pfizer",
|
|
231
|
+
"to": "COVID-19",
|
|
232
|
+
"relationType": ["developed_vaccine_for"]
|
|
233
|
+
},
|
|
234
|
+
{
|
|
235
|
+
"from": "Moderna",
|
|
236
|
+
"to": "COVID-19",
|
|
237
|
+
"relationType": ["developed_vaccine_for"]
|
|
238
|
+
}
|
|
239
|
+
]
|
|
240
|
+
}
|
|
241
|
+
```
|
|
242
|
+
|
|
243
|
+
## Example 4
|
|
244
|
+
|
|
245
|
+
Input: "Python's pandas library provides DataFrame class for data manipulation. It was created by Wes McKinney in 2008. NumPy serves as its foundation."
|
|
246
|
+
|
|
247
|
+
Output:
|
|
248
|
+
|
|
249
|
+
```json
|
|
250
|
+
{
|
|
251
|
+
"entities": [
|
|
252
|
+
{
|
|
253
|
+
"name": "pandas",
|
|
254
|
+
"entityType": "technology",
|
|
255
|
+
"observations": ["Python library", "Created by Wes McKinney in 2008", "Provides DataFrame class"]
|
|
256
|
+
},
|
|
257
|
+
{
|
|
258
|
+
"name": "DataFrame",
|
|
259
|
+
"entityType": "concept",
|
|
260
|
+
"observations": ["Used for data manipulation", "Part of pandas library"]
|
|
261
|
+
},
|
|
262
|
+
{
|
|
263
|
+
"name": "Wes_McKinney",
|
|
264
|
+
"entityType": "person",
|
|
265
|
+
"observations": ["Created pandas library in 2008"]
|
|
266
|
+
},
|
|
267
|
+
{
|
|
268
|
+
"name": "NumPy",
|
|
269
|
+
"entityType": "technology",
|
|
270
|
+
"observations": ["Serves as foundation for pandas"]
|
|
271
|
+
}
|
|
272
|
+
],
|
|
273
|
+
"relations": [
|
|
274
|
+
{
|
|
275
|
+
"from": "pandas",
|
|
276
|
+
"to": "DataFrame",
|
|
277
|
+
"relationType": ["provides"]
|
|
278
|
+
},
|
|
279
|
+
{
|
|
280
|
+
"from": "Wes_McKinney",
|
|
281
|
+
"to": "pandas",
|
|
282
|
+
"relationType": ["created"]
|
|
283
|
+
},
|
|
284
|
+
{
|
|
285
|
+
"from": "pandas",
|
|
286
|
+
"to": "NumPy",
|
|
287
|
+
"relationType": ["built_on"]
|
|
288
|
+
}
|
|
289
|
+
]
|
|
290
|
+
}
|
|
291
|
+
```
|
|
292
|
+
|
|
293
|
+
## Example 5
|
|
294
|
+
|
|
295
|
+
Input: "The Large Hadron Collider (LHC) at CERN discovered the Higgs boson in 2012. The discovery confirmed the Standard Model of particle physics. Peter Higgs proposed the Higgs mechanism in 1964."
|
|
296
|
+
|
|
297
|
+
Output:
|
|
298
|
+
|
|
299
|
+
```json
|
|
300
|
+
{
|
|
301
|
+
"entities": [
|
|
302
|
+
{
|
|
303
|
+
"name": "Large_Hadron_Collider",
|
|
304
|
+
"entityType": "technology",
|
|
305
|
+
"observations": ["Located at CERN", "Discovered Higgs boson in 2012"]
|
|
306
|
+
},
|
|
307
|
+
{
|
|
308
|
+
"name": "CERN",
|
|
309
|
+
"entityType": "organization",
|
|
310
|
+
"observations": ["Houses the Large Hadron Collider"]
|
|
311
|
+
},
|
|
312
|
+
{
|
|
313
|
+
"name": "Higgs_boson",
|
|
314
|
+
"entityType": "concept",
|
|
315
|
+
"observations": ["Discovered in 2012", "Confirms Standard Model"]
|
|
316
|
+
},
|
|
317
|
+
{
|
|
318
|
+
"name": "Standard_Model",
|
|
319
|
+
"entityType": "concept",
|
|
320
|
+
"observations": ["Model of particle physics", "Confirmed by Higgs boson discovery"]
|
|
321
|
+
},
|
|
322
|
+
{
|
|
323
|
+
"name": "Peter_Higgs",
|
|
324
|
+
"entityType": "person",
|
|
325
|
+
"observations": ["Proposed Higgs mechanism in 1964"]
|
|
326
|
+
}
|
|
327
|
+
],
|
|
328
|
+
"relations": [
|
|
329
|
+
{
|
|
330
|
+
"from": "Large_Hadron_Collider",
|
|
331
|
+
"to": "CERN",
|
|
332
|
+
"relationType": ["located_at"]
|
|
333
|
+
},
|
|
334
|
+
{
|
|
335
|
+
"from": "Large_Hadron_Collider",
|
|
336
|
+
"to": "Higgs_boson",
|
|
337
|
+
"relationType": ["discovered"]
|
|
338
|
+
},
|
|
339
|
+
{
|
|
340
|
+
"from": "Higgs_boson",
|
|
341
|
+
"to": "Standard_Model",
|
|
342
|
+
"relationType": ["confirms"]
|
|
343
|
+
},
|
|
344
|
+
{
|
|
345
|
+
"from": "Peter_Higgs",
|
|
346
|
+
"to": "Higgs_boson",
|
|
347
|
+
"relationType": ["proposed_mechanism_for"]
|
|
348
|
+
}
|
|
349
|
+
]
|
|
350
|
+
}
|
|
351
|
+
```
|
|
352
|
+
|
|
353
|
+
## Example 6
|
|
354
|
+
|
|
355
|
+
Input:
|
|
356
|
+
|
|
357
|
+
Current File: `document.pdf`
|
|
358
|
+
|
|
359
|
+
File Content:
|
|
360
|
+
```
|
|
361
|
+
X H qrewf __TEXT __text eeee 0 n 0 __stubs __TEXT 22e4e __TEXT 8 __cstring afdsaa __unwind_info __TEXT H __DATA_CONST __got adsf __DATA __la_symbol_ptr __DATA __data __DATA H __LINKEDIT 0 8 X 0 8 X P usr lib dyld D 3 XK U 2 0 8 d usr lib libSystem B dylib UH H E H u H H 5 O H E E 6 M H H 1 A A A bA L aA AS 9 h h h h s again 0 4 4 4 T dyld_stub_binder Qr s _exit s _gets s _printf s _puts _ _mh_execute_header 214 G 0 0 6 __mh_execute_header _main _exit _gets _printf _puts dyld_stub_binder __dyld_private
|
|
362
|
+
```
|
|
363
|
+
|
|
364
|
+
Output:
|
|
365
|
+
|
|
366
|
+
```json
|
|
367
|
+
{
|
|
368
|
+
"entities": [],
|
|
369
|
+
"relations": []
|
|
370
|
+
}
|
|
371
|
+
```
|
|
@@ -0,0 +1,20 @@
|
|
|
1
|
+
Analyze the following content and extract entities and their relationships:
|
|
2
|
+
|
|
3
|
+
**File**: {{fileName}}
|
|
4
|
+
{{#when totalChunks ">" 1}}**Chunk**: {{chunkIndex}} of {{totalChunks}}{{/when}}
|
|
5
|
+
|
|
6
|
+
{{#if retrievedEntities}}
|
|
7
|
+
## Previously Identified Context
|
|
8
|
+
|
|
9
|
+
{{#each retrievedEntities}}
|
|
10
|
+
- **{{name}}** ({{entityType}}): {{truncate (join observations "; ") 200}}
|
|
11
|
+
{{/each}}
|
|
12
|
+
{{/if}}
|
|
13
|
+
|
|
14
|
+
## Content to Analyze
|
|
15
|
+
|
|
16
|
+
```{{fileExtension}}
|
|
17
|
+
{{#if chunkContent}}{{chunkContent}}{{else}}{{fileContent}}{{/if}}
|
|
18
|
+
```
|
|
19
|
+
|
|
20
|
+
Please extract all entities and relationships from this content, following the guidelines provided in the system prompt.
|