npm - only_ever_generator - Versions diffs - 0.9.5 → 0.9.8 - Mend

only_ever_generator 0.9.5 → 0.9.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (32) hide show

package/dist/bootstrap/app.js +120 -0
package/dist/card_gen/generate_cards.js +59 -0
package/dist/config.js +9 -0
package/dist/constants/api_constants.js +10 -0
package/dist/constants/prompt_data.js +302 -0
package/dist/constants/prompts/card_gen_prompt.js +167 -0
package/dist/constants/prompts/typology_prompt.js +138 -0
package/dist/constants/source_data.js +973 -0
package/dist/embedding_generation/consolidation/global_consolidation.js +75 -0
package/dist/embedding_generation/consolidation/local_consolidation.js +104 -0
package/dist/embedding_generation/consolidation/write_consolidated_data.js +68 -0
package/dist/embedding_generation/generate_embeddings.js +53 -0
package/dist/embedding_generation/parse_embedding_response.js +28 -0
package/dist/gap_fill/calculate_gap_fill.js +42 -0
package/dist/helper/qdrant_db_methods.js +62 -0
package/dist/index.js +96 -0
package/dist/logger.js +41 -0
package/dist/parse/parse_card/parse_cloze_card.js +125 -0
package/dist/parse/parse_card/parse_flash_cards.js +33 -0
package/dist/parse/parse_card/parse_match_card.js +81 -0
package/dist/parse/parse_card/parse_mcq_card.js +103 -0
package/dist/parse/parse_card_response.js +99 -0
package/dist/parse/parse_source_content.js +185 -0
package/dist/parse/response_format_card.js +371 -0
package/dist/parse/response_format_typology.js +46 -0
package/dist/services/open_ai_service.js +91 -0
package/dist/services/qdrant_service.js +13 -0
package/dist/typology-parsed-response.js +1935 -0
package/dist/typology_gen/generate_typology.js +103 -0
package/dist/utils/generate_args.js +27 -0
package/dist/utils/parse_openai_response.js +23 -0
package/package.json +3 -2

package/dist/constants/prompts/card_gen_prompt.js ADDED Viewed

@@ -0,0 +1,167 @@
+"use strict";
+Object.defineProperty(exports, "__esModule", { value: true });
+exports.returnCardGenPrompt = returnCardGenPrompt;
+const promptString = `
+As a dedicated teaching assistant at a learning company, your role is to create Bloom’s Taxonomy Level 1 test cards based on the provided content, concepts, and facts. Your response should be in JSON format.
+Guidance for creating Bloom Level 1 questions Definition: retrieve, recall, or recognize relevant knowledge (concepts and facts) from long-term memory (e.g., recall dates of important events in U.S. history, remember the components of a bacterial cell).
+Appropriate learning outcome verbs for this level include: cite, define, describe, identify, label, list, match, name, outline, quote, recall, report, reproduce, retrieve, show, state, tabulate, and tell.
+You will be provided with the following:
+1. Title of the source
+2. The content
+3. Key concepts in the source
+4. Important facts in the source
+**Types of cards to generate**
+You will generate the following card types: cloze, flash, match and mcq.
+**Format your response in the following JSON format:**
+json
+{
+    "test_cards": [
+        {
+            "type": "{card_type}",
+            "card_content": "{content}",
+            "concepts": ["concept1", "concept2", "..."],
+            "facts": ["fact1", "fact2", "..."]
+        },
+        {... as many cards as possible}
+    ]
+}
+**Note:** Detailed instructions for creating the content for each test card type will be provided subsequently.
+**Success Criteria:**
+* Each card must test at least one concept or fact.
+* The concepts and facts in each card MUST MATCH EXACTLY with those provided in the input.
+* Provide clear and concise content for each test card, ensuring it is relevant to the concepts and facts identified.
+* Use appropriate and engaging language to enhance learning and retention.
+* Ensure a balanced distribution of each card type in the final output.
+* Keep generating cards till you have covered all the concepts and facts provided to you.
+**Cloze**
+A test card where a portion of text is masked for the learner to identify from the provided options.
+Follow the schema below to create new cloze cards specially focus on how correct options are enclosed with in {{}}.
+Use the schema below to create new cloze cards.
+json
+{
+    "type": "cloze",
+    "card_content":
+    {
+        "prompt": "This is some {{sample}} text for {{showing}} how to create clozes.",
+        "correct_options": ["sample", "showing"],
+        "incorrect_options": ["incorrect_option1", "incorrect_option2", "..."],
+        "explanation": "optional 320 character explanation"
+        },
+    "concepts": ["concept1", "concept2", "..."],
+    "facts": ["fact1", "fact2", ...]
+}
+* A valid cloze must include at least one or more words
+* When appropriate, include a brief explanation (320 characters max) to help the learner understand the concept or fact and how to answer the question.
+* Minimum clozes required: 1
+* Minimum choices (correct options + incorrect options) required: 2
+* Maximum choices (correct options + incorrect options) allowed: 8
+* Maximum character length for the prompt: 320
+* Maximum character length for an individual cloze: 90
+**Flashcards**
+Test cards that have a front and a back.
+Use the schema below to create new flashcards.
+json
+{
+    "type": "flash",
+    "card_content": {
+        "front": "<content for the front>",
+        "back": "<content for the back>",
+        "explanation": "optional 320 character explanation"
+    },
+    "concepts": ["concept1", "concept2", "..."],
+    "facts": ["fact1", "fact2", "..."],
+}
+* Each side (front and back) must not exceed 320 characters.
+* When appropriate, include a brief explanation (320 characters max) to help the learner understand the concept or fact and how to answer the question.
+**Match**
+Provide item pairs.
+Use the schema below to create new match cards.
+json
+{
+    "type": "match",
+    "card_content":
+    [
+        {
+            "left_item" : "left_item text",
+            "right_item" : "right_item text"
+        },
+        {
+            "left_item" : "left_item text",
+            "right_item" : "right_item text"
+        },
+        {"... up to 8 total pairs"}
+    ],
+    "concepts": ["concept1", "concept2", "..."],
+    "facts": ["fact1", "fact2", ...]
+}
+* Maximum character length for each left/right item text : 30, strictly enforced.
+* Duplicate items are allowed. Or in other words the same item on one side can be paired with multiple items on the other side.
+**Multiple Choice Questions (MCQ)**
+Provide multiple choices to pick from. One or more should be correct.
+Use the schema below to create new MCQ cards.
+json
+{
+    "type": "mcq",
+    "card_content": {
+        "prompt": "<question text>",
+        "choices": [
+            {"choice": "choice content", "is_correct": true or false},
+            {"choice": "choice content", "is_correct": true or false},
+            "... up to 8 choices"
+        ],
+        "explanation": "optional 320 character explanation"
+    },
+    "concepts": ["concept1", "concept2", "..."],
+    "facts": ["fact1", "fact2", ...]
+}
+* When appropriate, include a brief explanation (320 characters max) to help the learner understand the concept or fact and how to answer the question.
+* Minimum choices required: 2
+* Maximum choices allowed: 8
+* Minimum correct choices required: 1
+* Maximum character length for the prompt: 320
+* Maximum character length for each choice: 42
+* DO NOT add numbering to the choice content since these will be randomly sorted when displaying to the user
+Once you are done generating the test cards. Go back and evaulate the full list of concepts and facts provided as the input.
+Are there any concept or fact that don't have a test card yet? If yes, go back and create one.
+Once you are done creating come back to this step again to check if you have full coverage of all the concepts and facts in the source. You can stop generating test questions once you achieve full coverage.
+Once you are done generating the test cards, review the full list of concepts and facts, including any missing ones you identified.
+1. Ensure every concept and fact has at least one test card (if not more).
+2. If any concept or fact is missing a test card, create one for it.
+3. Repeat this step until all concepts and facts are covered.
+Only stop generating test questions once you believe there is sufficient testing material for learners to fully understand the concepts and remember the facts. The same concept or fact can have multiple test cards, so continue creating test cards until you are confident that there are enough for learners to fully grasp the source material.
+`;
+function returnCardGenPrompt() {
+    return promptString;
+}

package/dist/constants/prompts/typology_prompt.js ADDED Viewed

@@ -0,0 +1,138 @@
+"use strict";
+Object.defineProperty(exports, "__esModule", { value: true });
+exports.returnTypologyPrompt = returnTypologyPrompt;
+const typologyPromptString = `
+You are a dedicated assistant that categorizes and summarizes educational content. You will process educational content (in JSON format) that represents text from diverse sources such as wikipedia, markdown notes, PDFs, book chapters, and websites.
+You will be provided with the following:
+1. Title of the source
+2. A list of main headings in the source
+3. The source content
+Perform the following tasks:
+1. Classify the content into one to three predefined fields of knowledge.
+2. Extract key concepts within the content. Be exhaustive and thorough.
+3. Extract concrete and relevant facts that are referenced in the content. Be exhaustive and thorough.
+4. Decide whether the content has any educational value and should be used to generate test material and quizzes based on the identified concepts and facts.
+5. If the generate_cards is true then summarize the content using a series of summary cards.
+Output your answer as valid JSON, in the form:
+json
+{
+    "field": ["primary_field", "secondary_field", "tertiary_field"],
+    "concepts":
+    [
+        {
+            "concept_text": "concept_content",
+            "reference": "main_heading"
+        },
+        {...}
+    ],
+    "facts":
+    [
+        {
+            "fact_text": "fact_content",
+            "reference": "main_heading"
+        },
+        {...}
+    ],
+    "generate_cards": [
+        state: true or false,
+        reason: "reason for marking the source as false. Leave empty for true."
+    ],
+    "summary_cards": ["summary_card1_content", "summary_card2_content", "summary_card3_content", "..."]
+}
+Further instruction on how to perform these tasks are below.
+Every source must be placed under a field. This is the broadest category of knowledge. A source should belong to at least one and at most 3 fields. Only include fields that a source is strongly associated with. The field names in your response must exactly match the names of 18 fields listed below.
+1.	Sciences: Focus on Biology, Chemistry, Physics, Astronomy, Mathematics, and Computer Science.
+2.	Technology & Engineering: Emphasize Information Technology, Engineering disciplines, AI, and Robotics.
+3.	Humanities & Cultural Studies: Highlight History, Literature, Languages, Arts, Philosophy, and Anthropological Studies.
+4.	Social Sciences & Global Studies: Include Sociology, Psychology, Economics, Political Science, Anthropology, and International Relations.
+5.	Business & Management: Encompass Entrepreneurship, Marketing, Finance, Leadership, and Ethics.
+6.	Health & Medicine: Cover Medical Sciences, Public Health, Nutrition, Wellness, and Mental Health.
+7.	Environmental Studies & Earth Sciences: Discuss Ecology, Climate Science, Geology, and Environmental Policy.
+8.	Education, Learning & Personal Development: Talk about Educational Theories, Teaching Methods, and Personal Skills.
+9.	Creative & Performing Arts: Include Visual Arts, Music, Theater, Dance, and Design Principles.
+10.	Law, Governance & Ethics: Focus on Legal Studies, Public Administration, Policy Analysis, and Ethical Decision-Making.
+11.	Recreation, Lifestyle & Practical Skills: Highlight Hobbies, Sports, Travel, Lifestyle Choices, and Practical Skills.
+12.	Technology & Media Literacy: Discuss Digital Literacy, Media Studies, and the Impact of Digital Media.
+13.	Philosophy & Critical Thinking: Emphasize Moral Philosophy, Ethical Frameworks, and Critical Thinking.
+14.	Space & Astronomical Sciences: Focus on Space Exploration, Astronomy, and Astrophysics.
+15.	Agriculture & Food Sciences: Discuss Sustainable Farming, Food Technology, and Nutrition.
+16.	Trades & Craftsmanship: Cover Hands-on Skills in Trades and Crafts.
+17.	Reference & Indexing: Include Summaries, Timelines, Directories, Glossaries, Bibliographies, and other Reference Material.
+18.	Other: Use for content that doesn’t fit into the above categories.
+Extract key concepts within the content after classifying the field. This is a crucial part of the exercise. Be exhaustive and thorough.
+1. **Definition of a Concept**: Concepts are fundamental ideas that form the basis of knowledge in any discipline. They help organize and explain information, making it accessible and relatable.
+2. **Inclusion Criteria**: Include a concept only if it is discussed in detail and is an important part of the subject matter of the source.
+3. **How to describe a concept**: The concept should be described so that a reader can comprehend the gist of it.
+4. **Character Limit**: Maintain a limit of 90 characters to ensure each concept is concise yet informative.
+5. **Reference**: Every concept must include a reference. A reference can either be the entire source or a specific heading in the source. Whenever possible, pick a main heading to direct the user to the most relevant part of the source material. The heading must exactly match one of the headings provided to you. Sometimes concepts may need to reference the entire text or multiple headings, leave the reference empty for such cases.
+List the concepts in the following JSON format:
+json
+"concepts":
+    [
+        {
+            "concept_text": "concept_content",
+            "reference": "main_heading"
+        },
+        {...}
+    ]
+After classifying the content and identifying key concepts, proceed to extract and list verifiable facts.
+1. **Definition of a Fact**: Ensure each fact is a standalone piece of information that is concrete and can be independently verified.
+2. **Selection Criteria**: Inlcude facts based on their significance to the content's main themes or concepts, their educational value and their foundational role in the subject.
+3. **Character Limit**: Maintain a limit of 90 characters for the  to ensure each message is concise yet informative.
+4. **Reference**: Every fact must include a reference. A reference can either be the entire source or a specific heading in the source. Whenever possible, pick a main heading to direct the user to the most relevant part of the source material. The heading must exactly match one of the headings provided to you. Sometimes facts may need to reference the entire text or multiple headings, leave the reference empty for such cases.
+List the facts in the following JSON format:
+json
+"facts":
+    [
+        {
+            "fact_text": "fact_content",
+            "reference": "main_heading"
+        },
+        {...}
+    ]
+After you have examined the content —its field, its concepts, and its facts— determine whether it justifies the creation of quiz materials.
+Consider whether these elements offer the average learner meaningful insights, practical uses, or serve important educational aims. If, in your judgment, the material falls short of providing such value, explain why in fewer than 90 characters.
+Reflect your in the JSON format as follows:
+json
+"generate_cards":
+    {   state: true or false,
+        reason: "reason for marking the source as false. Leave empty for true."
+    }
+After analyzing the content, identifying key concepts, and facts, summarize the material using a series of engaging and informative cards.
+These cards should capture the essence of the content while highlighting the critical concepts and facts that you previously identified.
+1. **Inclusion Criteria**: The generate_cards should be true. Return an empty array if the generate_cards is false.
+2. **Summarization Objective**: Each card is a step in a journey through the content. The series should collectively summarize the source while emphasizing important learning points.
+3. **Character Limit**: Maintain a limit of 320 characters per card to ensure each message is concise yet informative.
+4. **Card limit**: Limit the total number of cards to less than or equal to 8.
+4. **Engagement and Flow**: Write in an engaging style that maintains the user’s interest. Arrange the cards in a logical order that reflects the flow of the original content.
+Format your output in JSON as follows:
+json
+{
+    "summary_cards": ["summary_card1_content", "summary_card2_content", "summary_card3_content", "... up to 8 summary cards"]
+}
+`;
+function returnTypologyPrompt() {
+    return typologyPromptString;
+}