npm - katt - Versions diffs - 0.0.8 → 0.0.10 - Mend

katt 0.0.8 → 0.0.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (13)

package/CODE_OF_CONDUCT.MD +92 -0
package/README.md +46 -26
package/SECURITY.md +10 -0
package/build-tests/check1.eval.js +0 -9
package/build-tests/check2.eval.js +0 -7
package/dist/index.js +286 -357
package/dist/katt.js +6 -4
package/dist/runCli-DkkiL_uk.js +388 -0
package/package.json +7 -7
package/build-tests/__snapshots__/check1.snap.md +0 -1
package/build-tests/__snapshots__/check1__Hello_World__should_return_the_date_in_a_json_format.snap.md +0 -1
package/build-tests/__snapshots__/check1__root.snap.md +0 -1
package/dist/runCli-425rgVp8.js +0 -424

package/CODE_OF_CONDUCT.MD ADDED

@@ -0,0 +1,92 @@
+# Contributor Covenant 3.0 Code of Conduct
+## Our Pledge
+We pledge to make our community welcoming, safe, and equitable for all.
+We are committed to fostering an environment that respects and promotes the dignity, rights, and contributions of all individuals, regardless of characteristics including race, ethnicity, caste, color, age, physical characteristics, neurodiversity, disability, sex or gender, gender identity or expression, sexual orientation, language, philosophy or religion, national or social origin, socio-economic position, level of education, or other status. The same privileges of participation are extended to everyone who participates in good faith and in accordance with this Covenant.
+## Encouraged Behaviors
+While acknowledging differences in social norms, we all strive to meet our community's expectations for positive behavior. We also understand that our words and actions may be interpreted differently than we intend based on culture, background, or native language.
+With these considerations in mind, we agree to behave mindfully toward each other and act in ways that center our shared values, including:
+1. Respecting the **purpose of our community**, our activities, and our ways of gathering.
+2. Engaging **kindly and honestly** with others.
+3. Respecting **different viewpoints** and experiences.
+4. **Taking responsibility** for our actions and contributions.
+5. Gracefully giving and accepting **constructive feedback**.
+6. Committing to **repairing harm** when it occurs.
+7. Behaving in other ways that promote and sustain the **well-being of our community**.
+## Restricted Behaviors
+We agree to restrict the following behaviors in our community. Instances, threats, and promotion of these behaviors are violations of this Code of Conduct.
+1. **Harassment.** Violating explicitly expressed boundaries or engaging in unnecessary personal attention after any clear request to stop.
+2. **Character attacks.** Making insulting, demeaning, or pejorative comments directed at a community member or group of people.
+3. **Stereotyping or discrimination.** Characterizing anyone’s personality or behavior on the basis of immutable identities or traits.
+4. **Sexualization.** Behaving in a way that would generally be considered inappropriately intimate in the context or purpose of the community.
+5. **Violating confidentiality**. Sharing or acting on someone's personal or private information without their permission.
+6. **Endangerment.** Causing, encouraging, or threatening violence or other harm toward any person or group.
+7. Behaving in other ways that **threaten the well-being** of our community.
+### Other Restrictions
+1. **Misleading identity.** Impersonating someone else for any reason, or pretending to be someone else to evade enforcement actions.
+2. **Failing to credit sources.** Not properly crediting the sources of content you contribute.
+3. **Promotional materials**. Sharing marketing or other commercial content in a way that is outside the norms of the community.
+4. **Irresponsible communication.** Failing to responsibly present content which includes, links or describes any other restricted behaviors.
+## Reporting an Issue
+Tensions can occur between community members even when they are trying their best to collaborate. Not every conflict represents a code of conduct violation, and this Code of Conduct reinforces encouraged behaviors and norms that can help avoid conflicts and minimize harm.
+When an incident does occur, it is important to report it promptly. To report a possible violation, **Open a discussion in this repository.**
+Community Moderators take reports of violations seriously and will make every effort to respond in a timely manner. They will investigate all reports of code of conduct violations, reviewing messages, logs, and recordings, or interviewing witnesses and other participants. Community Moderators will keep investigation and enforcement actions as transparent as possible while prioritizing safety and confidentiality. In order to honor these values, enforcement actions are carried out in private with the involved parties, but communicating to the whole community may be part of a mutually agreed upon resolution.
+## Addressing and Repairing Harm
+****
+If an investigation by the Community Moderators finds that this Code of Conduct has been violated, the following enforcement ladder may be used to determine how best to repair harm, based on the incident's impact on the individuals involved and the community as a whole. Depending on the severity of a violation, lower rungs on the ladder may be skipped.
+1) Warning
+   1) Event: A violation involving a single incident or series of incidents.
+   2) Consequence: A private, written warning from the Community Moderators.
+   3) Repair: Examples of repair include a private written apology, acknowledgement of responsibility, and seeking clarification on expectations.
+2) Temporarily Limited Activities
+   1) Event: A repeated incidence of a violation that previously resulted in a warning, or the first incidence of a more serious violation.
+   2) Consequence: A private, written warning with a time-limited cooldown period designed to underscore the seriousness of the situation and give the community members involved time to process the incident. The cooldown period may be limited to particular communication channels or interactions with particular community members.
+   3) Repair: Examples of repair may include making an apology, using the cooldown period to reflect on actions and impact, and being thoughtful about re-entering community spaces after the period is over.
+3) Temporary Suspension
+   1) Event: A pattern of repeated violation which the Community Moderators have tried to address with warnings, or a single serious violation.
+   2) Consequence: A private written warning with conditions for return from suspension. In general, temporary suspensions give the person being suspended time to reflect upon their behavior and possible corrective actions.
+   3) Repair: Examples of repair include respecting the spirit of the suspension, meeting the specified conditions for return, and being thoughtful about how to reintegrate with the community when the suspension is lifted.
+4) Permanent Ban
+   1) Event: A pattern of repeated code of conduct violations that other steps on the ladder have failed to resolve, or a violation so serious that the Community Moderators determine there is no way to keep the community safe with this person as a member.
+   2) Consequence: Access to all community spaces, tools, and communication channels is removed. In general, permanent bans should be rarely used, should have strong reasoning behind them, and should only be resorted to if working through other remedies has failed to change the behavior.
+   3) Repair: There is no possible repair in cases of this severity.
+This enforcement ladder is intended as a guideline. It does not limit the ability of Community Managers to use their discretion and judgment, in keeping with the best interests of our community.
+## Scope
+This Code of Conduct applies within all community spaces, and also applies when an individual is officially representing the community in public or other spaces. Examples of representing our community include using an official email address, posting via an official social media account, or acting as an appointed representative at an online or offline event.
+## Attribution
+This Code of Conduct is adapted from the Contributor Covenant, version 3.0, permanently available at [https://www.contributor-covenant.org/version/3/0/](https://www.contributor-covenant.org/version/3/0/).
+Contributor Covenant is stewarded by the Organization for Ethical Source and licensed under CC BY-SA 4.0. To view a copy of this license, visit [https://creativecommons.org/licenses/by-sa/4.0/](https://creativecommons.org/licenses/by-sa/4.0/)
+For answers to common questions about Contributor Covenant, see the FAQ at [https://www.contributor-covenant.org/faq](https://www.contributor-covenant.org/faq). Translations are provided at [https://www.contributor-covenant.org/translations](https://www.contributor-covenant.org/translations). Additional enforcement and community guideline resources can be found at [https://www.contributor-covenant.org/resources](https://www.contributor-covenant.org/resources). The enforcement ladder was inspired by the work of [Mozilla’s code of conduct team](https://github.com/mozilla/inclusion).

package/README.md CHANGED

@@ -12,24 +12,21 @@ Katt is a lightweight testing framework for running AI Evals, inspired by [Jest]
 - [Articles](#articles)
 - [Hello World - Example](#hello-world---example)
 - [Main Features](#main-features)
-- [Usage](#usage)
 - [Installation](#installation)
 - [Basic Usage](#basic-usage)
-- [Using promptFile](#using-promptfile)
 - [Specifying AI Models](#specifying-ai-models)
 - [Development](#development)
-- [Setup](#setup)
-- [Available Scripts](#available-scripts)
-- [Verification Process](#verification-process)
-- [Project Structure](#project-structure)
 - [How It Works](#how-it-works)
+- [Execution Flow](#execution-flow)
+- [Architecture](#architecture)
 - [Requirements](#requirements)
 - [License](#license)
 - [Contributing](#contributing)
 ## Overview
-Katt is designed to evaluate and validate the behavior of AI agents like **Claude Code**, **GitHub Copilot**, **OpenAI Codex** and more. It provides a simple, intuitive API for writing tests that interact with AI models and assert their responses.
+#### ✨ Run your own benchmarks and evaluations ✨
+**Katt** is designed to evaluate and validate the behavior of AI agents like **Claude Code**, **GitHub Copilot**, **OpenAI Codex** and more. It provides a simple, intuitive API for writing tests that interact with AI models and assert their responses.
 ## API Documentation
@@ -130,7 +127,7 @@ describe("Model selection", () => {
 You can also set runtime defaults in `katt.json`.
-Copilot (default runtime):
+GitHub Copilot (default runtime):
 ```json
 {
@@ -189,29 +186,36 @@ npm install
 ### Verification Process
-After making changes, run the following sequence:
+To verify your changes before opening a pull request, run:
-1. `npm run format`
+1. `npm test`
 2. `npm run typecheck`
-3. `npm run test`
-4. `npm run build`
-5. `npm run test:build`
+3. `npm run lint`
+4. `npm run format`
-## Project Structure
+For more details, see the [verification process section in CONTRIBUTING.md](./CONTRIBUTING.md#verification-process).
+## How It Works
+Katt runs eval files as executable test programs and coordinates collection, assertion failures, and reporting through its runtime context.
+## Execution Flow
+```mermaid
+sequenceDiagram
+  participant User as User/CI
+  participant CLI as katt CLI
+  participant FS as File Scanner
+  participant Eval as Eval Runtime
+  participant Report as Reporter
+  User->>CLI: Run `npx katt`
+  CLI->>FS: Discover `*.eval.js` and `*.eval.ts`
+  FS-->>CLI: Return eval file list
+  CLI->>Eval: Execute eval files
+  Eval-->>CLI: Return pass/fail results
+  CLI->>Report: Print per-test output + summary
+  Report-->>User: Exit code (`0` pass, `1` fail)
 ```
-katt/
-├── src/              # Source code
-│   ├── cli/          # CLI implementation
-│   ├── lib/          # Core libraries (describe, it, expect, prompt)
-│   └── types/        # TypeScript type definitions
-├── examples/         # Example eval files
-├── specs/            # Markdown specifications
-├── package.json      # Package configuration
-└── tsconfig.json     # TypeScript configuration
-```
-## How It Works
 1. Katt searches the current directory recursively for `*.eval.js` and `*.eval.ts` files
 2. It skips `.git` and `node_modules` directories
@@ -221,6 +225,22 @@ katt/
 6. A summary is displayed showing passed/failed tests and total duration
 7. Katt exits with code `0` on success or `1` on failure
+## Architecture
+```mermaid
+flowchart LR
+  User["Developer"] --> CLI["katt CLI"]
+  CLI --> EvalFiles["Eval files (*.eval.ts / *.eval.js)"]
+  CLI --> Config["katt.json config"]
+  EvalFiles --> Runtime["Test runtime (describe/it context)"]
+  Config --> Runtime
+  Runtime --> Assertions["Assertions + snapshots"]
+  Runtime --> Prompts["prompt() / promptFile()"]
+  Prompts --> AI["AI runtime (GitHub Copilot or Codex CLI)"]
+  Assertions --> Report["Terminal report + exit code"]
+  AI --> Report
+```
 ## Requirements
 - Node.js
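
The discovery and exit-code steps listed in the README's "How It Works" section (recursive search for `*.eval.js` and `*.eval.ts`, skipping `.git` and `node_modules`, exit code `0` or `1`) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the code shipped in `dist/`: the names `findEvalFiles` and `exitCodeFor` and the `{ passed }` result shape are hypothetical and do not appear in the package.

```javascript
// Hypothetical sketch of the documented behavior; not katt's actual implementation.
const fs = require("node:fs");
const path = require("node:path");

// Directories the README says are skipped during discovery.
const SKIPPED_DIRS = new Set([".git", "node_modules"]);
// File names the README says are treated as eval files.
const EVAL_FILE_PATTERN = /\.eval\.(js|ts)$/;

// Recursively collect eval files under `rootDir`, skipping ignored directories.
function findEvalFiles(rootDir) {
  const found = [];
  for (const entry of fs.readdirSync(rootDir, { withFileTypes: true })) {
    const fullPath = path.join(rootDir, entry.name);
    if (entry.isDirectory()) {
      if (!SKIPPED_DIRS.has(entry.name)) {
        found.push(...findEvalFiles(fullPath));
      }
    } else if (entry.isFile() && EVAL_FILE_PATTERN.test(entry.name)) {
      found.push(fullPath);
    }
  }
  // Sort so the result does not depend on directory iteration order.
  return found.sort();
}

// Map per-test outcomes (assumed shape: { passed: boolean }) to a process exit code.
function exitCodeFor(results) {
  return results.every((result) => result.passed) ? 0 : 1;
}
```

Calling `findEvalFiles(process.cwd())` would return the matching paths in sorted order, and `process.exit(exitCodeFor(results))` would mirror the documented `0` on success and `1` on failure. How the real CLI orders files or represents results is not shown in this diff.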

package/SECURITY.md ADDED

@@ -0,0 +1,10 @@
+# Security Policy
+## Supported Versions
+Since Katt is under development, only the latest version will be supported.
+## Reporting a Vulnerability
+- Create an issue on this repository.
+- Describe the vulnerability and the level of it.

package/build-tests/check1.eval.js CHANGED

@@ -7,13 +7,4 @@ describe('Hello World', () => {
         const result = await prompt('Return the current year in the format "{ year: YYYY }"');
         expect(result).toContain(`{ year: ${currentData.getFullYear()} }`);
     });
-    it('should classify a response as helpful', async () => {
-        const response = await prompt('You are a helpful assistant. Give one short tip for learning JavaScript.');
-        await expect(response).toBeClassifiedAs('helpful', { threshold: 3 });
-    });
 });
-const result2 = await prompt('If you read this just say heeey');
-expect(result2.toLowerCase()).toMatchSnapshot();

package/build-tests/check2.eval.js CHANGED

@@ -6,10 +6,3 @@ describe('Working with files', () => {
         expect(result.toLowerCase()).toContain('hola');
     });
 });
-describe('Working with prompt as expectation', () => {
-    it('It should be friendly', async () => {
-        const result = await prompt('You are a friendly assistant. If you read this, say "Hola"!', { model: 'gpt-5.2' });
-        expect(result).promptCheck('To be friendly, the response should contain a greeting.');
-    });
-});