PyPI - pixie-qa - Versions diffs - 0.1.10__tar.gz → 0.1.11__tar.gz - Mend

pixie-qa 0.1.10.tar.gz → 0.1.11.tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (117)

{pixie_qa-0.1.10 → pixie_qa-0.1.11}/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: pixie-qa
-Version: 0.1.10
+Version: 0.1.11
 Summary: Automated quality assurance for AI applications
 Project-URL: Homepage, https://github.com/yiouli/pixie-qa
 Project-URL: Repository, https://github.com/yiouli/pixie-qa
@@ -66,11 +66,11 @@ Description-Content-Type: text/markdown
 # pixie-qa
-An agent skill for **eval-driven development** of LLM-powered applications.
+An agent skill that make coding agent the QA engineer for LLM applications.
 ## What the Skill Does
-The `eval-driven-dev` skill guides your coding agent through the full QA loop for LLM applications:
+The `qa-eval` skill guides your coding agent through the full eval-based QA loop for LLM applications:
 1. **Understand the code** — read the codebase, trace the data flow, learn what the code is supposed to do
 2. **Instrument it** — add `enable_storage()` and `@observe` so every run is captured to a local SQLite database

{pixie_qa-0.1.10 → pixie_qa-0.1.11}/README.md RENAMED

@@ -1,10 +1,10 @@
 # pixie-qa
-An agent skill for **eval-driven development** of LLM-powered applications.
+An agent skill that make coding agent the QA engineer for LLM applications.
 ## What the Skill Does
-The `eval-driven-dev` skill guides your coding agent through the full QA loop for LLM applications:
+The `qa-eval` skill guides your coding agent through the full eval-based QA loop for LLM applications:
 1. **Understand the code** — read the codebase, trace the data flow, learn what the code is supposed to do
 2. **Instrument it** — add `enable_storage()` and `@observe` so every run is captured to a local SQLite database

{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pyproject.toml RENAMED

@@ -1,6 +1,6 @@
 [project]
 name = "pixie-qa"
-version = "0.1.10"
+version = "0.1.11"
 description = "Automated quality assurance for AI applications"
 readme = "README.md"
 requires-python = ">=3.11"

{pixie_qa-0.1.10 → pixie_qa-0.1.11}/skills/eval-driven-dev/SKILL.md RENAMED

@@ -1,23 +1,23 @@
 ---
 name: eval-driven-dev
-description: Instrument Python LLM apps, build golden datasets, write eval-based tests, run them, and root-cause failures — covering the full eval-driven development cycle. Make sure to use this skill whenever a user is developing, testing, QA-ing, evaluating, or benchmarking a Python project that calls an LLM, even if they don't say "evals" explicitly. Use for making sure an AI app works correctly, catching regressions after prompt changes, debugging why an agent started behaving differently, or validating output quality before shipping.
+description: Add instrumentation, build golden datasets, write eval-based tests, run them, root-cause failures, and iterate — Ensure your Python LLM application works correctly. Make sure to use this skill whenever a user is developing, testing, QA-ing, evaluating, or benchmarking a Python project that calls an LLM. Use for making sure an LLM application works correctly, catching regressions after prompt changes, fixing unexpected behavior, or validating output quality before shipping.
 license: MIT
 compatibility: Python 3.11+
 metadata:
-  version: 0.1.10
+  version: 0.1.11
 ---
-# Eval-Driven Development with pixie
+# Evaluation-Driven Development for Python LLM Applications
 This skill is about doing the work, not describing it. When a user asks you to set up evals for their app, you should be reading their code, editing their files, running commands, and producing a working test pipeline — not writing a plan for them to follow later.
 ## Startup checks (always first)
-Before doing anything else, perform these two steps in order:
+This skill uses the python `pixie-qa` package. Before doing anything else, perform these two steps:
 ### 1. Upgrade pixie-qa
-Always attempt to upgrade the `pixie-qa` package in the user's environment regardless of whether the skill itself is outdated. Detect the package manager from the project (check for `uv.lock`, `poetry.lock`, `requirements.txt`, or a plain `pip` environment) and run the appropriate upgrade command:
+Attempt to upgrade the `pixie-qa` package in the user's environment. Detect the package manager from the project (check for `uv.lock`, `poetry.lock`, `requirements.txt`, or a plain `pip` environment) and run the appropriate upgrade command:
 - **uv**: `uv add pixie-qa --upgrade` (or `uv sync --upgrade-package pixie-qa`)
 - **poetry**: `poetry add pixie-qa@latest`
@@ -27,7 +27,7 @@ If the upgrade fails (e.g., no network, version conflict), log the error and con
 ### 2. Check skill version
-After upgrading pixie-qa, check whether a newer version of this skill itself is available by running the script `check_version.py`.
+Check whether a newer version of this skill itself is available by running the script `check_version.py`.
 If there is a newer version, reinstall the skill with `npx skills add` before proceeding.
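Reviewer note: the package-manager detection described in the "Upgrade pixie-qa" step above could be sketched roughly as follows. This is only an illustration, not code shipped in the package. The helper name `detect_upgrade_command` is hypothetical, and because the hunk elides the `requirements.txt` and plain `pip` commands, the `pip install --upgrade` fallback is an assumption based on standard pip usage.

```python
from pathlib import Path


def detect_upgrade_command(project_dir: Path) -> list[str]:
    """Pick an upgrade command from the lock files present (hypothetical helper)."""
    # uv projects carry a uv.lock; the command is the one quoted in SKILL.md.
    if (project_dir / "uv.lock").exists():
        return ["uv", "add", "pixie-qa", "--upgrade"]
    # poetry projects carry a poetry.lock; the command is the one quoted in SKILL.md.
    if (project_dir / "poetry.lock").exists():
        return ["poetry", "add", "pixie-qa@latest"]
    # Assumption: the diff does not show the pip commands, so requirements.txt
    # and plain pip environments fall back to the standard pip upgrade.
    return ["python", "-m", "pip", "install", "--upgrade", "pixie-qa"]
```

The returned list could be passed to `subprocess.run(...)`. Per the skill text, a failed upgrade should be logged rather than treated as fatal.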

{pixie_qa-0.1.10 → pixie_qa-0.1.11}/skills/eval-driven-dev/resources/check_version.py RENAMED

@@ -1,5 +1,5 @@
 #!/usr/bin/env python3
-"""Check whether the eval-driven-dev skill is outdated and update it if needed."""
+"""Check whether the qa-eval skill is outdated and update it if needed."""
 from __future__ import annotations
@@ -9,9 +9,7 @@ from urllib.error import URLError
 from urllib.request import urlopen
 SKILL_REPO = "/yiouli/pixie-qa/"
-SKILL_URL = (
-    f"https://raw.githubusercontent.com{SKILL_REPO}main/skills/eval-driven-dev/SKILL.md"
-)
+SKILL_URL = f"https://raw.githubusercontent.com{SKILL_REPO}main/skills/qa-eval/SKILL.md"
 _RE_FRONTMATTER = re.compile(r"^---\s*\n(.*?)\n---", re.DOTALL)
 _RE_NAME = re.compile(r"^name:\s*(.+)$", re.MULTILINE)
@@ -57,7 +55,7 @@ def _normalise_version(version: str) -> tuple[int, ...]:
 def main() -> int:
     resource_dir = Path(__file__).resolve().parent
-    skill_dir = resource_dir.parent  # skills/eval-driven-dev/
+    skill_dir = resource_dir.parent  # skills/ai-qa/
     local_data = _load_local_version(skill_dir)
     local_version = local_data.get("version", "0.0.0")
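Reviewer note: the diff shows only fragments of `check_version.py` (the frontmatter regex and the `_normalise_version` signature). A minimal sketch of how such a version comparison might work is given below, under stated assumptions: `_RE_FRONTMATTER` is copied from the hunk, while `_RE_VERSION`, `extract_version`, `normalise_version`, and `is_outdated` are hypothetical names whose bodies are not taken from the package.

```python
import re

# Copied from the hunk above.
_RE_FRONTMATTER = re.compile(r"^---\s*\n(.*?)\n---", re.DOTALL)
# Assumption: a pattern like this finds the (possibly indented) version key.
_RE_VERSION = re.compile(r"^\s*version:\s*(.+)$", re.MULTILINE)


def extract_version(skill_md: str) -> str:
    """Return the version from SKILL.md frontmatter, or "0.0.0" if absent."""
    front = _RE_FRONTMATTER.search(skill_md)
    if front is None:
        return "0.0.0"
    match = _RE_VERSION.search(front.group(1))
    return match.group(1).strip() if match else "0.0.0"


def normalise_version(version: str) -> tuple[int, ...]:
    """Turn "0.1.11" into (0, 1, 11) so versions compare numerically."""
    return tuple(int(part) for part in version.split(".") if part.isdigit())


def is_outdated(local: str, remote: str) -> bool:
    """True when the remote version is strictly newer than the local one."""
    return normalise_version(local) < normalise_version(remote)
```

Comparing integer tuples rather than raw strings matters for releases like these: as strings, `"0.1.9"` sorts after `"0.1.10"`, whereas `(0, 1, 9) < (0, 1, 10)` orders them correctly.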

File without changes (112 files)

{pixie_qa-0.1.10 → pixie_qa-0.1.11}/.github/copilot-instructions.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/.github/workflows/publish.yml RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/.gitignore RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/LICENSE RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/async-handler-processing.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/autoevals-adapters.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/cli-dataset-commands.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/dataset-management.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/deep-research-demo.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/eval-harness.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/expected-output-in-evals.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/instrumentation-module-implementation.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/loud-failure-mode.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/manual-instrumentation-usability.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/observation-store-implementation.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/pixie-directory-and-skill-improvements.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/pixie-test-e2e-suite.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/root-package-exports-and-trace-id.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/scorecard-branding-and-skill-version-check.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/scorecard-eval-detail-dialog.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/skill-v2-and-rootdir-discovery.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/test-scorecard.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/changelogs/usability-utils.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/docs/package.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/cli/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/cli/dataset_command.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/cli/main.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/cli/test_command.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/config.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/dataset/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/dataset/models.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/dataset/store.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/evals/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/evals/criteria.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/evals/eval_utils.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/evals/evaluation.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/evals/runner.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/evals/scorecard.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/evals/scorers.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/evals/trace_capture.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/evals/trace_helpers.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/favicon.png RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/instrumentation/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/instrumentation/context.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/instrumentation/handler.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/instrumentation/handlers.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/instrumentation/instrumentors.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/instrumentation/observation.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/instrumentation/processor.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/instrumentation/queue.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/instrumentation/spans.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/storage/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/storage/evaluable.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/storage/piccolo_conf.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/storage/piccolo_migrations/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/storage/serialization.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/storage/store.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/storage/tables.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/pixie/storage/tree.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/skills/eval-driven-dev/references/pixie-api.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/specs/agent-skill-1.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/specs/agent-skill.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/specs/autoevals-adapters.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/specs/dataset-management.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/specs/evals-harness.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/specs/expected-output-in-evals.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/specs/instrumentation.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/specs/manual-instrumentation-usability.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/specs/storage.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/specs/usability-utils.md RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/cli/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/cli/e2e_cases.json RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/cli/e2e_fixtures/conftest.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/cli/e2e_fixtures/datasets/customer-faq.json RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/cli/e2e_fixtures/mock_evaluators.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/cli/e2e_fixtures/test_customer_faq.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/cli/test_dataset_command.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/cli/test_e2e_pixie_test.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/cli/test_main.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/dataset/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/dataset/test_models.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/dataset/test_store.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/evals/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/evals/test_criteria.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/evals/test_eval_utils.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/evals/test_evaluation.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/evals/test_runner.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/evals/test_scorecard.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/evals/test_scorers.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/evals/test_trace_capture.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/evals/test_trace_helpers.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/instrumentation/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/instrumentation/conftest.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/instrumentation/test_context.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/instrumentation/test_handler.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/instrumentation/test_integration.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/instrumentation/test_observation.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/instrumentation/test_processor.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/instrumentation/test_queue.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/instrumentation/test_spans.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/instrumentation/test_storage_handler.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/observation_store/__init__.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/observation_store/conftest.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/observation_store/test_evaluable.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/observation_store/test_serialization.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/observation_store/test_store.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/observation_store/test_tree.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/test_config.py RENAMED
{pixie_qa-0.1.10 → pixie_qa-0.1.11}/tests/pixie/test_init.py RENAMED
