
opencode-skills-collection 1.0.186 → 1.0.187


Files changed (71)

package/bundled-skills/.antigravity-install-manifest.json +5 -1
package/bundled-skills/3d-web-experience/SKILL.md +152 -37
package/bundled-skills/agent-evaluation/SKILL.md +1088 -26
package/bundled-skills/agent-memory-systems/SKILL.md +1037 -25
package/bundled-skills/agent-tool-builder/SKILL.md +668 -16
package/bundled-skills/ai-agents-architect/SKILL.md +271 -31
package/bundled-skills/ai-product/SKILL.md +716 -26
package/bundled-skills/ai-wrapper-product/SKILL.md +450 -44
package/bundled-skills/algolia-search/SKILL.md +867 -15
package/bundled-skills/autonomous-agents/SKILL.md +1033 -26
package/bundled-skills/aws-serverless/SKILL.md +1046 -35
package/bundled-skills/azure-functions/SKILL.md +1318 -19
package/bundled-skills/browser-automation/SKILL.md +1065 -28
package/bundled-skills/browser-extension-builder/SKILL.md +159 -32
package/bundled-skills/bullmq-specialist/SKILL.md +347 -16
package/bundled-skills/clerk-auth/SKILL.md +796 -15
package/bundled-skills/computer-use-agents/SKILL.md +1870 -28
package/bundled-skills/context-window-management/SKILL.md +271 -18
package/bundled-skills/conversation-memory/SKILL.md +453 -24
package/bundled-skills/crewai/SKILL.md +252 -46
package/bundled-skills/discord-bot-architect/SKILL.md +1207 -34
package/bundled-skills/docs/integrations/jetski-cortex.md +3 -3
package/bundled-skills/docs/integrations/jetski-gemini-loader/README.md +1 -1
package/bundled-skills/docs/maintainers/repo-growth-seo.md +3 -3
package/bundled-skills/docs/maintainers/skills-update-guide.md +1 -1
package/bundled-skills/docs/users/bundles.md +1 -1
package/bundled-skills/docs/users/claude-code-skills.md +1 -1
package/bundled-skills/docs/users/gemini-cli-skills.md +1 -1
package/bundled-skills/docs/users/getting-started.md +1 -1
package/bundled-skills/docs/users/kiro-integration.md +1 -1
package/bundled-skills/docs/users/usage.md +4 -4
package/bundled-skills/docs/users/visual-guide.md +4 -4
package/bundled-skills/email-systems/SKILL.md +646 -26
package/bundled-skills/faf-expert/SKILL.md +221 -0
package/bundled-skills/faf-wizard/SKILL.md +252 -0
package/bundled-skills/file-uploads/SKILL.md +212 -11
package/bundled-skills/firebase/SKILL.md +646 -16
package/bundled-skills/gcp-cloud-run/SKILL.md +1117 -32
package/bundled-skills/graphql/SKILL.md +1026 -27
package/bundled-skills/hubspot-integration/SKILL.md +804 -19
package/bundled-skills/idea-darwin/SKILL.md +120 -0
package/bundled-skills/inngest/SKILL.md +431 -16
package/bundled-skills/interactive-portfolio/SKILL.md +342 -44
package/bundled-skills/langfuse/SKILL.md +296 -41
package/bundled-skills/langgraph/SKILL.md +259 -50
package/bundled-skills/micro-saas-launcher/SKILL.md +343 -44
package/bundled-skills/neon-postgres/SKILL.md +572 -15
package/bundled-skills/nextjs-supabase-auth/SKILL.md +269 -21
package/bundled-skills/notion-template-business/SKILL.md +371 -44
package/bundled-skills/personal-tool-builder/SKILL.md +537 -44
package/bundled-skills/plaid-fintech/SKILL.md +825 -19
package/bundled-skills/prompt-caching/SKILL.md +438 -25
package/bundled-skills/rag-engineer/SKILL.md +271 -29
package/bundled-skills/salesforce-development/SKILL.md +912 -19
package/bundled-skills/satori/SKILL.md +54 -0
package/bundled-skills/scroll-experience/SKILL.md +381 -44
package/bundled-skills/segment-cdp/SKILL.md +817 -19
package/bundled-skills/shopify-apps/SKILL.md +1475 -19
package/bundled-skills/slack-bot-builder/SKILL.md +1162 -28
package/bundled-skills/telegram-bot-builder/SKILL.md +152 -37
package/bundled-skills/telegram-mini-app/SKILL.md +445 -44
package/bundled-skills/trigger-dev/SKILL.md +916 -27
package/bundled-skills/twilio-communications/SKILL.md +1310 -28
package/bundled-skills/upstash-qstash/SKILL.md +898 -27
package/bundled-skills/vercel-deployment/SKILL.md +637 -39
package/bundled-skills/viral-generator-builder/SKILL.md +132 -37
package/bundled-skills/voice-agents/SKILL.md +937 -27
package/bundled-skills/voice-ai-development/SKILL.md +375 -46
package/bundled-skills/workflow-automation/SKILL.md +982 -29
package/bundled-skills/zapier-make-patterns/SKILL.md +772 -27
package/package.json +1 -1

package/bundled-skills/langfuse/SKILL.md CHANGED

@@ -1,13 +1,21 @@
 ---
 name: langfuse
-description: "You are an expert in LLM observability and evaluation. You think in terms of traces, spans, and metrics. You know that LLM applications need monitoring just like traditional software - but with different dimensions (cost, quality, latency)."
+description: Expert in Langfuse - the open-source LLM observability platform.
+  Covers tracing, prompt management, evaluation, datasets, and integration with
+  LangChain, LlamaIndex, and OpenAI. Essential for debugging, monitoring, and
+  improving LLM applications in production.
 risk: unknown
-source: "vibeship-spawner-skills (Apache 2.0)"
-date_added: "2026-02-27"
+source: vibeship-spawner-skills (Apache 2.0)
+date_added: 2026-02-27
 ---
 # Langfuse
+Expert in Langfuse - the open-source LLM observability platform. Covers tracing,
+prompt management, evaluation, datasets, and integration with LangChain, LlamaIndex,
+and OpenAI. Essential for debugging, monitoring, and improving LLM applications
+in production.
 **Role**: LLM Observability Architect
 You are an expert in LLM observability and evaluation. You think in terms of
@@ -15,6 +23,14 @@ traces, spans, and metrics. You know that LLM applications need monitoring
 just like traditional software - but with different dimensions (cost, quality,
 latency). You use data to drive prompt improvements and catch regressions.
+### Expertise
+- Tracing architecture
+- Prompt versioning
+- Evaluation strategies
+- Cost optimization
+- Quality monitoring
 ## Capabilities
 - LLM tracing and observability
@@ -25,11 +41,42 @@ latency). You use data to drive prompt improvements and catch regressions.
 - Performance monitoring
 - A/B testing prompts
-## Requirements
+## Prerequisites
+- 0: LLM application basics
+- 1: API integration experience
+- 2: Understanding of tracing concepts
+- Required skills: Python or TypeScript/JavaScript, Langfuse account (cloud or self-hosted), LLM API keys
+## Scope
+- 0: Self-hosted requires infrastructure
+- 1: High-volume may need optimization
+- 2: Real-time dashboard has latency
+- 3: Evaluation requires setup
+## Ecosystem
+### Primary
+- Langfuse Cloud
+- Langfuse Self-hosted
+- Python SDK
+- JS/TS SDK
+### Common_integrations
+- LangChain
+- LlamaIndex
+- OpenAI SDK
+- Anthropic SDK
+- Vercel AI SDK
+### Platforms
-- Python or TypeScript/JavaScript
-- Langfuse account (cloud or self-hosted)
-- LLM API keys
+- Any Python/JS backend
+- Serverless functions
+- Jupyter notebooks
 ## Patterns
@@ -39,7 +86,6 @@ Instrument LLM calls with Langfuse
 **When to use**: Any LLM application
-```python
 from langfuse import Langfuse
 # Initialize client
@@ -91,7 +137,6 @@ trace.score(
 # Flush before exit (important in serverless)
 langfuse.flush()
-```
 ### OpenAI Integration
@@ -99,7 +144,6 @@ Automatic tracing with OpenAI SDK
 **When to use**: OpenAI-based applications
-```python
 from langfuse.openai import openai
 # Drop-in replacement for OpenAI client
@@ -139,7 +183,6 @@ async def main():
         messages=[{"role": "user", "content": "Hello"}],
         name="async-greeting"
     )
-```
 ### LangChain Integration
@@ -147,7 +190,6 @@ Trace LangChain applications
 **When to use**: LangChain-based applications
-```python
 from langchain_openai import ChatOpenAI
 from langchain_core.prompts import ChatPromptTemplate
 from langfuse.callback import CallbackHandler
@@ -194,50 +236,263 @@ result = agent_executor.invoke(
     {"input": "What's the weather?"},
     config={"callbacks": [langfuse_handler]}
 )
-```
-## Anti-Patterns
+### Prompt Management
+Version and deploy prompts
+**When to use**: Managing prompts across environments
+from langfuse import Langfuse
+langfuse = Langfuse()
+# Fetch prompt from Langfuse
+# (Create in UI or via API first)
+prompt = langfuse.get_prompt("customer-support-v2")
+# Get compiled prompt with variables
+compiled = prompt.compile(
+    customer_name="John",
+    issue="billing question"
+)
+# Use with OpenAI
+response = openai.chat.completions.create(
+    model=prompt.config.get("model", "gpt-4o"),
+    messages=compiled,
+    temperature=prompt.config.get("temperature", 0.7)
+)
+# Link generation to prompt version
+trace = langfuse.trace(name="support-chat")
+generation = trace.generation(
+    name="response",
+    model="gpt-4o",
+    prompt=prompt  # Links to specific version
+)
+# Create/update prompts via API
+langfuse.create_prompt(
+    name="customer-support-v3",
+    prompt=[
+        {"role": "system", "content": "You are a support agent..."},
+        {"role": "user", "content": "{{user_message}}"}
+    ],
+    config={
+        "model": "gpt-4o",
+        "temperature": 0.7
+    },
+    labels=["production"]  # or ["staging", "development"]
+)
+# Fetch specific label
+prompt = langfuse.get_prompt(
+    "customer-support-v3",
+    label="production"  # Gets latest with this label
+)
+### Evaluation and Scoring
+Evaluate LLM outputs systematically
+**When to use**: Quality assurance and improvement
+from langfuse import Langfuse
+langfuse = Langfuse()
+# Manual scoring in code
+trace = langfuse.trace(name="qa-flow")
+# After getting response
+trace.score(
+    name="relevance",
+    value=0.85,  # 0-1 scale
+    comment="Response addressed the question"
+)
+trace.score(
+    name="correctness",
+    value=1,  # Binary: 0 or 1
+    data_type="BOOLEAN"
+)
+# LLM-as-judge evaluation
+def evaluate_response(question: str, response: str) -> float:
+    eval_prompt = f"""
+    Rate the response quality from 0 to 1.
+    Question: {question}
+    Response: {response}
+    Output only a number between 0 and 1.
+    """
+    result = openai.chat.completions.create(
+        model="gpt-4o-mini",  # Cheaper model for eval
+        messages=[{"role": "user", "content": eval_prompt}]
+    )
+    return float(result.choices[0].message.content.strip())
+# Score asynchronously
+score = evaluate_response(question, response)
+trace.score(
+    name="quality-llm-judge",
+    value=score
+)
+# Create evaluation dataset
+dataset = langfuse.create_dataset(name="support-qa-v1")
+# Add items to dataset
+langfuse.create_dataset_item(
+    dataset_name="support-qa-v1",
+    input={"question": "How do I reset my password?"},
+    expected_output="Go to settings > security > reset password"
+)
+# Run evaluation on dataset
+dataset = langfuse.get_dataset("support-qa-v1")
+for item in dataset.items:
+    # Generate response
+    response = generate_response(item.input["question"])
+    # Link to dataset item
+    trace = langfuse.trace(name="eval-run")
+    trace.generation(
+        name="response",
+        input=item.input,
+        output=response
+    )
+    # Score against expected
+    similarity = calculate_similarity(response, item.expected_output)
+    trace.score(name="similarity", value=similarity)
+    # Link trace to dataset item
+    item.link(trace, "eval-run-1")
+### Decorator Pattern
+Clean instrumentation with decorators
+**When to use**: Function-based applications
+from langfuse.decorators import observe, langfuse_context
-### ❌ Not Flushing in Serverless
+@observe()  # Creates a trace
+def chat_handler(user_id: str, message: str) -> str:
+    # All nested @observe calls become spans
+    context = get_context(message)
+    response = generate_response(message, context)
+    return response
-**Why bad**: Traces are batched.
-Serverless may exit before flush.
-Data is lost.
+@observe()  # Becomes a span under parent trace
+def get_context(message: str) -> str:
+    # RAG retrieval
+    docs = retriever.get_relevant_documents(message)
+    return "\n".join([d.page_content for d in docs])
-**Instead**: Always call langfuse.flush() at end.
-Use context managers where available.
-Consider sync mode for critical traces.
+@observe(as_type="generation")  # LLM generation span
+def generate_response(message: str, context: str) -> str:
+    response = openai.chat.completions.create(
+        model="gpt-4o",
+        messages=[
+            {"role": "system", "content": f"Context: {context}"},
+            {"role": "user", "content": message}
+        ]
+    )
+    return response.choices[0].message.content
+# Add metadata and scores
+@observe()
+def main_flow(user_input: str):
+    # Update current trace
+    langfuse_context.update_current_trace(
+        user_id="user-123",
+        session_id="session-456",
+        tags=["production"]
+    )
+    result = process(user_input)
+    # Score the trace
+    langfuse_context.score_current_trace(
+        name="success",
+        value=1 if result else 0
+    )
-### ❌ Tracing Everything
+    return result
-**Why bad**: Noisy traces.
-Performance overhead.
-Hard to find important info.
+# Works with async
+@observe()
+async def async_handler(message: str):
+    result = await async_generate(message)
+    return result
+## Collaboration
+### Delegation Triggers
-**Instead**: Focus on: LLM calls, key logic, user actions.
-Group related operations.
-Use meaningful span names.
+- agent|langgraph|graph -> langgraph (Need to build agent to monitor)
+- crewai|multi-agent|crew -> crewai (Need to build crew to monitor)
+- structured output|extraction -> structured-output (Need to build extraction to monitor)
-### ❌ No User/Session IDs
+### Observable LangGraph Agent
-**Why bad**: Can't debug specific users.
-Can't track sessions.
-Analytics limited.
+Skills: langfuse, langgraph
-**Instead**: Always pass user_id and session_id.
-Use consistent identifiers.
-Add relevant metadata.
+Workflow:
+```
+1. Build agent with LangGraph
+2. Add Langfuse callback handler
+3. Trace all LLM calls and tool uses
+4. Score outputs for quality
+5. Monitor and iterate
+```
-## Limitations
+### Monitored RAG Pipeline
-- Self-hosted requires infrastructure
-- High-volume may need optimization
-- Real-time dashboard has latency
-- Evaluation requires setup
+Skills: langfuse, structured-output
+Workflow:
+```
+1. Build RAG with retrieval and generation
+2. Trace retrieval and LLM calls
+3. Score relevance and accuracy
+4. Track costs and latency
+5. Optimize based on data
+```
+### Evaluated Agent System
+Skills: langfuse, langgraph, structured-output
+Workflow:
+```
+1. Build agent with structured outputs
+2. Create evaluation dataset
+3. Run evaluations with traces
+4. Compare prompt versions
+5. Deploy best performers
+```
 ## Related Skills
 Works well with: `langgraph`, `crewai`, `structured-output`, `autonomous-agents`
 ## When to Use
-This skill is applicable to execute the workflow or actions described in the overview.
+- User mentions or implies: langfuse
+- User mentions or implies: llm observability
+- User mentions or implies: llm tracing
+- User mentions or implies: prompt management
+- User mentions or implies: llm evaluation
+- User mentions or implies: monitor llm
+- User mentions or implies: debug llm
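
The "Delegation Triggers" entries added in this version (for example `agent|langgraph|graph -> langgraph`) read like keyword routing rules, but the diff does not say how a consuming tool evaluates them. The sketch below shows one possible reading, assuming each left-hand side is a set of alternatives matched case-insensitively on word boundaries. The names `DELEGATION_TRIGGERS` and `delegate` are hypothetical and do not appear in the package.

```python
import re

# Trigger patterns and target skills copied from the new langfuse SKILL.md.
# ASSUMPTION: case-insensitive, word-boundary regex matching. The diff does
# not specify how triggers are actually evaluated.
DELEGATION_TRIGGERS = [
    (r"agent|langgraph|graph", "langgraph"),
    (r"crewai|multi-agent|crew", "crewai"),
    (r"structured output|extraction", "structured-output"),
]


def delegate(message: str) -> list[str]:
    """Return every skill whose trigger pattern occurs in the message."""
    matched = []
    for pattern, skill in DELEGATION_TRIGGERS:
        # Wrap the alternatives in a non-capturing group so the word
        # boundaries apply to each alternative, not just the first and last.
        if re.search(rf"\b(?:{pattern})\b", message, flags=re.IGNORECASE):
            matched.append(skill)
    return matched


if __name__ == "__main__":
    print(delegate("Build a LangGraph agent and trace it"))
    print(delegate("Coordinate a multi-agent crew"))
```

Under this reading a message such as "Coordinate a multi-agent crew" matches both `langgraph` (through `agent`) and `crewai`, so a real router would likely need a tie-breaking rule that the skill file does not define.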