qtype 0.1.11__py3-none-any.whl → 0.1.13__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- qtype/` +0 -0
- qtype/application/__init__.py +0 -2
- qtype/application/converters/tools_from_api.py +67 -57
- qtype/application/converters/tools_from_module.py +66 -32
- qtype/base/types.py +6 -1
- qtype/commands/convert.py +3 -6
- qtype/commands/generate.py +97 -10
- qtype/commands/mcp.py +68 -0
- qtype/commands/run.py +116 -44
- qtype/commands/validate.py +4 -4
- qtype/docs/.pages +8 -0
- qtype/docs/Concepts/mental-model-and-philosophy.md +363 -0
- qtype/docs/Contributing/.pages +4 -0
- qtype/docs/Contributing/index.md +283 -0
- qtype/docs/Contributing/roadmap.md +81 -0
- qtype/docs/Decisions/ADR-001-Chat-vs-Completion-Endpoint-Features.md +56 -0
- qtype/docs/Gallery/dataflow_pipelines.md +81 -0
- qtype/docs/Gallery/dataflow_pipelines.mermaid +45 -0
- qtype/docs/Gallery/research_assistant.md +97 -0
- qtype/docs/Gallery/research_assistant.mermaid +42 -0
- qtype/docs/Gallery/simple_chatbot.md +38 -0
- qtype/docs/Gallery/simple_chatbot.mermaid +35 -0
- qtype/docs/How To/Authentication/configure_aws_authentication.md +60 -0
- qtype/docs/How To/Authentication/use_api_key_authentication.md +40 -0
- qtype/docs/How To/Command Line Usage/load_multiple_inputs_from_files.md +77 -0
- qtype/docs/How To/Command Line Usage/pass_inputs_on_the_cli.md +52 -0
- qtype/docs/How To/Command Line Usage/serve_with_auto_reload.md +27 -0
- qtype/docs/How To/Data Processing/adjust_concurrency.md +40 -0
- qtype/docs/How To/Data Processing/cache_step_results.md +71 -0
- qtype/docs/How To/Data Processing/decode_json_xml.md +24 -0
- qtype/docs/How To/Data Processing/explode_collections.md +40 -0
- qtype/docs/How To/Data Processing/gather_results.md +68 -0
- qtype/docs/How To/Data Processing/invoke_other_flows.md +71 -0
- qtype/docs/How To/Data Processing/load_data_from_athena.md +49 -0
- qtype/docs/How To/Data Processing/read_data_from_files.md +61 -0
- qtype/docs/How To/Data Processing/read_sql_databases.md +46 -0
- qtype/docs/How To/Data Processing/write_data_to_file.md +39 -0
- qtype/docs/How To/Invoke Models/call_large_language_models.md +51 -0
- qtype/docs/How To/Invoke Models/create_embeddings.md +49 -0
- qtype/docs/How To/Invoke Models/reuse_prompts_with_templates.md +38 -0
- qtype/docs/How To/Language Features/include_qtype_yaml.md +45 -0
- qtype/docs/How To/Language Features/include_raw_text_from_other_files.md +48 -0
- qtype/docs/How To/Language Features/reference_entities_by_id.md +51 -0
- qtype/docs/How To/Language Features/use_agent_skills.md +29 -0
- qtype/docs/How To/Language Features/use_environment_variables.md +48 -0
- qtype/docs/How To/Language Features/use_optional_variables.md +42 -0
- qtype/docs/How To/Language Features/use_qtype_mcp.md +59 -0
- qtype/docs/How To/Observability & Debugging/trace_calls_with_open_telemetry.md +49 -0
- qtype/docs/How To/Observability & Debugging/validate_qtype_yaml.md +36 -0
- qtype/docs/How To/Observability & Debugging/visualize_application_architecture.md +61 -0
- qtype/docs/How To/Observability & Debugging/visualize_example.mermaid +35 -0
- qtype/docs/How To/Qtype Server/flow_as_ui.png +0 -0
- qtype/docs/How To/Qtype Server/serve_flows_as_apis.md +40 -0
- qtype/docs/How To/Qtype Server/serve_flows_as_ui.md +41 -0
- qtype/docs/How To/Qtype Server/use_conversational_interfaces.md +56 -0
- qtype/docs/How To/Qtype Server/use_variables_with_ui_hints.md +48 -0
- qtype/docs/How To/Tools & Integration/bind_tool_inputs_and_outputs.md +47 -0
- qtype/docs/How To/Tools & Integration/create_tools_from_openapi_specifications.md +85 -0
- qtype/docs/How To/Tools & Integration/create_tools_from_python_modules.md +87 -0
- qtype/docs/Reference/cli.md +336 -0
- qtype/docs/Reference/plugins.md +99 -0
- qtype/docs/Reference/semantic-validation-rules.md +184 -0
- qtype/docs/Tutorials/.pages +1 -0
- qtype/docs/Tutorials/01-first-qtype-application.md +249 -0
- qtype/docs/Tutorials/02-conversational-chatbot.md +327 -0
- qtype/docs/Tutorials/03-structured-data.md +480 -0
- qtype/docs/Tutorials/04-tools-and-function-calling.md +476 -0
- qtype/docs/Tutorials/example_chat.png +0 -0
- qtype/docs/Tutorials/index.md +92 -0
- qtype/docs/components/APIKeyAuthProvider.md +7 -0
- qtype/docs/components/APITool.md +10 -0
- qtype/docs/components/AWSAuthProvider.md +13 -0
- qtype/docs/components/AWSSecretManager.md +5 -0
- qtype/docs/components/Agent.md +6 -0
- qtype/docs/components/Aggregate.md +7 -0
- qtype/docs/components/AggregateStats.md +7 -0
- qtype/docs/components/Application.md +22 -0
- qtype/docs/components/AuthorizationProvider.md +6 -0
- qtype/docs/components/AuthorizationProviderList.md +5 -0
- qtype/docs/components/BearerTokenAuthProvider.md +6 -0
- qtype/docs/components/BedrockReranker.md +8 -0
- qtype/docs/components/ChatContent.md +7 -0
- qtype/docs/components/ChatMessage.md +6 -0
- qtype/docs/components/Collect.md +6 -0
- qtype/docs/components/ConstantPath.md +5 -0
- qtype/docs/components/Construct.md +6 -0
- qtype/docs/components/CustomType.md +7 -0
- qtype/docs/components/Decoder.md +8 -0
- qtype/docs/components/DecoderFormat.md +8 -0
- qtype/docs/components/DocToTextConverter.md +7 -0
- qtype/docs/components/Document.md +7 -0
- qtype/docs/components/DocumentEmbedder.md +6 -0
- qtype/docs/components/DocumentIndex.md +7 -0
- qtype/docs/components/DocumentSearch.md +7 -0
- qtype/docs/components/DocumentSource.md +12 -0
- qtype/docs/components/DocumentSplitter.md +9 -0
- qtype/docs/components/Echo.md +8 -0
- qtype/docs/components/Embedding.md +7 -0
- qtype/docs/components/EmbeddingModel.md +6 -0
- qtype/docs/components/Explode.md +5 -0
- qtype/docs/components/FieldExtractor.md +21 -0
- qtype/docs/components/FileSource.md +6 -0
- qtype/docs/components/FileWriter.md +7 -0
- qtype/docs/components/Flow.md +14 -0
- qtype/docs/components/FlowInterface.md +7 -0
- qtype/docs/components/Index.md +8 -0
- qtype/docs/components/IndexUpsert.md +6 -0
- qtype/docs/components/InvokeEmbedding.md +7 -0
- qtype/docs/components/InvokeFlow.md +8 -0
- qtype/docs/components/InvokeTool.md +8 -0
- qtype/docs/components/LLMInference.md +9 -0
- qtype/docs/components/ListType.md +5 -0
- qtype/docs/components/Memory.md +8 -0
- qtype/docs/components/MessageRole.md +14 -0
- qtype/docs/components/Model.md +10 -0
- qtype/docs/components/ModelList.md +5 -0
- qtype/docs/components/OAuth2AuthProvider.md +9 -0
- qtype/docs/components/PrimitiveTypeEnum.md +20 -0
- qtype/docs/components/PromptTemplate.md +7 -0
- qtype/docs/components/PythonFunctionTool.md +7 -0
- qtype/docs/components/RAGChunk.md +7 -0
- qtype/docs/components/RAGDocument.md +10 -0
- qtype/docs/components/RAGSearchResult.md +8 -0
- qtype/docs/components/Reranker.md +5 -0
- qtype/docs/components/SQLSource.md +8 -0
- qtype/docs/components/Search.md +7 -0
- qtype/docs/components/SearchResult.md +7 -0
- qtype/docs/components/SecretManager.md +7 -0
- qtype/docs/components/SecretReference.md +7 -0
- qtype/docs/components/Source.md +5 -0
- qtype/docs/components/Step.md +8 -0
- qtype/docs/components/TelemetrySink.md +9 -0
- qtype/docs/components/Tool.md +9 -0
- qtype/docs/components/ToolList.md +5 -0
- qtype/docs/components/TypeList.md +5 -0
- qtype/docs/components/Variable.md +8 -0
- qtype/docs/components/VariableList.md +5 -0
- qtype/docs/components/VectorIndex.md +7 -0
- qtype/docs/components/VectorSearch.md +6 -0
- qtype/docs/components/VertexAuthProvider.md +9 -0
- qtype/docs/components/Writer.md +5 -0
- qtype/docs/example_ui.png +0 -0
- qtype/docs/index.md +81 -0
- qtype/docs/legacy_how_tos/.pages +6 -0
- qtype/docs/legacy_how_tos/Configuration/modular-yaml.md +366 -0
- qtype/docs/legacy_how_tos/Configuration/phoenix_projects.png +0 -0
- qtype/docs/legacy_how_tos/Configuration/phoenix_traces.png +0 -0
- qtype/docs/legacy_how_tos/Configuration/reference-by-id.md +251 -0
- qtype/docs/legacy_how_tos/Configuration/telemetry-setup.md +259 -0
- qtype/docs/legacy_how_tos/Data Types/custom-types.md +52 -0
- qtype/docs/legacy_how_tos/Data Types/domain-types.md +113 -0
- qtype/docs/legacy_how_tos/Debugging/visualize-apps.md +147 -0
- qtype/docs/legacy_how_tos/Tools/api-tools.md +29 -0
- qtype/docs/legacy_how_tos/Tools/python-tools.md +299 -0
- qtype/docs/skills/architect/SKILL.md +188 -0
- qtype/docs/skills/architect/references/cheatsheet.md +198 -0
- qtype/docs/skills/architect/references/patterns.md +29 -0
- qtype/docs/stylesheets/extra.css +27 -0
- qtype/dsl/custom_types.py +2 -1
- qtype/dsl/linker.py +23 -7
- qtype/dsl/loader.py +3 -3
- qtype/dsl/model.py +181 -67
- qtype/examples/authentication/aws_authentication.qtype.yaml +63 -0
- qtype/examples/conversational_ai/hello_world_chat.qtype.yaml +43 -0
- qtype/examples/conversational_ai/simple_chatbot.qtype.yaml +40 -0
- qtype/examples/data_processing/athena_query.qtype.yaml +56 -0
- qtype/examples/data_processing/batch_inputs.csv +5 -0
- qtype/examples/data_processing/batch_processing.qtype.yaml +54 -0
- qtype/examples/data_processing/cache_step_results.qtype.yaml +78 -0
- qtype/examples/data_processing/collect_results.qtype.yaml +55 -0
- qtype/examples/data_processing/create_sample_db.py +129 -0
- qtype/examples/data_processing/dataflow_pipelines.qtype.yaml +108 -0
- qtype/examples/data_processing/decode_json.qtype.yaml +23 -0
- qtype/examples/data_processing/explode_items.qtype.yaml +25 -0
- qtype/examples/data_processing/invoke_other_flows.qtype.yaml +98 -0
- qtype/examples/data_processing/read_file.qtype.yaml +60 -0
- qtype/examples/data_processing/reviews.db +0 -0
- qtype/examples/data_processing/sample_article.txt +1 -0
- qtype/examples/data_processing/sample_documents.jsonl +5 -0
- qtype/examples/invoke_models/create_embeddings.qtype.yaml +28 -0
- qtype/examples/invoke_models/simple_llm_call.qtype.yaml +32 -0
- qtype/examples/language_features/include_raw.qtype.yaml +27 -0
- qtype/examples/language_features/optional_variables.qtype.yaml +32 -0
- qtype/examples/language_features/story_prompt.txt +6 -0
- qtype/examples/language_features/ui_hints.qtype.yaml +52 -0
- qtype/examples/legacy/bedrock/data_analysis_with_telemetry.qtype.yaml +169 -0
- qtype/examples/legacy/bedrock/hello_world.qtype.yaml +39 -0
- qtype/examples/legacy/bedrock/hello_world_chat.qtype.yaml +37 -0
- qtype/examples/legacy/bedrock/hello_world_chat_with_telemetry.qtype.yaml +40 -0
- qtype/examples/legacy/bedrock/hello_world_chat_with_thinking.qtype.yaml +40 -0
- qtype/examples/legacy/bedrock/hello_world_completion.qtype.yaml +41 -0
- qtype/examples/legacy/bedrock/hello_world_completion_with_auth.qtype.yaml +44 -0
- qtype/examples/legacy/bedrock/simple_agent_chat.qtype.yaml +46 -0
- qtype/examples/legacy/chat_with_langfuse.qtype.yaml +50 -0
- qtype/examples/legacy/data/customers.csv +6 -0
- qtype/examples/legacy/data_processor.qtype.yaml +48 -0
- qtype/examples/legacy/echo/debug_example.qtype.yaml +59 -0
- qtype/examples/legacy/echo/prompt.qtype.yaml +22 -0
- qtype/examples/legacy/echo/readme.md +29 -0
- qtype/examples/legacy/echo/test.qtype.yaml +26 -0
- qtype/examples/legacy/echo/video.qtype.yaml +20 -0
- qtype/examples/legacy/field_extractor_example.qtype.yaml +137 -0
- qtype/examples/legacy/multi_flow_example.qtype.yaml +125 -0
- qtype/examples/legacy/openai/hello_world_chat.qtype.yaml +43 -0
- qtype/examples/legacy/openai/hello_world_chat_with_telemetry.qtype.yaml +46 -0
- qtype/examples/legacy/qtype_plugin_example.py +51 -0
- qtype/examples/legacy/rag.qtype.yaml +207 -0
- qtype/examples/legacy/sample_data.txt +43 -0
- qtype/examples/legacy/time_utilities.qtype.yaml +64 -0
- qtype/examples/legacy/vertex/README.md +11 -0
- qtype/examples/legacy/vertex/hello_world_chat.qtype.yaml +36 -0
- qtype/examples/legacy/vertex/hello_world_completion.qtype.yaml +40 -0
- qtype/examples/legacy/vertex/hello_world_completion_with_auth.qtype.yaml +45 -0
- qtype/examples/observability_debugging/trace_with_opentelemetry.qtype.yaml +40 -0
- qtype/examples/research_assistant/research_assistant.qtype.yaml +94 -0
- qtype/examples/research_assistant/tavily.oas.yaml +722 -0
- qtype/examples/research_assistant/tavily.qtype.yaml +216 -0
- qtype/examples/tutorials/01_hello_world.qtype.yaml +48 -0
- qtype/examples/tutorials/02_conversational_chat.qtype.yaml +37 -0
- qtype/examples/tutorials/03_structured_data.qtype.yaml +130 -0
- qtype/examples/tutorials/04_tools_and_function_calling.qtype.yaml +89 -0
- qtype/interpreter/api.py +4 -1
- qtype/interpreter/base/base_step_executor.py +3 -1
- qtype/interpreter/base/stream_emitter.py +19 -13
- qtype/interpreter/conversions.py +7 -3
- qtype/interpreter/converters.py +142 -26
- qtype/interpreter/executors/agent_executor.py +2 -3
- qtype/interpreter/executors/aggregate_executor.py +3 -4
- qtype/interpreter/executors/construct_executor.py +15 -15
- qtype/interpreter/executors/doc_to_text_executor.py +1 -3
- qtype/interpreter/executors/field_extractor_executor.py +13 -12
- qtype/interpreter/executors/file_source_executor.py +21 -34
- qtype/interpreter/executors/file_writer_executor.py +4 -4
- qtype/interpreter/executors/index_upsert_executor.py +1 -1
- qtype/interpreter/executors/invoke_embedding_executor.py +1 -4
- qtype/interpreter/executors/invoke_flow_executor.py +2 -2
- qtype/interpreter/executors/invoke_tool_executor.py +19 -18
- qtype/interpreter/executors/llm_inference_executor.py +16 -18
- qtype/interpreter/executors/prompt_template_executor.py +1 -3
- qtype/interpreter/executors/sql_source_executor.py +1 -1
- qtype/interpreter/resource_cache.py +3 -1
- qtype/interpreter/rich_progress.py +6 -3
- qtype/interpreter/stream/chat/converter.py +25 -17
- qtype/interpreter/stream/chat/ui_request_to_domain_type.py +2 -2
- qtype/interpreter/tools/function_tool_helper.py +11 -10
- qtype/interpreter/types.py +89 -4
- qtype/interpreter/typing.py +35 -38
- qtype/mcp/__init__.py +0 -0
- qtype/mcp/server.py +722 -0
- qtype/schema/qtype.schema.json +4016 -0
- qtype/semantic/checker.py +20 -1
- qtype/semantic/generate.py +6 -9
- qtype/semantic/model.py +26 -33
- qtype/semantic/resolver.py +7 -0
- qtype/semantic/visualize.py +45 -53
- {qtype-0.1.11.dist-info → qtype-0.1.13.dist-info}/METADATA +65 -44
- qtype-0.1.13.dist-info/RECORD +352 -0
- {qtype-0.1.11.dist-info → qtype-0.1.13.dist-info}/WHEEL +1 -2
- qtype/application/facade.py +0 -177
- qtype-0.1.11.dist-info/RECORD +0 -142
- qtype-0.1.11.dist-info/top_level.txt +0 -1
- {qtype-0.1.11.dist-info → qtype-0.1.13.dist-info}/entry_points.txt +0 -0
- {qtype-0.1.11.dist-info → qtype-0.1.13.dist-info}/licenses/LICENSE +0 -0
|
@@ -0,0 +1,327 @@
|
|
|
1
|
+
# Build a Conversational Chatbot
|
|
2
|
+
|
|
3
|
+
**Time:** 20 minutes
|
|
4
|
+
**Prerequisites:** [Tutorial 1: Your First QType Application](01-first-qtype-application.md)
|
|
5
|
+
**Example:** [`02_conversational_chat.qtype.yaml`](https://github.com/bazaarvoice/qtype/blob/main/examples/02_conversational_chat.qtype.yaml)
|
|
6
|
+
|
|
7
|
+
**What you'll learn:**
|
|
8
|
+
|
|
9
|
+
* Stateful flows with memory
|
|
10
|
+
* Using the web ui
|
|
11
|
+
* Domain types
|
|
12
|
+
|
|
13
|
+
**What you'll build:** A stateful chatbot that maintains conversation history and provides contextual responses.
|
|
14
|
+
|
|
15
|
+
---
|
|
16
|
+
|
|
17
|
+
## Background: A Quick Note on Flows
|
|
18
|
+
|
|
19
|
+
Flows are effectively data pipelines -- they accept input values and produce output values.
|
|
20
|
+
The flow will execute for each input it receives.
|
|
21
|
+
|
|
22
|
+
Thus, for a conversational AI, each message from the user is one execution of the flow.
|
|
23
|
+
|
|
24
|
+
Flows are inherently _stateless_: no data is stored between executions though they can use tools, apis, or memory to share data.
|
|
25
|
+
|
|
26
|
+
In this example, we'll use memory to let the flow remember previous chat messages from both the user and the LLM.
|
|
27
|
+
|
|
28
|
+
|
|
29
|
+
## Part 1: Add Memory to Your Application (5 minutes)
|
|
30
|
+
|
|
31
|
+
### Create Your Chatbot File
|
|
32
|
+
|
|
33
|
+
Create a new file called `02_conversational_chat.qtype.yaml`. Let's use bedrock for this example, but you could also use OpenAI as in the previous tutorial:
|
|
34
|
+
|
|
35
|
+
```yaml
|
|
36
|
+
id: 02_conversational_chat
|
|
37
|
+
description: A conversational chatbot with memory
|
|
38
|
+
|
|
39
|
+
models:
|
|
40
|
+
|
|
41
|
+
models:
|
|
42
|
+
- type: Model
|
|
43
|
+
id: nova_lite
|
|
44
|
+
provider: aws-bedrock
|
|
45
|
+
model_id: amazon.nova-lite-v1:0
|
|
46
|
+
inference_params:
|
|
47
|
+
temperature: 0.7
|
|
48
|
+
max_tokens: 512
|
|
49
|
+
|
|
50
|
+
```
|
|
51
|
+
|
|
52
|
+
---
|
|
53
|
+
|
|
54
|
+
### Add Memory Configuration
|
|
55
|
+
|
|
56
|
+
Now add a memory configuration *before* the `flows:` section:
|
|
57
|
+
|
|
58
|
+
```yaml
|
|
59
|
+
memories:
|
|
60
|
+
- id: chat_memory
|
|
61
|
+
token_limit: 10000
|
|
62
|
+
```
|
|
63
|
+
|
|
64
|
+
**What this means:**
|
|
65
|
+
|
|
66
|
+
- `memories:` - Section for memory configurations (new concept!)
|
|
67
|
+
- `id: chat_memory` - A nickname you'll use to reference this memory
|
|
68
|
+
- `token_limit: 10000` - Maximum total tokens to have in the memory
|
|
69
|
+
|
|
70
|
+
**Check your work:**
|
|
71
|
+
|
|
72
|
+
1. Save the file
|
|
73
|
+
2. Validate: `qtype validate 02_conversational_chat.qtype.yaml`
|
|
74
|
+
3. Should pass ✅ (even though we haven't added flows yet)
|
|
75
|
+
|
|
76
|
+
---
|
|
77
|
+
|
|
78
|
+
## Part 2: Create a Conversational Flow (7 minutes)
|
|
79
|
+
|
|
80
|
+
### Set Up the Conversational Flow
|
|
81
|
+
|
|
82
|
+
Add this flow definition:
|
|
83
|
+
|
|
84
|
+
```yaml
|
|
85
|
+
flows:
|
|
86
|
+
- type: Flow
|
|
87
|
+
id: simple_chat_example
|
|
88
|
+
interface:
|
|
89
|
+
type: Conversational
|
|
90
|
+
variables:
|
|
91
|
+
- id: user_message
|
|
92
|
+
type: ChatMessage
|
|
93
|
+
- id: response_message
|
|
94
|
+
type: ChatMessage
|
|
95
|
+
inputs:
|
|
96
|
+
- user_message
|
|
97
|
+
outputs:
|
|
98
|
+
- response_message
|
|
99
|
+
```
|
|
100
|
+
|
|
101
|
+
**New concepts explained:**
|
|
102
|
+
|
|
103
|
+
**`ChatMessage` type** - A special domain type for chat applications
|
|
104
|
+
|
|
105
|
+
- Represents a single message in a conversation
|
|
106
|
+
- Contains structured blocks (text, images, files, etc.) and metadata
|
|
107
|
+
- Different from the simple `text` type used in stateless applications
|
|
108
|
+
|
|
109
|
+
**ChatMessage Structure:**
|
|
110
|
+
|
|
111
|
+
```yaml
|
|
112
|
+
ChatMessage:
|
|
113
|
+
blocks:
|
|
114
|
+
- type: text
|
|
115
|
+
content: "Hello, how can I help?"
|
|
116
|
+
- type: image
|
|
117
|
+
url: "https://example.com/image.jpg"
|
|
118
|
+
role: assistant # or 'user', 'system'
|
|
119
|
+
metadata:
|
|
120
|
+
timestamp: "2025-11-08T10:30:00Z"
|
|
121
|
+
```
|
|
122
|
+
|
|
123
|
+
The `blocks` list allows multimodal messages (text + images + files), while `role` indicates who sent the message. QType automatically handles this structure when managing conversation history.
|
|
124
|
+
|
|
125
|
+
|
|
126
|
+
**Why two variables?**
|
|
127
|
+
|
|
128
|
+
- `user_message` - What the user types
|
|
129
|
+
- `response_message` - What the AI responds
|
|
130
|
+
- QType tracks both in memory for context
|
|
131
|
+
|
|
132
|
+
**`interface.type: Conversational`**
|
|
133
|
+
|
|
134
|
+
This tells QType that the flow should be served as a conversation. When you type `qtype serve` (covered below) this ensures that the ui shows a chat interface instead of just listing inputs and outputs.
|
|
135
|
+
|
|
136
|
+
|
|
137
|
+
**Check your work:**
|
|
138
|
+
|
|
139
|
+
1. Validate: `qtype validate 02_conversational_chat.qtype.yaml`
|
|
140
|
+
2. Should still pass ✅
|
|
141
|
+
|
|
142
|
+
---
|
|
143
|
+
|
|
144
|
+
### Add the Chat Step
|
|
145
|
+
|
|
146
|
+
Add the LLM inference step that connects to your memory:
|
|
147
|
+
|
|
148
|
+
```yaml
|
|
149
|
+
steps:
|
|
150
|
+
- id: llm_inference_step
|
|
151
|
+
type: LLMInference
|
|
152
|
+
model: nova_lite
|
|
153
|
+
system_message: "You are a helpful assistant."
|
|
154
|
+
memory: chat_memory
|
|
155
|
+
inputs:
|
|
156
|
+
- user_message
|
|
157
|
+
outputs:
|
|
158
|
+
- response_message
|
|
159
|
+
```
|
|
160
|
+
|
|
161
|
+
**What's new:**
|
|
162
|
+
|
|
163
|
+
**`memory: chat_memory`** - Links this step to the memory configuration
|
|
164
|
+
- Automatically sends conversation history with each request
|
|
165
|
+
- Updates memory after each exchange
|
|
166
|
+
- This line is what enables "remembering" previous messages
|
|
167
|
+
|
|
168
|
+
**`system_message` with personality** - Unlike the previous generic message, this shapes the AI's behavior for conversation
|
|
169
|
+
|
|
170
|
+
**Check your work:**
|
|
171
|
+
|
|
172
|
+
1. Validate: `qtype validate 02_conversational_chat.qtype.yaml`
|
|
173
|
+
2. Should pass ✅
|
|
174
|
+
|
|
175
|
+
---
|
|
176
|
+
|
|
177
|
+
## Part 3: Set Up and Test (8 minutes)
|
|
178
|
+
|
|
179
|
+
### Configure Authentication
|
|
180
|
+
|
|
181
|
+
Create `.env` in the same folder (or update your existing one):
|
|
182
|
+
|
|
183
|
+
```
|
|
184
|
+
AWS_PROFILE=your-aws-profile
|
|
185
|
+
```
|
|
186
|
+
|
|
187
|
+
**Using OpenAI?** Replace the model configuration with:
|
|
188
|
+
```yaml
|
|
189
|
+
auths:
|
|
190
|
+
- type: api_key
|
|
191
|
+
id: openai_auth
|
|
192
|
+
api_key: ${OPENAI_KEY}
|
|
193
|
+
host: https://api.openai.com
|
|
194
|
+
models:
|
|
195
|
+
- type: Model
|
|
196
|
+
id: gpt-4
|
|
197
|
+
provider: openai
|
|
198
|
+
model_id: gpt-4-turbo
|
|
199
|
+
auth: openai_auth
|
|
200
|
+
inference_params:
|
|
201
|
+
temperature: 0.7
|
|
202
|
+
```
|
|
203
|
+
|
|
204
|
+
And:
|
|
205
|
+
|
|
206
|
+
- update the step to use `model: gtp-4`.
|
|
207
|
+
- update your `.env` file to have `OPENAI_KEY`
|
|
208
|
+
|
|
209
|
+
---
|
|
210
|
+
|
|
211
|
+
### Start the Chat Interface
|
|
212
|
+
|
|
213
|
+
Unlike the previous tutorial where you used `qtype run` for one-off questions, conversational applications work better with the web interface:
|
|
214
|
+
|
|
215
|
+
```bash
|
|
216
|
+
qtype serve 02_conversational_chat.qtype.yaml
|
|
217
|
+
```
|
|
218
|
+
|
|
219
|
+
**What you'll see:**
|
|
220
|
+
```
|
|
221
|
+
INFO: Started server process
|
|
222
|
+
INFO: Uvicorn running on http://127.0.0.1:8000
|
|
223
|
+
```
|
|
224
|
+
|
|
225
|
+
**Visit:** [http://localhost:8000/ui](http://localhost:8000/ui)
|
|
226
|
+
|
|
227
|
+
You should see a chat interface with your application name at the top. Give it a chat!
|
|
228
|
+
|
|
229
|
+

|
|
230
|
+
|
|
231
|
+
|
|
232
|
+
|
|
233
|
+
---
|
|
234
|
+
|
|
235
|
+
### Test Conversation Memory
|
|
236
|
+
|
|
237
|
+
Try this conversation to see memory in action:
|
|
238
|
+
|
|
239
|
+
```
|
|
240
|
+
You: My name is Alex and I love pizza.
|
|
241
|
+
AI: Nice to meet you, Alex! Pizza is delicious...
|
|
242
|
+
|
|
243
|
+
You: What's my name?
|
|
244
|
+
AI: Your name is Alex! ✅
|
|
245
|
+
|
|
246
|
+
You: What food do I like?
|
|
247
|
+
AI: You mentioned you love pizza! ✅
|
|
248
|
+
```
|
|
249
|
+
|
|
250
|
+
Refreshing the page creates a new session and the memory is removed.
|
|
251
|
+
|
|
252
|
+
---
|
|
253
|
+
|
|
254
|
+
## Part 4: Understanding What's Happening (Bonus)
|
|
255
|
+
|
|
256
|
+
### The Memory Lifecycle
|
|
257
|
+
|
|
258
|
+
Here's what happens when you send a message:
|
|
259
|
+
|
|
260
|
+
```
|
|
261
|
+
User: "What's my name?"
|
|
262
|
+
↓
|
|
263
|
+
QType: Get conversation history from memory
|
|
264
|
+
↓
|
|
265
|
+
Memory: Returns previous messages (including "My name is Alex")
|
|
266
|
+
↓
|
|
267
|
+
QType: Combines system message + history + new question
|
|
268
|
+
↓
|
|
269
|
+
LLM: Processes full context → "Your name is Alex!"
|
|
270
|
+
↓
|
|
271
|
+
QType: Saves new exchange to memory
|
|
272
|
+
↓
|
|
273
|
+
User: Sees response
|
|
274
|
+
```
|
|
275
|
+
|
|
276
|
+
**Key insight:** The LLM itself has no memory - QType handles this by:
|
|
277
|
+
|
|
278
|
+
1. Storing all previous messages
|
|
279
|
+
2. Sending relevant history with each new question
|
|
280
|
+
3. Managing token limits automatically
|
|
281
|
+
|
|
282
|
+
|
|
283
|
+
**The memory is keyed on the user session** -- it's not accessible by other visitors to the page.
|
|
284
|
+
|
|
285
|
+
---
|
|
286
|
+
|
|
287
|
+
## What You've Learned
|
|
288
|
+
|
|
289
|
+
Congratulations! You've mastered:
|
|
290
|
+
|
|
291
|
+
✅ **Memory configuration** - Storing conversation state
|
|
292
|
+
✅ **Conversational flows** - Multi-turn interactions
|
|
293
|
+
✅ **ChatMessage type** - Domain-specific data types
|
|
294
|
+
✅ **Web interface** - Using `qtype serve` for chat applications
|
|
295
|
+
|
|
296
|
+
---
|
|
297
|
+
|
|
298
|
+
## Next Steps
|
|
299
|
+
|
|
300
|
+
**Reference the complete example:**
|
|
301
|
+
|
|
302
|
+
- [`02_conversational_chat.qtype`](https://github.com/bazaarvoice/qtype/blob/main/examples/02_conversational_chat.qtype) - Full working example
|
|
303
|
+
|
|
304
|
+
**Learn more:**
|
|
305
|
+
|
|
306
|
+
- [Tutorial: Structured Data](03-structured-data.md)
|
|
307
|
+
- [ChatMessage Reference](../components/ChatMessage.md)
|
|
308
|
+
- [Use Conversational Interfaces](../How%20To/Qtype%20Server/use_conversational_interfaces.md)
|
|
309
|
+
|
|
310
|
+
---
|
|
311
|
+
|
|
312
|
+
## Common Questions
|
|
313
|
+
|
|
314
|
+
**Q: Why do I need `ChatMessage` instead of `text`?**
|
|
315
|
+
A: `ChatMessage` includes metadata (role, attachments) that QType uses to properly format conversation history for the LLM. The `text` type is for simple strings without this context.
|
|
316
|
+
|
|
317
|
+
**Q: Can I have multiple memory configurations?**
|
|
318
|
+
A: Yes! You can define multiple memories in the `memories:` section and reference different ones in different flows or steps.
|
|
319
|
+
|
|
320
|
+
**Q: Can I use memory with the `Complete` interface?**
|
|
321
|
+
A: No - memory only works with `Conversational` interface. Complete flows are stateless by design. If you need to remember information between requests, you must use the Conversational interface.
|
|
322
|
+
|
|
323
|
+
**Q: When should I use Complete vs Conversational?**
|
|
324
|
+
A: Use Complete for streaming single responses from an llm. Use Conversational when you need context from previous interactions (chatbots, assistants, multi-step conversations).
|
|
325
|
+
|
|
326
|
+
**Q: How do I clear memory during a conversation?**
|
|
327
|
+
A: Currently, you need to start a new session (refresh the page in the UI).
|