create-harper 0.12.3 → 0.13.0

Files changed (65)
  1. package/package.json +1 -1
  2. package/template-react/AGENTS.md +5 -2
  3. package/template-react/resources/README.md +1 -1
  4. package/template-react/skills/adding-tables-with-schemas.md +2 -2
  5. package/template-react/skills/automatic-apis.md +53 -0
  6. package/template-react/skills/caching.md +113 -0
  7. package/template-react/skills/checking-authentication.md +100 -5
  8. package/template-react/skills/custom-resources.md +8 -4
  9. package/template-react/skills/defining-relationships.md +3 -3
  10. package/template-react/skills/extending-tables.md +7 -3
  11. package/template-react/skills/handling-binary-data.md +3 -3
  12. package/template-react/skills/programmatic-table-requests.md +3 -3
  13. package/template-react/skills/querying-rest-apis.md +2 -2
  14. package/template-react/skills/real-time-apps.md +8 -4
  15. package/template-react/skills/typescript-type-stripping.md +4 -4
  16. package/template-react/skills/using-blob-datatype.md +131 -0
  17. package/template-react/skills/vector-indexing.md +215 -0
  18. package/template-react-ts/AGENTS.md +5 -2
  19. package/template-react-ts/resources/README.md +1 -1
  20. package/template-react-ts/skills/adding-tables-with-schemas.md +2 -2
  21. package/template-react-ts/skills/automatic-apis.md +53 -0
  22. package/template-react-ts/skills/caching.md +113 -0
  23. package/template-react-ts/skills/checking-authentication.md +100 -5
  24. package/template-react-ts/skills/custom-resources.md +8 -4
  25. package/template-react-ts/skills/defining-relationships.md +3 -3
  26. package/template-react-ts/skills/extending-tables.md +7 -3
  27. package/template-react-ts/skills/handling-binary-data.md +3 -3
  28. package/template-react-ts/skills/programmatic-table-requests.md +3 -3
  29. package/template-react-ts/skills/querying-rest-apis.md +2 -2
  30. package/template-react-ts/skills/real-time-apps.md +8 -4
  31. package/template-react-ts/skills/typescript-type-stripping.md +4 -4
  32. package/template-react-ts/skills/using-blob-datatype.md +131 -0
  33. package/template-react-ts/skills/vector-indexing.md +215 -0
  34. package/template-vanilla/AGENTS.md +5 -2
  35. package/template-vanilla/resources/README.md +1 -1
  36. package/template-vanilla/skills/adding-tables-with-schemas.md +2 -2
  37. package/template-vanilla/skills/automatic-apis.md +53 -0
  38. package/template-vanilla/skills/caching.md +113 -0
  39. package/template-vanilla/skills/checking-authentication.md +100 -5
  40. package/template-vanilla/skills/custom-resources.md +8 -4
  41. package/template-vanilla/skills/defining-relationships.md +3 -3
  42. package/template-vanilla/skills/extending-tables.md +7 -3
  43. package/template-vanilla/skills/handling-binary-data.md +3 -3
  44. package/template-vanilla/skills/programmatic-table-requests.md +3 -3
  45. package/template-vanilla/skills/querying-rest-apis.md +2 -2
  46. package/template-vanilla/skills/real-time-apps.md +8 -4
  47. package/template-vanilla/skills/typescript-type-stripping.md +4 -4
  48. package/template-vanilla/skills/using-blob-datatype.md +131 -0
  49. package/template-vanilla/skills/vector-indexing.md +215 -0
  50. package/template-vanilla-ts/AGENTS.md +5 -2
  51. package/template-vanilla-ts/resources/README.md +1 -1
  52. package/template-vanilla-ts/skills/adding-tables-with-schemas.md +2 -2
  53. package/template-vanilla-ts/skills/automatic-apis.md +53 -0
  54. package/template-vanilla-ts/skills/caching.md +113 -0
  55. package/template-vanilla-ts/skills/checking-authentication.md +100 -5
  56. package/template-vanilla-ts/skills/custom-resources.md +8 -4
  57. package/template-vanilla-ts/skills/defining-relationships.md +3 -3
  58. package/template-vanilla-ts/skills/extending-tables.md +7 -3
  59. package/template-vanilla-ts/skills/handling-binary-data.md +3 -3
  60. package/template-vanilla-ts/skills/programmatic-table-requests.md +3 -3
  61. package/template-vanilla-ts/skills/querying-rest-apis.md +2 -2
  62. package/template-vanilla-ts/skills/real-time-apps.md +8 -4
  63. package/template-vanilla-ts/skills/typescript-type-stripping.md +4 -4
  64. package/template-vanilla-ts/skills/using-blob-datatype.md +131 -0
  65. package/template-vanilla-ts/skills/vector-indexing.md +215 -0
@@ -0,0 +1,215 @@
1
+ # Vector Indexing
2
+
3
+ Harper supports **vector indexing** on array attributes, enabling efficient similarity search over high-dimensional vector data. This is essential for AI-powered features such as semantic search, recommendations, and embeddings-based retrieval.
4
+
5
+ ---
6
+
7
+ ## What Is Vector Indexing
8
+
9
+ Vector indexing organizes numeric vectors so that Harper can efficiently find records that are closest to a given query vector using a distance metric such as cosine similarity or Euclidean distance.
10
+
11
+ Unlike traditional indexes that rely on exact matches, vector indexes enable **nearest-neighbor search** across high-dimensional spaces, making them ideal for embeddings and machine learning workloads.
12
+
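To make the idea of "closest to a given query vector" concrete, here is a minimal brute-force sketch in plain JavaScript. It is not how Harper implements its index (HNSW avoids scanning every record); it only illustrates the two distance metrics named above and what a nearest-neighbor result looks like. The function names, the `vector` field, and the definition of cosine distance as `1 - cosine similarity` are assumptions made for illustration.

```js
// Cosine distance, assumed here to be 1 - cosine similarity (0 = same direction).
function cosineDistance(a, b) {
  if (a.length !== b.length) throw new Error('vectors must have the same length');
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return 1 - dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Euclidean distance: straight-line distance between the two points.
function euclideanDistance(a, b) {
  if (a.length !== b.length) throw new Error('vectors must have the same length');
  let sum = 0;
  for (let i = 0; i < a.length; i++) {
    const diff = a[i] - b[i];
    sum += diff * diff;
  }
  return Math.sqrt(sum);
}

// Brute-force nearest neighbors: score every record, sort ascending, keep k.
// A vector index exists to avoid this full scan on large tables.
function nearestNeighbors(records, target, k, distance = cosineDistance) {
  return records
    .map((record) => ({ ...record, $distance: distance(record.vector, target) }))
    .sort((x, y) => x.$distance - y.$distance)
    .slice(0, k);
}

// Hypothetical two-dimensional "embeddings" for three products.
const records = [
  { name: 'gym shorts', vector: [1, 0] },
  { name: 'running shoes', vector: [0.8, 0.6] },
  { name: 'coffee mug', vector: [0, 1] },
];

const nearest = nearestNeighbors(records, [1, 0], 2);
console.log(nearest.map((r) => r.name)); // [ 'gym shorts', 'running shoes' ]
```

Real embeddings have hundreds or thousands of dimensions, which is where a full scan becomes too slow and an approximate index pays off.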
13
+ ---
14
+
15
+ ## Enabling a Vector Index
16
+
17
+ Vector indexes are defined using the `@indexed` directive on numeric array attributes.
18
+
19
+ ```graphql
20
+ type Product @table {
21
+ id: Long @primaryKey
22
+ name: String
23
+ description: String
24
+ textEmbeddings: [Float] @indexed(type: "HNSW")
25
+ price: Float
26
+ }
27
+ ```
28
+
29
+ - `type: "HNSW"` enables Harper’s vector index using the HNSW algorithm
30
+ - The indexed field must be an array of numeric values
31
+ - Vector indexes are stored and maintained automatically
32
+
33
+ ---
34
+
35
+ ## Querying with a Vector Index
36
+
37
+ ### Search Vectors with sort
38
+
39
+ Once defined, vector indexes can be used by specifying a `sort` configuration with a target vector. To view the similarity of a result to a given query vector, use the `$distance` attribute in the `select` clause.
40
+
41
+ ```js
42
+ const results = Product.search({
43
+ select: ['name', 'description', 'price', '$distance'],
44
+ sort: {
45
+ attribute: 'textEmbeddings',
46
+ target: searchVector,
47
+ },
48
+ limit: 5,
49
+ });
50
+ ```
51
+
52
+ - `attribute` is the vector index attribute
53
+ - `target` is the query vector to compare against
54
+ - `searchVector` is the variable holding that query embedding
55
+ - Results are ordered by similarity
56
+ - Vector search can be combined with filters and limits
57
+ - The `$distance` attribute in the `select` (optional) returns the distance between the result and the query vector
58
+
59
+ ### Search Vectors limited by distance
60
+
61
+ Vector index results can be limited by distance using the `conditions` clause. In the following example, only results whose distance from the query vector is less than 0.1 are returned.
62
+ The `conditions` clause can be combined with `sort` and `limit` and the `comparator` can be any of the following: `lt`, `lte`, `gt`, `gte`, `between`.
63
+
64
+ ```js
65
+ const results = Product.search({
66
+ select: ['name', 'description', 'price', '$distance'],
67
+ conditions: {
68
+ attribute: 'textEmbeddings',
69
+ comparator: 'lt',
70
+ value: 0.1, // 0.1 is the distance threshold
71
+ target: searchVector,
72
+ },
73
+ });
74
+ ```
75
+
76
+ - `attribute` is the vector index attribute
77
+ - `comparator` is the comparison operator (`lt`, `lte`, `gt`, `gte`, `between` are accepted)
78
+ - `value` is the threshold value
79
+ - `target` is the query vector to compare against
80
+ - `searchVector` is the variable holding that query embedding
81
+ - Vector search can be combined with filters, sort, and limits
82
+ - The `$distance` attribute in the `select` (optional) returns the distance between the result and the query vector
83
+
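The text above says `conditions` can be combined with `sort` and `limit`. The following sketch builds such a combined query object. `buildVectorQuery` and its `maxDistance` option are hypothetical helper names, and the combined shape is extrapolated from the two examples above rather than verified against Harper, so treat it as a starting point.

```js
// Hypothetical helper: builds a query that orders results by distance to
// `target` and, optionally, keeps only results closer than `maxDistance`.
function buildVectorQuery(target, { maxDistance, limit = 5 } = {}) {
  const query = {
    select: ['name', 'description', 'price', '$distance'],
    sort: {
      attribute: 'textEmbeddings',
      target,
    },
    limit,
  };

  // Only add the distance filter when a threshold was requested.
  if (maxDistance !== undefined) {
    query.conditions = {
      attribute: 'textEmbeddings',
      comparator: 'lt',
      value: maxDistance,
      target,
    };
  }

  return query;
}

// Placeholder vector; a real query would use a full-length embedding.
const query = buildVectorQuery([0.1, 0.2, 0.3], { maxDistance: 0.1, limit: 3 });
console.log(JSON.stringify(query, null, 2));
```

Inside a Harper resource, the result would be passed along as `Product.search(buildVectorQuery(searchVector, { maxDistance: 0.1 }))`.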
84
+ ---
85
+
86
+ ## Vector Index Options
87
+
88
+ Additional tuning options can be provided on the `@indexed` directive:
89
+
90
+ | Option | Description |
91
+ | ---------------------- | ------------------------------------------- |
92
+ | `distance` | Similarity metric (`cosine` or `euclidean`) |
93
+ | `efConstruction` | Index build quality vs performance |
94
+ | `M` | Graph connectivity per HNSW layer |
95
+ | `optimizeRouting` | Improves routing efficiency |
96
+ | `efSearchConstruction` | Search breadth during queries |
97
+
98
+ These options allow fine-tuning for performance and recall tradeoffs.
99
+
100
+ ---
101
+
102
+ ## How to Generate and Search Vector Embeddings
103
+
104
+ Here is a full example that generates an embedding for a search prompt and then finds similar products using the vector index. The embedding is generated with either OpenAI or Ollama, selected by the `EMBEDDING_GENERATOR` environment variable.
105
+
106
+ ```js
107
+ import { Ollama } from 'ollama';
108
+ const ollama = new Ollama({ host: 'http://127.0.0.1:11434' });
109
+ // The name of the ollama embedding model
110
+ const OLLAMA_EMBEDDING_MODEL = 'nomic-embed-text';
111
+
112
+ const { Product } = tables;
113
+
114
+ import OpenAI from 'openai';
115
+ const openai = new OpenAI();
116
+ // the name of the OpenAI embedding model
117
+ const OPENAI_EMBEDDING_MODEL = 'text-embedding-3-small';
118
+
119
+ const SIMILARITY_THRESHOLD = 0.5; // maximum distance allowed between a result and the query vector
120
+
121
+ export class ProductSearch extends Resource {
122
+ // based on env variable we choose the appropriate embedding generator
123
+ generateEmbedding = process.env.EMBEDDING_GENERATOR === 'ollama'
124
+ ? this._generateOllamaEmbedding
125
+ : this._generateOpenAIEmbedding;
126
+
127
+ /**
128
+ * Executes a search query using a generated text embedding and returns the matching products.
129
+ *
130
+ * @param {Object} data - The input data for the request.
131
+ * @param {string} data.prompt - The prompt to generate the text embedding from.
132
+ * @return {Promise<Array>} Returns a promise that resolves to an array of products matching the conditions,
133
+ * including fields: name, description, price, and $distance.
134
+ */
135
+ async post(data) {
136
+ const embedding = await this.generateEmbedding(data.prompt);
137
+
138
+ return await Product.search({
139
+ select: ['name', 'description', 'price', '$distance'],
140
+ conditions: {
141
+ attribute: 'textEmbeddings',
142
+ comparator: 'lt',
143
+ value: SIMILARITY_THRESHOLD,
144
+ target: embedding[0],
145
+ },
146
+ limit: 5,
147
+ });
148
+ }
149
+
150
+ /**
151
+ * Generates an embedding using the Ollama API.
152
+ *
153
+ * @param {string} promptData - The input data for which the embedding is to be generated.
154
+ * @return {Promise<number[][]>} A promise that resolves to an array of embeddings, where each embedding is an array of numbers.
155
+ */
156
+ async _generateOllamaEmbedding(promptData) {
157
+ const embedding = await ollama.embed({
158
+ model: OLLAMA_EMBEDDING_MODEL,
159
+ input: promptData,
160
+ });
161
+ return embedding?.embeddings;
162
+ }
163
+
164
+ /**
165
+ * Generates OpenAI embeddings based on the given prompt data.
166
+ *
167
+ * @param {string} promptData - The input data used for generating the embedding.
168
+ * @return {Promise<number[][]>} A promise that resolves to an array of embeddings, where each embedding is an array of floats.
169
+ */
170
+ async _generateOpenAIEmbedding(promptData) {
171
+ const embedding = await openai.embeddings.create({
172
+ model: OPENAI_EMBEDDING_MODEL,
173
+ input: promptData,
174
+ encoding_format: 'float',
175
+ });
176
+
177
+ let embeddings = [];
178
+ embedding.data.forEach((embeddingData) => {
179
+ embeddings.push(embeddingData.embedding);
180
+ });
181
+
182
+ return embeddings;
183
+ }
184
+ }
185
+ ```
186
+
187
+ Sample request to the `ProductSearch` resource, searching for "shorts for the gym":
188
+
189
+ ```bash
190
+ curl -X POST "http://localhost:9926/ProductSearch/" \
191
+ -H "accept: application/json" \
192
+ -H "Content-Type: application/json" \
193
+ -H "Authorization: Basic <YOUR_AUTH>" \
194
+ -d '{"prompt": "shorts for the gym"}'
195
+ ```
196
+
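The same request can be sent from JavaScript. This is a minimal sketch assuming Node.js 18 or later (for the global `fetch`) and Basic authentication built from a username and password. `buildSearchRequest`, `searchProducts`, and the credential parameters are hypothetical names; the URL and JSON body are taken from the curl example above.

```js
// Hypothetical helper: builds the URL and fetch options for a ProductSearch request.
function buildSearchRequest(prompt, username, password) {
  // Basic auth is the base64 encoding of "username:password".
  const credentials = Buffer.from(`${username}:${password}`).toString('base64');
  return {
    url: 'http://localhost:9926/ProductSearch/',
    options: {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        Authorization: `Basic ${credentials}`,
      },
      body: JSON.stringify({ prompt }),
    },
  };
}

// Sends the request and returns the parsed JSON response.
async function searchProducts(prompt, username, password) {
  const { url, options } = buildSearchRequest(prompt, username, password);
  const response = await fetch(url, options);
  if (!response.ok) {
    throw new Error(`Search failed with status ${response.status}`);
  }
  return response.json();
}
```

Calling `await searchProducts('shorts for the gym', user, pass)` against a running instance should return the array produced by the `post` method above, with `name`, `description`, `price`, and `$distance` fields.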
197
+ ---
198
+
199
+ ## When to Use Vector Indexing
200
+
201
+ Vector indexing is ideal when:
202
+
203
+ - Storing embedding vectors from ML models
204
+ - Performing semantic or similarity-based search
205
+ - Working with high-dimensional numeric data
206
+ - Exact-match indexes are insufficient
207
+
208
+ ---
209
+
210
+ ## Summary
211
+
212
+ - Vector indexing enables fast similarity search on numeric arrays
213
+ - Defined using `@indexed(type: "HNSW")`
214
+ - Queried by passing a target vector in a `sort` or `conditions` clause
215
+ - Tunable for performance and accuracy