create-harper 0.12.4 → 0.13.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (33) hide show
  1. package/package.json +1 -1
  2. package/template-react/AGENTS.md +3 -1
  3. package/template-react/skills/automatic-apis.md +4 -0
  4. package/template-react/skills/caching.md +113 -0
  5. package/template-react/skills/checking-authentication.md +95 -0
  6. package/template-react/skills/custom-resources.md +4 -0
  7. package/template-react/skills/extending-tables.md +4 -0
  8. package/template-react/skills/using-blob-datatype.md +131 -0
  9. package/template-react/skills/vector-indexing.md +215 -0
  10. package/template-react-ts/AGENTS.md +3 -1
  11. package/template-react-ts/skills/automatic-apis.md +4 -0
  12. package/template-react-ts/skills/caching.md +113 -0
  13. package/template-react-ts/skills/checking-authentication.md +95 -0
  14. package/template-react-ts/skills/custom-resources.md +4 -0
  15. package/template-react-ts/skills/extending-tables.md +4 -0
  16. package/template-react-ts/skills/using-blob-datatype.md +131 -0
  17. package/template-react-ts/skills/vector-indexing.md +215 -0
  18. package/template-vanilla/AGENTS.md +3 -1
  19. package/template-vanilla/skills/automatic-apis.md +4 -0
  20. package/template-vanilla/skills/caching.md +113 -0
  21. package/template-vanilla/skills/checking-authentication.md +95 -0
  22. package/template-vanilla/skills/custom-resources.md +4 -0
  23. package/template-vanilla/skills/extending-tables.md +4 -0
  24. package/template-vanilla/skills/using-blob-datatype.md +131 -0
  25. package/template-vanilla/skills/vector-indexing.md +215 -0
  26. package/template-vanilla-ts/AGENTS.md +3 -1
  27. package/template-vanilla-ts/skills/automatic-apis.md +4 -0
  28. package/template-vanilla-ts/skills/caching.md +113 -0
  29. package/template-vanilla-ts/skills/checking-authentication.md +95 -0
  30. package/template-vanilla-ts/skills/custom-resources.md +4 -0
  31. package/template-vanilla-ts/skills/extending-tables.md +4 -0
  32. package/template-vanilla-ts/skills/using-blob-datatype.md +131 -0
  33. package/template-vanilla-ts/skills/vector-indexing.md +215 -0
package/package.json CHANGED
@@ -1,7 +1,7 @@
1
1
  {
2
2
  "name": "create-harper",
3
3
  "description": "Scaffold a new Harper project in JavaScript or TypeScript.",
4
- "version": "0.12.4",
4
+ "version": "0.13.1",
5
5
  "type": "module",
6
6
  "author": {
7
7
  "name": "HarperDB",
@@ -16,4 +16,6 @@ This repository contains "skills" that guide AI agents in developing Harper appl
16
16
  - [Handling Binary Data](skills/handling-binary-data.md): How to store and serve binary data like images or MP3s.
17
17
  - [Serving Web Content](skills/serving-web-content): Two ways to serve web content from a Harper application.
18
18
  - [Checking Authentication](skills/checking-authentication.md): How to use sessions to verify user identity and roles.
19
- [adding-tables-with-schemas.md](skills/adding-tables-with-schemas.md)
19
+ - [Caching](skills/caching.md): How caching is defined and implemented in Harper applications.
20
+ - [Using Blobs](skills/using-blob-datatype.md): How to store and retrieve large data in HarperDB.
21
+ - [Vector Indexing](skills/vector-indexing.md): How to define and use vector indexes for efficient similarity search.
@@ -47,3 +47,7 @@ This is the easiest way to add real-time capabilities to your application. For m
47
47
  The `GET /{TableName}/` and `DELETE /{TableName}/` endpoints can be filtered using query parameters. While basic equality filters are straightforward, Harper supports a rich set of operators, sorting, and pagination.
48
48
 
49
49
  For a comprehensive guide on advanced querying, see the [Querying REST APIs](querying-rest-apis.md) skill.
50
+
51
+ ## Customizing Resources
52
+
53
+ If the automatic APIs don't behave how you need, then you can look to [customize the resources](./custom-resources.md).
@@ -0,0 +1,113 @@
1
+ # Harper Caching
2
+
3
+ Harper includes integrated support for **caching data from external sources**, enabling high-performance, low-latency cache storage that is fully queryable and interoperable with your applications. With built-in caching capabilities and distributed responsiveness, Harper makes an ideal **data caching server** for both edge and centralized use cases.
4
+
5
+ ---
6
+
7
+ ## What is Harper Caching?
8
+
9
+ Harper caching lets you store **cached content** in standard tables, enabling you to:
10
+
11
+ - Expose cached entries as **queryable structured data** (e.g., JSON or CSV)
12
+ - Serve data to clients with **flexible formats and custom querying**
13
+ - Manage cache control with **timestamps and ETags** for downstream caching layers
14
+ - Implement **active or passive caching** patterns depending on your source and invalidation strategy
15
+
16
+ ---
17
+
18
+ ## Configuring a Cache Table
19
+
20
+ Define a cache table in your `schema.graphql`:
21
+
22
+ ```graphql
23
+ type MyCache @table(expiration: 3600) @export {
24
+ id: ID @primaryKey
25
+ }
26
+ ```
27
+
28
+ - `expiration` is defined in seconds
29
+ - Expired records are refreshed on access
30
+ - Evicted records are removed after expiration
31
+
32
+ ---
33
+
34
+ ## Connecting an External Source
35
+
36
+ Create a resource:
37
+
38
+ ```js
39
+ import { Resource } from 'harperdb';
40
+
41
+ export class ThirdPartyAPI extends Resource {
42
+ async get() {
43
+ const id = this.getId();
44
+ const response = await fetch(`https://api.example.com/items/${id}`);
45
+ if (!response.ok) {
46
+ throw new Error('Source fetch failed');
47
+ }
48
+ return await response.json();
49
+ }
50
+ }
51
+ ```
52
+
53
+ Attach it to your table:
54
+
55
+ ```js
56
+ import { tables } from 'harperdb';
57
+ import { ThirdPartyAPI } from './ThirdPartyAPI.js';
58
+
59
+ const { MyCache } = tables;
60
+ MyCache.sourcedFrom(ThirdPartyAPI);
61
+ ```
62
+
63
+ ---
64
+
65
+ ## Cache Behavior
66
+
67
+ 1. Fresh data is returned immediately
68
+ 2. Missing or stale data triggers a fetch
69
+ 3. Concurrent misses are deduplicated
70
+
71
+ ---
72
+
73
+ ## Active Caching
74
+
75
+ Use `subscribe()` to proactively update or invalidate cache entries:
76
+
77
+ ```js
78
+ class MyAPI extends Resource {
79
+ async *subscribe() {
80
+ // stream updates
81
+ }
82
+ }
83
+ ```
84
+
85
+ See [Real Time Apps](real-time-apps.md) for more details.
86
+
87
+ ---
88
+
89
+ ## Write-Through Caching
90
+
91
+ Propagate updates upstream:
92
+
93
+ ```js
94
+ class ThirdPartyAPI extends Resource {
95
+ async put(data) {
96
+ await fetch(`https://some-api.com/${this.getId()}`, {
97
+ method: 'PUT',
98
+ body: JSON.stringify(data),
99
+ });
100
+ }
101
+ }
102
+ ```
103
+
104
+ ---
105
+
106
+ ## Summary
107
+
108
+ Harper Caching allows you to:
109
+
110
+ - Cache external APIs efficiently
111
+ - Query cached data like native tables
112
+ - Prevent cache stampedes
113
+ - Build real-time or write-through caches
@@ -177,6 +177,100 @@ export function ensureSuperUser(user: User | undefined) {
177
177
  - Sessions are cookie-based; the server handles setting and reading the cookie via Harper. If you make cross-origin requests, ensure the appropriate `credentials` mode and CORS settings.
178
178
  - If developing locally, double-check the server config still has `authentication.authorizeLocal: false` to avoid accidental superuser bypass.
179
179
 
180
+ ## Token-based auth (JWT + refresh token) for non-browser clients
181
+
182
+ Cookie-backed sessions are great for browser flows. For CLI tools, mobile apps, or other non-browser clients, it’s often easier to use **explicit tokens**:
183
+
184
+ - **JWT (`operation_token`)**: short-lived bearer token used to authorize API requests.
185
+ - **Refresh token (`refresh_token`)**: longer-lived token used to mint a new JWT when it expires.
186
+
187
+ This project includes two Resource patterns for that flow:
188
+
189
+ ### Issuing tokens: `IssueTokens`
190
+
191
+ **Description / use case:** Generate `{ refreshToken, jwt }` either:
192
+
193
+ - with an existing Authorization token (either Basic Auth or a JWT) and you want to issue new tokens, or
194
+ - from an explicit `{ username, password }` payload (useful for direct “login” from a CLI/mobile client).
195
+
196
+ ```
197
+ js
198
+ export class IssueTokens extends Resource {
199
+ static loadAsInstance = false;
200
+
201
+ async get(target) {
202
+ const { refresh_token: refreshToken, operation_token: jwt } =
203
+ await databases.system.hdb_user.operation(
204
+ { operation: 'create_authentication_tokens' },
205
+ this.getContext(),
206
+ );
207
+ return { refreshToken, jwt };
208
+ }
209
+
210
+ async post(target, data) {
211
+ if (!data.username || !data.password) {
212
+ throw new Error('username and password are required');
213
+ }
214
+
215
+ const { refresh_token: refreshToken, operation_token: jwt } =
216
+ await databases.system.hdb_user.operation({
217
+ operation: 'create_authentication_tokens',
218
+ username: data.username,
219
+ password: data.password,
220
+ });
221
+ return { refreshToken, jwt };
222
+ }
223
+ }
224
+ ```
225
+
226
+ **Recommended documentation notes to include:**
227
+
228
+ - `GET` variant: intended for “I already have an Authorization token, give me new tokens”.
229
+ - `POST` variant: intended for “I have credentials, give me tokens”.
230
+ - Response shape:
231
+ - `refreshToken`: store securely (long-lived).
232
+ - `jwt`: attach to requests (short-lived).
233
+
234
+ ### Refreshing a JWT: `RefreshJWT`
235
+
236
+ **Description / use case:** When the JWT expires, the client uses the refresh token to get a new JWT without re-supplying username/password.
237
+
238
+ ```
239
+ js
240
+ export class RefreshJWT extends Resource {
241
+ static loadAsInstance = false;
242
+
243
+ async post(target, data) {
244
+ if (!data.refreshToken) {
245
+ throw new Error('refreshToken is required');
246
+ }
247
+
248
+ const { operation_token: jwt } = await databases.system.hdb_user.operation({
249
+ operation: 'refresh_operation_token',
250
+ refresh_token: data.refreshToken,
251
+ });
252
+ return { jwt };
253
+ }
254
+ }
255
+ ```
256
+
257
+ **Recommended documentation notes to include:**
258
+
259
+ - Requires `refreshToken` in the request body.
260
+ - Returns a new `{ jwt }`.
261
+ - If refresh fails (expired/revoked), client must re-authenticate (e.g., call `IssueTokens.post` again).
262
+
263
+ ### Suggested client flow (high-level)
264
+
265
+ 1. **Sign in (token flow)**
266
+ - POST /IssueTokens/ with a body of `{ "username": "your username", "password": "your password" }` or GET /IssueTokens/ with an existing Authorization token.
267
+ - Receive `{ jwt, refreshToken }` in the response
268
+ 2. **Call protected APIs**
269
+ - Send the JWT with each request in the Authorization header (as your auth mechanism expects)
270
+ 3. **JWT expires**
271
+ - POST /RefreshJWT/ with a body of `{ "refreshToken": "your refresh token" }`.
272
+ - Receive `{ jwt }` in the response and continue
273
+
180
274
  ## Quick checklist
181
275
 
182
276
  - [ ] Public endpoints explicitly `allowRead`/`allowCreate` as needed.
@@ -184,3 +278,4 @@ export function ensureSuperUser(user: User | undefined) {
184
278
  - [ ] Protected routes call `ensureSuperUser(this.getCurrentUser())` (or another role check) before doing work.
185
279
  - [ ] Sign-out verifies a session and deletes it.
186
280
  - [ ] `authentication.authorizeLocal` is `false` and `enableSessions` is `true` in Harper config.
281
+ - [ ] If using tokens: `IssueTokens` issues `{ jwt, refreshToken }`, `RefreshJWT` refreshes `{ jwt }` with a `refreshToken`.
@@ -4,6 +4,10 @@ Custom Resources allow you to define your own REST endpoints with custom logic b
4
4
 
5
5
  Harper supports [TypeScript Type Stripping](typescript-type-stripping.md), allowing you to use TypeScript directly without additional build tools on supported Node.js versions.
6
6
 
7
+ ## Do you need a custom resource?
8
+
9
+ Exported tables have [automatic APIs](./automatic-apis.md) as a part of them. These APIs may be enough for what you need. Take a look at [extendings tables](./extending-tables.md) to learn more about resources with a backing table.
10
+
7
11
  ## Defining a Custom Resource
8
12
 
9
13
  To create a custom resource:
@@ -2,6 +2,10 @@
2
2
 
3
3
  In Harper, when you define a table in GraphQL and export it, you can extend the automatically generated resource class to add custom logic, validation, or hooks to standard CRUD operations.
4
4
 
5
+ ## Do you need to extend tables?
6
+
7
+ Exported tables have [automatic APIs](./automatic-apis.md) as a part of them. These APIs may be sufficient for what you need.
8
+
5
9
  ## How to Extend a Table Resource
6
10
 
7
11
  1. Identify the table you want to extend (e.g., `ExamplePeople`).
@@ -0,0 +1,131 @@
1
+ # Blob (Binary Large Objects)
2
+
3
+ Harper supports **Blobs** — binary large objects for storing unstructured or large binary data — with integrated streaming support and efficient storage. Blobs are ideal for media files, documents, and any data where size or throughput makes standard JSON fields impractical.
4
+
5
+ ---
6
+
7
+ ## What Are Blobs
8
+
9
+ Blobs extend the native JavaScript `Blob` type and allow you to store **binary or arbitrary data** inside Harper tables. The blob reference is stored in the record, while the blob’s contents are streamed to and from storage.
10
+
11
+ - Designed for binary data such as images, audio, and documents
12
+ - Supports streaming reads and writes
13
+ - Blob data is stored separately from record attributes
14
+ - Optimized for large payloads
15
+
16
+ ---
17
+
18
+ ## Defining Blob Fields
19
+
20
+ Declare a blob field using the `Blob` type in your schema:
21
+
22
+ ```graphql
23
+ type MyTable @table {
24
+ id: ID @primaryKey
25
+ data: Blob
26
+ }
27
+ ```
28
+
29
+ Any record written to this field will store a reference to the blob’s contents.
30
+
31
+ ---
32
+
33
+ ## Creating and Storing Blobs
34
+
35
+ ### Creating a Blob from a Buffer
36
+
37
+ ```js
38
+ const blob = createBlob(largeBuffer);
39
+ await MyTable.put({ id: 'my-record', data: blob });
40
+ ```
41
+
42
+ - `createBlob()` returns a blob reference
43
+ - Data is streamed to storage asynchronously
44
+ - Records may be committed before the blob finishes writing
45
+
46
+ ---
47
+
48
+ ### Creating a Blob from a Stream
49
+
50
+ ```js
51
+ const blob = createBlob(stream);
52
+ await MyTable.put({ id: 'streamed-record', data: blob });
53
+ ```
54
+
55
+ Streaming allows large data to be written without loading it fully into memory.
56
+
57
+ ---
58
+
59
+ ## Reading Blob Data
60
+
61
+ Retrieve a record and read its blob contents:
62
+
63
+ ```js
64
+ const record = await MyTable.get('my-record');
65
+ const buffer = await record.data.bytes();
66
+ ```
67
+
68
+ Blob objects also support streaming interfaces for large reads.
69
+
70
+ ---
71
+
72
+ ## Blob Attributes and Events
73
+
74
+ ### Size
75
+
76
+ The blob size may not be immediately available when streaming:
77
+
78
+ ```js
79
+ if (blob.size === undefined) {
80
+ blob.on('size', size => {
81
+ console.log('Blob size:', size);
82
+ });
83
+ }
84
+ ```
85
+
86
+ ---
87
+
88
+ ### saveBeforeCommit
89
+
90
+ Blobs are not atomic while streaming. To ensure the blob is fully written before committing the record:
91
+
92
+ ```js
93
+ const blob = createBlob(stream, { saveBeforeCommit: true });
94
+ await MyTable.put({ id: 'safe-record', data: blob });
95
+ ```
96
+
97
+ ---
98
+
99
+ ## Error Handling
100
+
101
+ Handle streaming errors by attaching an error listener:
102
+
103
+ ```js
104
+ blob.on('error', () => {
105
+ MyTable.invalidate('my-record');
106
+ });
107
+ ```
108
+
109
+ This prevents partially written blobs from being used.
110
+
111
+ ---
112
+
113
+ ## Automatic Coercion
114
+
115
+ When a field is defined as `Blob`, assigning a string or buffer automatically converts it into a blob when using `put`, `patch`, or `publish`.
116
+
117
+ ---
118
+
119
+ ## Related Skill
120
+
121
+ - [Handling Binary Data with Blobs](handling-binary-data.md) How to store and serve binary data like images or MP3s using the Blob data type.
122
+
123
+ ---
124
+
125
+ ## Summary
126
+
127
+ - Blobs store large or binary data efficiently
128
+ - Blob fields reference streamed content
129
+ - Supports buffered and streamed writes
130
+ - Optional write-before-commit behavior
131
+ - Integrates seamlessly with Harper tables
@@ -0,0 +1,215 @@
1
+ # Vector Indexing
2
+
3
+ Harper supports **vector indexing** on array attributes, enabling efficient similarity search over high-dimensional vector data. This is essential for AI-powered features such as semantic search, recommendations, and embeddings-based retrieval.
4
+
5
+ ---
6
+
7
+ ## What Is Vector Indexing
8
+
9
+ Vector indexing organizes numeric vectors so that Harper can efficiently find records that are closest to a given query vector using a distance metric such as cosine similarity or Euclidean distance.
10
+
11
+ Unlike traditional indexes that rely on exact matches, vector indexes enable **nearest-neighbor search** across high-dimensional spaces, making them ideal for embeddings and machine learning workloads.
12
+
13
+ ---
14
+
15
+ ## Enabling a Vector Index
16
+
17
+ Vector indexes are defined using the `@indexed` directive on numeric array attributes.
18
+
19
+ ```graphql
20
+ type Product @table {
21
+ id: Long @primaryKey
22
+ name: String
23
+ description: String
24
+ textEmbeddings: [Float] @indexed(type: "HNSW")
25
+ price: Float
26
+ }
27
+ ```
28
+
29
+ - `type: "HNSW"` enables Harper’s vector index using the HNSW algorithm
30
+ - The indexed field must be an array of numeric values
31
+ - Vector indexes are stored and maintained automatically
32
+
33
+ ---
34
+
35
+ ## Querying with a Vector Index
36
+
37
+ ### Search Vectors with sort
38
+
39
+ Once defined, vector indexes can be used by specifying a `sort` configuration with a target vector. To view the similarity of a result to a given query vector, use the `$distance` attribute in the `select` clause.
40
+
41
+ ```js
42
+ const results = Product.search({
43
+ select: ['name', 'description', 'price', '$distance'],
44
+ sort: {
45
+ attribute: 'textEmbeddings',
46
+ target: searchVector,
47
+ },
48
+ limit: 5,
49
+ });
50
+ ```
51
+
52
+ - `attribute` is the vector index attribute
53
+ - `target` is the vector to compare against
54
+ - `searchVector` is the embedding to compare against
55
+ - Results are ordered by similarity
56
+ - Vector search can be combined with filters and limits
57
+ - The `$distance` attribute in the `select` (optional) returns the distance between the result and the query vector
58
+
59
+ ### Search Vectors limited by distance
60
+
61
+ Vector indexes results can be limited by distance using the `conditions` clause. In the following example, results are returned that are less than 0.1 similar to the query vector.
62
+ The `conditions` clause can be combined with `sort` and `limit` and the `comparator` can be any of the following: `lt`, `lte`, `gt`, `gte`, `between`.
63
+
64
+ ```js
65
+ const results = Product.search({
66
+ select: ['name', 'description', 'price', '$distance'],
67
+ conditions: {
68
+ attribute: 'textEmbeddings',
69
+ comparator: 'lt',
70
+ value: 0.1, // '0.1' is the similarity threshold
71
+ target: searchVector,
72
+ },
73
+ });
74
+ ```
75
+
76
+ - `attribute` is the vector index attribute
77
+ - `comparator` is the comparison operator (`lt`, `lte`, `gt`, `gte`, `between` are accepted)
78
+ - `value` is the threshold value
79
+ - `target` is the vector to compare against
80
+ - `searchVector` is the embedding to compare against
81
+ - Vector search can be combined with filters, sort, and limits
82
+ - The `$distance` attribute in the `select` (optional) returns the distance between the result and the query vector
83
+
84
+ ---
85
+
86
+ ## Vector Index Options
87
+
88
+ Additional tuning options can be provided on the `@indexed` directive:
89
+
90
+ | Option | Description |
91
+ | ---------------------- | ------------------------------------------- |
92
+ | `distance` | Similarity metric (`cosine` or `euclidean`) |
93
+ | `efConstruction` | Index build quality vs performance |
94
+ | `M` | Graph connectivity per HNSW layer |
95
+ | `optimizeRouting` | Improves routing efficiency |
96
+ | `efSearchConstruction` | Search breadth during queries |
97
+
98
+ These options allow fine-tuning for performance and recall tradeoffs.
99
+
100
+ ---
101
+
102
+ ## How to Generate and Search Vector Embeddings
103
+
104
+ Here is a full example that generates embeddings for a set of products and then searches for similar products using vector indexes. The following example shows how to generate embeddings using OpenAI or Ollama.
105
+
106
+ ```js
107
+ import { Ollama } from 'ollama';
108
+ const ollama = new Ollama({ host: 'http://127.0.0.1:11434' });
109
+ // The name of the ollama embedding model
110
+ const OLLAMA_EMBEDDING_MODEL = 'nomic-embed-text';
111
+
112
+ const { Product } = tables;
113
+
114
+ import OpenAI from 'openai';
115
+ const openai = new OpenAI();
116
+ // the name of the OpenAI embedding model
117
+ const OPENAI_EMBEDDING_MODEL = 'text-embedding-3-small';
118
+
119
+ const SIMILARITY_THRESHOLD = 0.5;
120
+
121
+ export class ProductSearch extends Resource {
122
+ // based on env variable we choose the appropriate embedding generator
123
+ generateEmbedding = process.env.EMBEDDING_GENERATOR === 'ollama'
124
+ ? this._generateOllamaEmbedding
125
+ : this._generateOpenAIEmbedding;
126
+
127
+ /**
128
+ * Executes a search query using a generated text embedding and returns the matching products.
129
+ *
130
+ * @param {Object} data - The input data for the request.
131
+ * @param {string} data.prompt - The prompt to generate the text embedding from.
132
+ * @return {Promise<Array>} Returns a promise that resolves to an array of products matching the conditions,
133
+ * including fields: name, description, price, and $distance.
134
+ */
135
+ async post(data) {
136
+ const embedding = await this.generateEmbedding(data.prompt);
137
+
138
+ return await Product.search({
139
+ select: ['name', 'description', 'price', '$distance'],
140
+ conditions: {
141
+ attribute: 'textEmbeddings',
142
+ comparator: 'lt',
143
+ value: SIMILARITY_THRESHOLD,
144
+ target: embedding[0],
145
+ },
146
+ limit: 5,
147
+ });
148
+ }
149
+
150
+ /**
151
+ * Generates an embedding using the Ollama API.
152
+ *
153
+ * @param {string} promptData - The input data for which the embedding is to be generated.
154
+ * @return {Promise<number[][]>} A promise that resolves to the generated embedding as an array of numbers.
155
+ */
156
+ async _generateOllamaEmbedding(promptData) {
157
+ const embedding = await ollama.embed({
158
+ model: OLLAMA_EMBEDDING_MODEL,
159
+ input: promptData,
160
+ });
161
+ return embedding?.embeddings;
162
+ }
163
+
164
+ /**
165
+ * Generates OpenAI embeddings based on the given prompt data.
166
+ *
167
+ * @param {string} promptData - The input data used for generating the embedding.
168
+ * @return {Promise<number[][]>} A promise that resolves to an array of embeddings, where each embedding is an array of floats.
169
+ */
170
+ async _generateOpenAIEmbedding(promptData) {
171
+ const embedding = await openai.embeddings.create({
172
+ model: OPENAI_EMBEDDING_MODEL,
173
+ input: promptData,
174
+ encoding_format: 'float',
175
+ });
176
+
177
+ let embeddings = [];
178
+ embedding.data.forEach((embeddingData) => {
179
+ embeddings.push(embeddingData.embedding);
180
+ });
181
+
182
+ return embeddings;
183
+ }
184
+ }
185
+ ```
186
+
187
+ Sample request to the `ProductSearch` resource which prompts to find "shorts for the gym":
188
+
189
+ ```bash
190
+ curl -X POST "http://localhost:9926/ProductSearch/" \
191
+ -H "accept: \
192
+ -H "Content-Type: application/json" \
193
+ -H "Authorization: Basic <YOUR_AUTH>" \
194
+ -d '{"prompt": "shorts for the gym"}'
195
+ ```
196
+
197
+ ---
198
+
199
+ ## When to Use Vector Indexing
200
+
201
+ Vector indexing is ideal when:
202
+
203
+ - Storing embedding vectors from ML models
204
+ - Performing semantic or similarity-based search
205
+ - Working with high-dimensional numeric data
206
+ - Exact-match indexes are insufficient
207
+
208
+ ---
209
+
210
+ ## Summary
211
+
212
+ - Vector indexing enables fast similarity search on numeric arrays
213
+ - Defined using `@indexed(type: "HNSW")`
214
+ - Queried using a target vector in search sorting
215
+ - Tunable for performance and accuracy
@@ -16,4 +16,6 @@ This repository contains "skills" that guide AI agents in developing Harper appl
16
16
  - [Handling Binary Data](skills/handling-binary-data.md): How to store and serve binary data like images or MP3s.
17
17
  - [Serving Web Content](skills/serving-web-content): Two ways to serve web content from a Harper application.
18
18
  - [Checking Authentication](skills/checking-authentication.md): How to use sessions to verify user identity and roles.
19
- [adding-tables-with-schemas.md](skills/adding-tables-with-schemas.md)
19
+ - [Caching](skills/caching.md): How caching is defined and implemented in Harper applications.
20
+ - [Using Blobs](skills/using-blob-datatype.md): How to store and retrieve large data in HarperDB.
21
+ - [Vector Indexing](skills/vector-indexing.md): How to define and use vector indexes for efficient similarity search.
@@ -47,3 +47,7 @@ This is the easiest way to add real-time capabilities to your application. For m
47
47
  The `GET /{TableName}/` and `DELETE /{TableName}/` endpoints can be filtered using query parameters. While basic equality filters are straightforward, Harper supports a rich set of operators, sorting, and pagination.
48
48
 
49
49
  For a comprehensive guide on advanced querying, see the [Querying REST APIs](querying-rest-apis.md) skill.
50
+
51
+ ## Customizing Resources
52
+
53
+ If the automatic APIs don't behave how you need, then you can look to [customize the resources](./custom-resources.md).