@promptbook/markitdown 0.88.0-1 → 0.88.0-9

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -58,8 +58,6 @@ Rest of the documentation is common for **entire promptbook ecosystem**:
58
58
 
59
59
  During the computer revolution, we have seen [multiple generations of computer languages](https://github.com/webgptorg/promptbook/discussions/180), from the physical rewiring of the vacuum tubes through low-level machine code to the high-level languages like Python or JavaScript. And now, we're on the edge of the **next revolution**!
60
60
 
61
-
62
-
63
61
  It's a revolution of writing software in **plain human language** that is understandable and executable by both humans and machines – and it's going to change everything!
64
62
 
65
63
  The incredible growth in power of microprocessors and the Moore's Law have been the driving force behind the ever-more powerful languages, and it's been an amazing journey! Similarly, the large language models (like GPT or Claude) are the next big thing in language technology, and they're set to transform the way we interact with computers.
@@ -122,21 +120,16 @@ We also have a community of developers and users of **Promptbook**:
122
120
  - [Landing page `ptbk.io`](https://ptbk.io)
123
121
  - [Github discussions](https://github.com/webgptorg/promptbook/discussions)
124
122
  - [LinkedIn `Promptbook`](https://linkedin.com/company/promptbook)
125
- - [Facebook `Promptbook`](https://www.facebook.com/61560776453536)
123
+ - [Facebook `Promptbook`](https://www.facebook.com/61560776453536)
126
124
 
127
125
  And **Promptbook.studio** branded socials:
128
126
 
129
-
130
-
131
127
  - [Instagram `@promptbook.studio`](https://www.instagram.com/promptbook.studio/)
132
128
 
133
129
  And **Promptujeme** sub-brand:
134
130
 
135
131
  _/Subbrand for Czech clients/_
136
132
 
137
-
138
-
139
-
140
133
  - [Promptujeme.cz](https://www.promptujeme.cz/)
141
134
  - [Facebook `Promptujeme`](https://www.facebook.com/promptujeme/)
142
135
 
@@ -154,8 +147,6 @@ _/Sub-brand for images and graphics generated via Promptbook prompting/_
154
147
 
155
148
  ## 💙 The Book language
156
149
 
157
-
158
-
159
150
  Following is the documentation and blueprint of the [Book language](https://github.com/webgptorg/book).
160
151
 
161
152
  Book is a language that can be used to write AI applications, agents, workflows, automations, knowledgebases, translators, sheet processors, email automations and more. It allows you to harness the power of AI models in human-like terms, without the need to know the specifics and technicalities of the models.
@@ -205,8 +196,6 @@ Personas can have access to different knowledge, tools and actions. They can als
205
196
 
206
197
  - [PERSONA](https://github.com/webgptorg/promptbook/blob/main/documents/commands/PERSONA.md)
207
198
 
208
-
209
-
210
199
  ### **How:** Knowledge, Instruments and Actions
211
200
 
212
201
  The resources used by the personas are used to do the work.
@@ -282,11 +271,6 @@ Or you can install them separately:
282
271
 
283
272
  ## 📚 Dictionary
284
273
 
285
-
286
-
287
-
288
-
289
-
290
274
  ### 📚 Dictionary
291
275
 
292
276
  The following glossary is used to clarify certain concepts:
@@ -302,12 +286,8 @@ The following glossary is used to clarify certain concepts:
302
286
  - **Retrieval-augmented generation** is a machine learning paradigm where a model generates text by retrieving relevant information from a large database of text. This approach combines the benefits of generative models and retrieval models.
303
287
  - **Longtail** refers to non-common or rare events, items, or entities that are not well-represented in the training data of machine learning models. Longtail items are often challenging for models to predict accurately.
304
288
 
305
-
306
-
307
289
  _Note: Thos section is not complete dictionary, more list of general AI / LLM terms that has connection with Promptbook_
308
290
 
309
-
310
-
311
291
  #### 💯 Core concepts
312
292
 
313
293
  - [📚 Collection of pipelines](https://github.com/webgptorg/promptbook/discussions/65)
@@ -336,8 +316,6 @@ _Note: Thos section is not complete dictionary, more list of general AI / LLM te
336
316
  - [👮 Agent adversary expectations](https://github.com/webgptorg/promptbook/discussions/39)
337
317
  - [view more](https://github.com/webgptorg/promptbook/discussions/categories/concepts)
338
318
 
339
-
340
-
341
319
  ### Terms specific to Promptbook TypeScript implementation
342
320
 
343
321
  - Anonymous mode
@@ -345,10 +323,9 @@ _Note: Thos section is not complete dictionary, more list of general AI / LLM te
345
323
 
346
324
 
347
325
 
348
- ## 🔌 Usage in Typescript / Javascript
326
+ ## 🚂 Promptbook Engine
349
327
 
350
- - [Simple usage](./examples/usage/simple-script)
351
- - [Usage with client and remote server](./examples/usage/remote)
328
+ ![Schema of Promptbook Engine](./documents/promptbook-engine.svg)
352
329
 
353
330
  ## ➕➖ When to use Promptbook?
354
331
 
package/esm/index.es.js CHANGED
@@ -5,7 +5,7 @@ import hexEncoder from 'crypto-js/enc-hex';
5
5
  import { basename, join, dirname } from 'path';
6
6
  import { format } from 'prettier';
7
7
  import parserHtml from 'prettier/parser-html';
8
- import { BehaviorSubject } from 'rxjs';
8
+ import { Subject } from 'rxjs';
9
9
  import { randomBytes } from 'crypto';
10
10
  import { forTime } from 'waitasecond';
11
11
  import sha256 from 'crypto-js/sha256';
@@ -26,7 +26,7 @@ const BOOK_LANGUAGE_VERSION = '1.0.0';
26
26
  * @generated
27
27
  * @see https://github.com/webgptorg/promptbook
28
28
  */
29
- const PROMPTBOOK_ENGINE_VERSION = '0.88.0-1';
29
+ const PROMPTBOOK_ENGINE_VERSION = '0.88.0-9';
30
30
  /**
31
31
  * TODO: string_promptbook_version should be constrained to the all versions of Promptbook engine
32
32
  * Note: [💞] Ignore a discrepancy between file name and entity name
@@ -2068,6 +2068,36 @@ function $randomToken(randomness) {
2068
2068
  * TODO: Maybe use nanoid instead https://github.com/ai/nanoid
2069
2069
  */
2070
2070
 
2071
+ /**
2072
+ * Recursively converts JSON strings to JSON objects
2073
+
2074
+ * @public exported from `@promptbook/utils`
2075
+ */
2076
+ function jsonStringsToJsons(object) {
2077
+ if (object === null) {
2078
+ return object;
2079
+ }
2080
+ if (Array.isArray(object)) {
2081
+ return object.map(jsonStringsToJsons);
2082
+ }
2083
+ if (typeof object !== 'object') {
2084
+ return object;
2085
+ }
2086
+ const newObject = { ...object };
2087
+ for (const [key, value] of Object.entries(object)) {
2088
+ if (typeof value === 'string' && isValidJsonString(value)) {
2089
+ newObject[key] = JSON.parse(value);
2090
+ }
2091
+ else {
2092
+ newObject[key] = jsonStringsToJsons(value);
2093
+ }
2094
+ }
2095
+ return newObject;
2096
+ }
2097
+ /**
2098
+ * TODO: Type the return type correctly
2099
+ */
2100
+
2071
2101
  /**
2072
2102
  * This error indicates problems parsing the format value
2073
2103
  *
@@ -2294,21 +2324,43 @@ function assertsTaskSuccessful(executionResult) {
2294
2324
  function createTask(options) {
2295
2325
  const { taskType, taskProcessCallback } = options;
2296
2326
  const taskId = `${taskType.toLowerCase().substring(0, 4)}-${$randomToken(8 /* <- TODO: To global config + Use Base58 to avoid simmilar char conflicts */)}`;
2297
- const partialResultSubject = new BehaviorSubject({});
2327
+ let status = 'RUNNING';
2328
+ const createdAt = new Date();
2329
+ let updatedAt = createdAt;
2330
+ const errors = [];
2331
+ const warnings = [];
2332
+ let currentValue = {};
2333
+ const partialResultSubject = new Subject();
2334
+ // <- Note: Not using `BehaviorSubject` because on error we can't access the last value
2298
2335
  const finalResultPromise = /* not await */ taskProcessCallback((newOngoingResult) => {
2336
+ Object.assign(currentValue, newOngoingResult);
2337
+ // <- TODO: assign deep
2299
2338
  partialResultSubject.next(newOngoingResult);
2300
2339
  });
2301
2340
  finalResultPromise
2302
2341
  .catch((error) => {
2342
+ errors.push(error);
2303
2343
  partialResultSubject.error(error);
2304
2344
  })
2305
- .then((value) => {
2306
- if (value) {
2345
+ .then((executionResult) => {
2346
+ if (executionResult) {
2307
2347
  try {
2308
- assertsTaskSuccessful(value);
2309
- partialResultSubject.next(value);
2348
+ updatedAt = new Date();
2349
+ errors.push(...executionResult.errors);
2350
+ warnings.push(...executionResult.warnings);
2351
+ // <- TODO: !!! Only unique errors and warnings should be added (or filtered)
2352
+ // TODO: [🧠] !!! errors, warning, isSuccessful are redundant both in `ExecutionTask` and `ExecutionTask.currentValue`
2353
+ // Also maybe move `ExecutionTask.currentValue.usage` -> `ExecutionTask.usage`
2354
+ // And delete `ExecutionTask.currentValue.preparedPipeline`
2355
+ assertsTaskSuccessful(executionResult);
2356
+ status = 'FINISHED';
2357
+ currentValue = jsonStringsToJsons(executionResult);
2358
+ // <- TODO: Convert JSON values in string to JSON objects
2359
+ partialResultSubject.next(executionResult);
2310
2360
  }
2311
2361
  catch (error) {
2362
+ status = 'ERROR';
2363
+ errors.push(error);
2312
2364
  partialResultSubject.error(error);
2313
2365
  }
2314
2366
  }
@@ -2325,12 +2377,33 @@ function createTask(options) {
2325
2377
  return {
2326
2378
  taskType,
2327
2379
  taskId,
2380
+ get status() {
2381
+ return status;
2382
+ // <- Note: [1] Theese must be getters to allow changing the value in the future
2383
+ },
2384
+ get createdAt() {
2385
+ return createdAt;
2386
+ // <- Note: [1]
2387
+ },
2388
+ get updatedAt() {
2389
+ return updatedAt;
2390
+ // <- Note: [1]
2391
+ },
2328
2392
  asPromise,
2329
2393
  asObservable() {
2330
2394
  return partialResultSubject.asObservable();
2331
2395
  },
2396
+ get errors() {
2397
+ return errors;
2398
+ // <- Note: [1]
2399
+ },
2400
+ get warnings() {
2401
+ return warnings;
2402
+ // <- Note: [1]
2403
+ },
2332
2404
  get currentValue() {
2333
- return partialResultSubject.value;
2405
+ return currentValue;
2406
+ // <- Note: [1]
2334
2407
  },
2335
2408
  };
2336
2409
  }
@@ -4705,7 +4778,7 @@ async function executeAttempts(options) {
4705
4778
  Last result:
4706
4779
  ${block($ongoingTaskResult.$resultString === null
4707
4780
  ? 'null'
4708
- : $ongoingTaskResult.$resultString
4781
+ : spaceTrim$1($ongoingTaskResult.$resultString)
4709
4782
  .split('\n')
4710
4783
  .map((line) => `> ${line}`)
4711
4784
  .join('\n'))}