@jax-js/jax 0.0.3 → 0.0.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (10)

package/README.md +50 -19
package/dist/{backend-BqDtPGaR.js → backend-EBRGmEYw.js} +296 -153
package/dist/{backend-D2C4MJRP.cjs → backend-Ss1Mev_-.cjs} +315 -154
package/dist/index.cjs +681 -157
package/dist/index.d.cts +422 -76
package/dist/index.d.ts +422 -76
package/dist/index.js +677 -157
package/dist/{webgpu-fqhx41TC.cjs → webgpu-BVdMaO9T.cjs} +9 -3
package/dist/{webgpu-CNg9JGva.js → webgpu-ow0Pn_6q.js} +9 -3
package/package.json +15 -4

package/README.md CHANGED

@@ -37,42 +37,58 @@ const xnorm = norm(x.ref); // 1^2 + 2^2 + 3^2 = 14
 const xgrad = grad(norm)(x); // [2, 4, 6]
 ```
-The default backend runs on CPU, but on [supported browsers](https://caniuse.com/webgpu),
-you can switch to GPU for maximum performance.
+The default backend runs on CPU, but on [supported browsers](https://caniuse.com/webgpu), you can
+switch to GPU for better performance.
 ```js
-import { numpy as np, setDevice } from "@jax-js/jax";
+import { defaultDevice, numpy as np } from "@jax-js/jax";
 // Change the default backend to GPU.
-setDevice("webgpu");
+defaultDevice("webgpu");
 const x = np.ones([4096, 4096]);
 const y = np.dot(x.ref, x); // JIT-compiled into a matrix multiplication kernel
 ```
+Most common JAX APIs are supported. See the [compatibility table](./FEATURES.md) for a full
+breakdown of what features are available.
 ## Development
-Under construction.
+This repository is managed by [`pnpm`](https://pnpm.io/). You can compile and build all packages in
+watch mode with:
 ```bash
 pnpm install
 pnpm run build:watch
+```
+Then you can run tests in a headless browser using [Vitest](https://vitest.dev/).
-# Run tests
+```bash
 pnpm exec playwright install
 pnpm test
 ```
+_We are currently on an older version of Playwright that supports using WebGPU in headless mode;
+newer versions seem to skip the WebGPU tests._
+To start a Vite dev server running the website, demos and REPL:
+```bash
+pnpm -C website dev
+```
 ## Next on Eric's mind
 - Finish CLIP inference demo and associated features (depthwise convolution, vmap of gather, etc.)
-- Fix jit-of-grad returning very incorrect result
-- Improve perf of MNIST neural network
-  - Optimize conv2d further (maybe blocks -> local dims?)
-  - Add fused epilogue to JIT
-  - Reduce kernel overhead of constants / inline expressions
-- Investigate why jax-js Matmul is 2x slower on Safari TP than unroll kernel
-- How many threads to create per workgroup, depends on hardware
+- Performance
+  - Improve perf of MNIST neural network
+    - Optimize conv2d further (maybe blocks -> local dims?)
+    - Add fused epilogue to JIT
+    - Reduce kernel overhead of constants / inline expressions
+  - Investigate why jax-js Matmul is 2x slower on Safari TP than unroll kernel
+  - How many threads to create per workgroup, depends on hardware
 ## Milestones
@@ -91,9 +107,9 @@ pnpm test
 - [x] Other dtypes like int32 and bool
 - [x] `jit()` support via Jaxprs and kernel fusion
 - [x] We figure out the `dispose()` / refcount / linear types stuff
-  - [ ] `dispose()` for saved "const" tracers in Jaxprs
-  - [ ] Garbage collection for JIT programs
-  - [ ] Memory scheduling, buffer allocation (can be tricky)
+  - [x] `dispose()` for saved "const" tracers in Jaxprs
+  - [x] Garbage collection for JIT programs
+  - [x] Debug grad-grad-jit test producing a UseAfterFreeError
 - [ ] Demos: Navier-Stokes, neural networks, statistics
 - [x] Features for neural networks
   - [x] Convolution
@@ -103,6 +119,21 @@ pnpm test
   - [x] Better memory allocation that frees buffers
   - [ ] SIMD support for Wasm backend
   - [ ] Async / multithreading Wasm support
-- [ ] Device switching with `.to()` between webgpu/cpu/wasm
-- [ ] numpy/jax API compatibility table
-- [ ] Import tfjs models
+- [ ] Full support of weak types and committed devices
+  - [ ] High-level ops have automatic type promotion
+  - [ ] Weak types - [ref](https://docs.jax.dev/en/latest/type_promotion.html#weak-types)
+  - [ ] Committed devices -
+        [ref](https://docs.jax.dev/en/latest/sharded-computation.html#sharded-data-placement)
+  - [ ] Device switching with `device_put()` between webgpu/cpu/wasm
+- [x] numpy/jax API compatibility table
+## Future work / help wanted
+Contributions are welcomed in the following areas:
+- Adding support for more JAX functions and operations, see [compatibility table](./FEATURES.md).
+- Improving performance of the WebGPU and Wasm runtimes, generating better kernels, using SIMD and
+  multithreading.
+- Adding WebGL runtime for older browsers that don't support WebGPU.
+- Making a fast transformer inference engine, comparing against onnxruntime-web.
+- Ergonomics and API improvements.
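
The README hunk above replaces `setDevice("webgpu")` with `defaultDevice("webgpu")`. The diff shown here does not confirm whether 0.0.4 still exports `setDevice`, so code that has to run against both versions may want to check which name exists at runtime. The following is a minimal sketch under that assumption: `selectDevice` is a hypothetical helper, not part of `@jax-js/jax`, and the `v003` / `v004` objects are stand-ins for the imported module rather than the real package.

```js
// Hypothetical helper (not part of @jax-js/jax): call whichever
// device-selection function the given module object exposes.
function selectDevice(jax, device) {
  if (typeof jax.defaultDevice === "function") {
    jax.defaultDevice(device); // name used in the 0.0.4 README
    return "defaultDevice";
  }
  if (typeof jax.setDevice === "function") {
    jax.setDevice(device); // name used in the 0.0.3 README
    return "setDevice";
  }
  throw new Error("module exposes neither defaultDevice nor setDevice");
}

// Stand-in module objects for illustration only; they record calls
// instead of switching a real backend.
const calls = [];
const v003 = { setDevice: (d) => calls.push(["setDevice", d]) };
const v004 = { defaultDevice: (d) => calls.push(["defaultDevice", d]) };

const usedOld = selectDevice(v003, "webgpu");
const usedNew = selectDevice(v004, "webgpu");
```

In real code the first argument would presumably be the namespace from `import * as jax from "@jax-js/jax"`. Checking `defaultDevice` first means the newer name wins if a version happens to export both.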