page-agent 1.2.0 → 1.4.0

package/README.md CHANGED
@@ -1,4 +1,4 @@
- # PageAgent 🤖🪄
+ # Page Agent
 
  <picture>
  <source media="(prefers-color-scheme: dark)" srcset="https://img.alicdn.com/imgextra/i4/O1CN01qKig1P1FnhpFKNdi6_!!6000000000532-2-tps-1280-256.png">
@@ -9,28 +9,33 @@
 
  The GUI Agent Living in Your Webpage. Control web interfaces with natural language.
 
- 🌐 **English** | [中文](./README-zh.md)
+ 🌐 **English** | [中文](./docs/README-zh.md)
 
- 👉 <a href="https://alibaba.github.io/page-agent/" target="_blank"><b>🚀 Demo</b></a> | <a href="https://alibaba.github.io/page-agent/#/docs/introduction/overview" target="_blank"><b>📖 Documentation</b></a>
+ 👉 <a href="https://alibaba.github.io/page-agent/" target="_blank"><b>🚀 Demo</b></a> | <a href="https://alibaba.github.io/page-agent/docs/introduction/overview" target="_blank"><b>📖 Documentation</b></a>
 
- <video id="demo-video" src="https://github.com/user-attachments/assets/34d8444d-cbfb-44a3-a24e-fd5c167bb0bf" controls crossorigin muted></video>
+ <video id="demo-video" src="https://github.com/user-attachments/assets/11aed429-b69c-49d5-8982-fa99b4a0e9a8" controls crossorigin muted></video>
 
  ---
 
  ## ✨ Features
 
- - **🎯 Easy Integration**
- - No python. No headless browser. No browser extension. Just in-page scripts.
- - **🔐 Client-Side Processing**
- - **🧠 DOM Extraction**
- - **💬 Natural Language Interface**
- - **🎨 UI with Human in the loop**
+ - **🎯 Easy integration**
+ - No need for `browser extension` / `python` / `headless browser`.
+ - Just in-page javascript. Everything happens in your web page.
+ - The best tool for your agent to control web pages.
+ - **📖 Text-based DOM manipulation**
+ - No screenshots. No OCR or multi-modal LLMs needed.
+ - No special permissions required.
+ - **🧠 Bring your own LLMs**
+ - **🎨 Pretty UI with human-in-the-loop**
+ - **🐙 Optional [chrome extension](https://alibaba.github.io/page-agent/docs/features/chrome-extension) for multi-page tasks.**
 
- And 😉
+ ## 💡 Use Cases
 
- - **🧪 `cross-page` control with an experimental chrome extension** - `packages/extension`
-
- 👉 [**🗺️ Roadmap**](https://github.com/alibaba/page-agent/issues/96)
+ - **SaaS AI Copilot** — Ship an AI copilot in your product in lines of code. No backend rewrite needed.
+ - **Smart Form Filling** — Turn 20-click workflows into one sentence. Perfect for ERP, CRM, and admin systems.
+ - **Accessibility** — Make any web app accessible through natural language. Voice commands, screen readers, zero barrier.
+ - **Multi-page Agent** — Extend your agent's reach across browser tabs with the optional [chrome extension](https://alibaba.github.io/page-agent/docs/features/chrome-extension).
 
  ## 🚀 Quick Start
 
@@ -39,19 +44,15 @@ And 😉
  Fastest way to try PageAgent with our free Demo LLM:
 
  ```html
- <script
- src="https://cdn.jsdelivr.net/npm/page-agent@1.2.0/dist/iife/page-agent.demo.js"
- crossorigin="true"
- ></script>
+ <script src="{URL}" crossorigin="true"></script>
  ```
 
- > - **⚠️ For technical evaluation only.** Demo LLM has rate limits and usage restrictions. May change without notice.
- > - **🌷 Bring your own LLM API.**
-
  | Mirrors | URL |
  | ------- | ---------------------------------------------------------------------------------- |
- | Global | https://cdn.jsdelivr.net/npm/page-agent@1.2.0/dist/iife/page-agent.demo.js |
- | China | https://registry.npmmirror.com/page-agent/1.2.0/files/dist/iife/page-agent.demo.js |
+ | Global | https://cdn.jsdelivr.net/npm/page-agent@1.4.0/dist/iife/page-agent.demo.js |
+ | China | https://registry.npmmirror.com/page-agent/1.4.0/files/dist/iife/page-agent.demo.js |
+
+ > **⚠️ For technical evaluation only.** This demo CDN uses our free [testing LLM API](https://alibaba.github.io/page-agent/docs/features/models#free-testing-api). By using it, you agree to its [terms](https://github.com/alibaba/page-agent/blob/main/docs/terms-and-privacy.md).
 
  ### NPM Installation
 
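Note on the hunk above: the new README replaces the hard-coded script URL with a `{URL}` placeholder and leaves the reader to pick a row from the mirrors table. A minimal sketch of that substitution follows, assuming the two URL patterns shown in the table stay stable across versions. The names `MIRRORS`, `demoScriptUrl`, and `demoScriptTag` are hypothetical and are not part of page-agent.

```javascript
// Hypothetical helpers, not part of page-agent.
// Each entry mirrors one row of the "Mirrors | URL" table in the README.
const MIRRORS = new Map([
  ['global', (version) =>
    `https://cdn.jsdelivr.net/npm/page-agent@${version}/dist/iife/page-agent.demo.js`],
  ['china', (version) =>
    `https://registry.npmmirror.com/page-agent/${version}/files/dist/iife/page-agent.demo.js`],
])

// Build the demo script URL for a mirror ('global' or 'china') and a version.
function demoScriptUrl(mirror, version) {
  const build = MIRRORS.get(mirror)
  if (!build) {
    throw new Error(`Unknown mirror: ${mirror}`)
  }
  return build(version)
}

// Fill the README's {URL} placeholder and return the finished script tag.
function demoScriptTag(mirror, version) {
  const template = '<script src="{URL}" crossorigin="true"></script>'
  return template.replace('{URL}', demoScriptUrl(mirror, version))
}

console.log(demoScriptTag('global', '1.4.0'))
```

Pinning the version in the URL, as both README revisions do, keeps the demo script from changing underneath a page when a new release is published.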
@@ -63,8 +64,8 @@ npm install page-agent
  import { PageAgent } from 'page-agent'
 
  const agent = new PageAgent({
- model: 'deepseek-chat',
- baseURL: 'https://api.deepseek.com',
+ model: 'qwen3.5-plus',
+ baseURL: 'https://dashscope.aliyuncs.com/compatible-mode/v1',
  apiKey: 'YOUR_API_KEY',
  language: 'en-US',
  })
@@ -72,25 +73,13 @@ const agent = new PageAgent({
  await agent.execute('Click the login button')
  ```
 
- ## 🏗️ Structure
-
- PageAgent adopts a simplified monorepo structure:
-
- ```
- packages/
- ├── core/ # ** Core agent logic without UI(npm: @page-agent/core) **
- ├── page-agent/ # Exported agent and demo(npm: page-agent)
- ├── llms/ # LLM client (npm: @page-agent/llms)
- ├── page-controller/ # DOM operations & Visual Mask (npm: @page-agent/page-controller)
- ├── ui/ # Panel & i18n (npm: @page-agent/ui)
- └── website/ # Demo & Documentation site
- ```
+ For more programmatic usage, see [📖 Documentations](https://alibaba.github.io/page-agent/docs/introduction/overview).
 
  ## 🤝 Contributing
 
  We welcome contributions from the community! Follow our instructions in [CONTRIBUTING.md](CONTRIBUTING.md) for environment setup and local development.
 
- Please read [Code of Conduct](CODE_OF_CONDUCT.md) before contributing.
+ Please read [Code of Conduct](docs/CODE_OF_CONDUCT.md) before contributing.
 
  ## 👏 Acknowledgments