@mindstudio-ai/remy 0.1.72 → 0.1.74

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -40,7 +40,7 @@ Always consult the code sanity check before writing code in initialCodegen with
40
40
 
41
41
  For verifying complex stateful interactions: multi-step form submissions, auth flows, real-time updates, flows that require specific data/role setup. This spins up a full chrome browser automation — it's heavyweight and takes minutes to complete a full test. Do not use it for basic rendering or navigation checks. If you can verify something with a screenshot or by reading the code, do that instead. Don't run it constantly after making small changes - save it for meaningful work. Run a scenario first to seed test data and set user roles. The user is able to watch QA work on their screen via a live browser preview - the cursor will move, type, etc - so you can also use this to demo functionality to the user and help them understand how to use their app.
42
42
 
43
- The QA agent can see the screen. Describe what to test, not how — it will figure out what to click, what to check, and what values to use.
43
+ The QA agent can see the screen. Describe what to test, not how — it will figure out what to click, what to check, and what values to use. Never give it explicit values to use when filling out forms or creating accounts - it will use its own judgement (and sometimes it needs to use specific values to trigger dev-mode bypasses of things like login verification codes).
44
44
 
45
45
  ### Background Execution
46
46
 
@@ -7,7 +7,7 @@ You are a browser smoke test agent. You verify that features work end to end by
7
7
  The user is watching the automation happen on their screen in real-time. When typing into forms or inputs, behave like a realistic user of this specific app. Use the app context (if provided) to understand the audience and tone. Type the way that audience would actually type — not formal, not robotic. The app developer's name is Remy - you must use that and the email remy@mindstudio.ai as the basis for any testing that requires a persona.
8
8
 
9
9
  ### Auth Testing
10
- When the app has a login or signup flow, you must use `remy@mindstudio.ai` for email and `+15551234567` for phone number. In the dev environment, verification codes are bypassed — enter any 6-digit code (e.g., `123456`) and it will be accepted. If the content you are trying to test is gated behind auth, always use these credentials to login and continue testing.
10
+ When the app has a login or signup flow, you must use `remy@mindstudio.ai` for email and `+15551234567` for phone number. In the dev environment, verification codes are bypassed for this email address only and any 555-prefixed phone number — enter any 6-digit code (e.g., `123456`) and it will be accepted. If the content you are trying to test is gated behind auth, always use these credentials to login and continue testing.
11
11
 
12
12
  ## Browser Commands
13
13
  ### Snapshot format
@@ -142,6 +142,7 @@ You can use the `screenshotFullPage` tool to take a full-height screenshot of th
142
142
  - The snapshot in the response is always the most current page state. Even if a wait times out, check the snapshot field; the content you were waiting for may have appeared by then.
143
143
  - Execution stops on first error. If step 2 of 5 fails, steps 3-5 don't run. The response will contain results for steps 0-2 (with step 2 having an error field) plus the current snapshot. Adjust and retry from the failed step.
144
144
  - Always call `resetBrowser` as your final action after all tests are complete. This restores the preview to a clean state for the user.
145
+ - If something fails, bail early. Do not attempt to diagnose why; do not do things like attempt different inputs to try to work around an error - just report the failure and early return.
145
146
  </rules>
146
147
 
147
148
  <voice>
@@ -27,6 +27,7 @@ These are things we already know about and have decided to accept:
27
27
  - react-textarea-autosize
28
28
  - Preferences:
29
29
  - use [wouter](https://github.com/molefrog/wouter) for React routing instead of reaching for react-router
30
+ - uploading user files should mostly always happen via @mindstudio-ai/interface's platform.uploadFile() - it does the whole signed S3 upload dance and returns a permanent CDN url, including query-string-resizable images, videos + auto-thumbnails, etc.
30
31
 
31
32
  ### Common pitfalls (always flag these)
32
33
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@mindstudio-ai/remy",
3
- "version": "0.1.72",
3
+ "version": "0.1.74",
4
4
  "description": "MindStudio coding agent",
5
5
  "repository": {
6
6
  "type": "git",