PyPI - pearmut - Versions diffs - 0.0.3__py3-none-any.whl → 0.0.4__py3-none-any.whl - Mend

pearmut 0.0.3py3-none-any.whl → 0.0.4py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7) hide show

pearmut/static/pointwise.html CHANGED Viewed

@@ -168,4 +168,4 @@
       direction: rtl;
       width: 16px;
       height: 200px;
-    }</style><script defer="defer" src="pointwise.bundle.js"></script></head><body><div style="max-width: 1600px; min-width: 900px; margin-left: auto; margin-right: auto; margin-top: 20px; padding-left: 10px;"><div style="display: flex;"><span id="progress" style="flex: 0 0 130px;">Annotated: 0/0</span> <span id="time" style="flex: 0 0 170px;">Annotation time: 0m</span> <span id="status_message" style="margin-left: 20px; flex-grow: 1; vertical-align: top;"></span> <input type="button" value="Next 🛠️" id="button_next" disabled="disabled" style="flex: 0 0 150px; margin-right: 20px; margin-left: 20px; height: 2.5em;" title="Finish annotating all examples first."></div><div id="output_div" style="margin-top: 100px;"></div><br><br><br></div></body></html>
+    }</style><script defer="defer" src="pointwise.bundle.js"></script></head><body><div style="max-width: 1600px; min-width: 900px; margin-left: auto; margin-right: auto; margin-top: 20px; padding-left: 10px;"><div style="display: flex;"><span id="progress" style="flex: 0 0 140px;">Annotated: 0/0</span> <span id="time" style="flex: 0 0 190px;">Annotation time: 0m</span> <span id="status_message" style="margin-left: 20px; flex-grow: 1; vertical-align: top;"></span> <input type="button" value="Next 🛠️" id="button_next" disabled="disabled" style="flex: 0 0 150px; margin-right: 20px; margin-left: 20px; height: 2.5em;" title="Finish annotating all examples first."></div><div id="output_div" style="margin-top: 100px;"></div><br><br><br></div></body></html>

{pearmut-0.0.3.dist-info → pearmut-0.0.4.dist-info}/METADATA RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: pearmut
-Version: 0.0.3
+Version: 0.0.4
 Summary: A tool for evaluation of model outputs, primarily MT.
 Author-email: Vilém Zouhar <vilem.zouhar@gmail.com>
 License: MIT
@@ -28,6 +28,19 @@ Supports multimodality (text, video, audio, images) and a variety of annotation
 <img width="1334" alt="Screenshot of ESA/MQM interface" src="https://github.com/user-attachments/assets/dde04b98-c724-4226-b926-011a89e9ce31" />
+## Getting started fast
+```bash
+# install the package
+pip install pearmut
+# download two campaign definitions
+wget https://raw.githubusercontent.com/zouharvi/pearmut/refs/heads/main/examples/wmt25_%23_en-cs_CZ.json
+wget https://raw.githubusercontent.com/zouharvi/pearmut/refs/heads/main/examples/wmt25_%23_cs-de_DE.json
+# load them into pearmut
+pearmut add wmt25_#_en-cs_CZ.json
+pearmut add wmt25_#_cs-de_DE.json
+# start pearmut (will show management links)
+pearmut run
+```
 ## Starting a campaign
@@ -36,68 +49,82 @@ First, install the package
 pip install pearmut
 ```
-A campaign is described in a single JSON file.
-The simplest one, where each user has a pre-defined list of tasks (`task-based`) is:
+A campaign is described in a single JSON file (see [examples/](examples/)!).
+One of the simplest ones, where each user has a pre-defined list of tasks (`task-based`), is:
 ```python
 {
-    "campaign_id": "my campaign 4",
-    "info": {
-        "type": "task-based",
-        "template": "pointwise",
-        "protocol_score": True,                # collect scores
-        "protocol_error_spans": True,          # collect error spans
-        "protocol_error_categories": False,    # do not collect MQM categories, so ESA
-    },
-    "data": [
-        [...],  # tasks for first user
-        [...],  # tasks for second user
-        [...],  # tasks for third user
+  "info": {
+    "type": "task-based",
+    "template": "pointwise",
+    "protocol_score": true,                 # we want scores [0...100] for each segment
+    "protocol_error_spans": true,           # we want error spans
+    "protocol_error_categories": false,     # we do not want error span categories
+    "status_message": "Evaluate translation from en to cs_CZ",  # message to show to users
+    "url": "http://localhost:8001"          # where the server will be accessible
+  },
+  "campaign_id": "wmt25_#_en-cs_CZ",
+  "data": [
+    # data for first task/user
+    [
+      {
+        # each evaluation item is a document
+        "src": [
+          "This will be the year that Guinness loses its cool. Cheers to that!",
+          "I'm not sure I can remember exactly when I sensed it. Maybe it was when some...",
+        ],
+        "tgt": [
+          "Tohle bude rok, kdy Guinness přijde o svůj „cool“ faktor. Na zdraví!",
+          "Nevím přesně, kdy jsem to poprvé zaznamenal. Možná to bylo ve chvíli, ...",
+        ]
+      },
+      ...
+    ],
+    # data for second task/user
+    [
         ...
     ],
+    # arbitrary number of users (each corresponds to a single URL to be shared)
+  ]
 }
 ```
 In general, the task item can be anything and is handled by the specific protocol template.
 For the standard ones (ESA, DA, MQM), we expect each item to be a list (i.e. document unit) that looks as follows:
 ```python
 [
-    {
-        "src": "A najednou se všechna tato voda naplnila dalšími lidmi a dalšími věcmi.",       # mandatory for ESA/MQM/DA
-        "tgt": "And suddenly all the water became full of other people and other people.",      # mandatory for ESA/MQM/DA
+    {   # single document definition
+        "src": ["A najednou se všechna tato voda naplnila dalšími lidmi a dalšími věcmi.", "toto je pokračování stejného dokumentu"],       # mandatory for ESA/MQM/DA
+        "tgt": ["And suddenly all the water became full of other people and other people.", "this is a continuation of the same document"],      # mandatory for ESA/MQM/DA
         ...  # all other keys that will be stored, useful for your analysis
     },
-    {
-        "src": "toto je pokračování stejného dokumentu",
-        "tgt": "this is a continuation of the same document",
-        ...
-    },
-    ...
+    ... # definition of another item
 ]
 ```
-We also support dynamic allocation of annotations (`dynamic`, not yet ⚠️), which is more complex and can be ignored for now:
+We also support a super simple allocation of annotations (`task-single`, not yet ⚠️), where you simply pass a list of all examples to be evaluated and they are processed in parallel by all annotators:
 ```python
 {
     "campaign_id": "my campaign 6",
     "info": {
-        "type": "dynamic",
-        "template": "kway",
-        "protocol_k": 5,
+        "type": "task-single",
+        "template": "pointwise",
+        "protocol_score": True,                # collect scores
+        "protocol_error_spans": True,          # collect error spans
+        "protocol_error_categories": False,    # do not collect MQM categories, so ESA
         "users": 50,
     },
     "data": [...], # list of all items
 }
 ```
-We also support a super simple allocation of annotations (`task-single`, not yet ⚠️), where you simply pass a list of all examples to be evaluated and they are processed in parallel by all annotators:
+We also support dynamic allocation of annotations (`dynamic`, not yet ⚠️), which is more complex and can be ignored for now:
 ```python
 {
     "campaign_id": "my campaign 6",
     "info": {
-        "type": "task-single",
-        "template": "pointwise",
-        "protocol_score": True,                # collect scores
-        "protocol_error_spans": True,          # collect error spans
-        "protocol_error_categories": False,    # do not collect MQM categories, so ESA
+        "type": "dynamic",
+        "template": "kway",
+        "protocol_k": 5,
         "users": 50,
     },
     "data": [...], # list of all items
@@ -106,17 +133,22 @@ We also support a super simple allocation of annotations (`task-single`, not yet
 To load a campaign into the server, run the following.
 It will fail if an existing campaign with the same `campaign_id` already exists, unless you specify `-o/--overwrite`.
-It will also output a secret management link.
+It will also output a secret management link. Then, launch the server:
 ```bash
 pearmut add my_campaign_4.json
-```
-Finally, you can launch the server with:
-```bash
 pearmut run
 ```
-You can see examples in `data/examples/`.
+## Annotator management
+When adding new campaigns or launching pearmut, a management link is shown that gives an overview of annotator progress but also an easy access to the annotation links or resetting the task progress (no data will be lost).
+<img width="800" alt="Management dashboard" src="https://github.com/user-attachments/assets/057899d7-2291-46c7-876f-407c4050a9cb" />
+Additionally, at the end of an annotation, a token of completion is shown which can be compared to the correct one that you can download in metadat from the dashboard.
+An intentionally incorrect token can be shown if the annotations don't pass quality control.
+<img width="500" alt="Token on completion" src="https://github.com/user-attachments/assets/4b4d2aa9-7bab-44d6-894b-6c789cd3bc6e" />
 ## Development
@@ -131,13 +163,13 @@ npm run watch --prefix web/
 pip3 install -e .
 # add existing data from WMT25, this generates annotation links
 # sets up progress/log files in current working folder
-pearmut add data/examples/wmt25_#_en-cs_CZ.json
-pearmut add data/examples/wmt25_#_cs-de_DE.json
+pearmut add examples/wmt25_#_en-cs_CZ.json
+pearmut add examples/wmt25_#_cs-de_DE.json
 # shows a management link for all loaded campaigns
 pearmut run
 ```
-## Misc
+## Citation
 If you use this work in your paper, please cite as:
 ```bibtex

{pearmut-0.0.3.dist-info → pearmut-0.0.4.dist-info}/RECORD RENAMED Viewed

@@ -7,12 +7,12 @@ pearmut/static/dashboard.bundle.js,sha256=bd7L6wiFIHTdCk1bgiDkWNhJ-T9OwI3pq8Tsis
 pearmut/static/dashboard.html,sha256=yXwKubqBYdWZ260xRSgNcfebtDVWPl6J5UAa6sj2NOk,1742
 pearmut/static/index.html,sha256=ieCRLK83MVe-f-gtjYiOlvE-kKd8VnFF2xgyi6FoZpU,872
 pearmut/static/pointwise.bundle.js,sha256=2aGddZQPxdVM73Ln9-ZJen42VeTY5fhMiAYgO1I63Rw,98820
-pearmut/static/pointwise.html,sha256=7C2IN61js9F2445whHVDptxdIfL-ntw5u4rF2OoBWzo,4436
+pearmut/static/pointwise.html,sha256=RJxuRj8xbEdxfWM0K_phltK7pMjRuk48mYhEo1X8PgY,4436
 pearmut/static/assets/favicon.svg,sha256=gVPxdBlyfyJVkiMfh8WLaiSyH4lpwmKZs8UiOeX8YW4,7347
 pearmut/static/assets/style.css,sha256=jfETRgVCohe680_30GXxbV4Zq4-B6UlXd5pZXlVLIRs,888
-pearmut-0.0.3.dist-info/licenses/LICENSE,sha256=xx0jnfkXJvxRnG63LTGOxlggYnIysveWIZ6H3PNdCrQ,11357
-pearmut-0.0.3.dist-info/METADATA,sha256=UmP0vmPs2mnztEKpdEDd-NDBRLs8Pd81w7wISoqsUbM,4882
-pearmut-0.0.3.dist-info/WHEEL,sha256=_zCd3N1l69ArxyTb8rzEoP9TpbYXkqRFSNOD5OuxnTs,91
-pearmut-0.0.3.dist-info/entry_points.txt,sha256=eEA9LVWsS3neQbMvL_nMvEw8I0oFudw8nQa1iqxOiWM,45
-pearmut-0.0.3.dist-info/top_level.txt,sha256=CdgtUM-SKQDt6o5g0QreO-_7XTBP9_wnHMS1P-Rl5Go,8
-pearmut-0.0.3.dist-info/RECORD,,
+pearmut-0.0.4.dist-info/licenses/LICENSE,sha256=xx0jnfkXJvxRnG63LTGOxlggYnIysveWIZ6H3PNdCrQ,11357
+pearmut-0.0.4.dist-info/METADATA,sha256=9L-x0xFezPPy8FCKQBVk7criwX-qzGkvEEJw-tkpu3c,6814
+pearmut-0.0.4.dist-info/WHEEL,sha256=_zCd3N1l69ArxyTb8rzEoP9TpbYXkqRFSNOD5OuxnTs,91
+pearmut-0.0.4.dist-info/entry_points.txt,sha256=eEA9LVWsS3neQbMvL_nMvEw8I0oFudw8nQa1iqxOiWM,45
+pearmut-0.0.4.dist-info/top_level.txt,sha256=CdgtUM-SKQDt6o5g0QreO-_7XTBP9_wnHMS1P-Rl5Go,8
+pearmut-0.0.4.dist-info/RECORD,,

{pearmut-0.0.3.dist-info → pearmut-0.0.4.dist-info}/WHEEL RENAMED Viewed

File without changes

{pearmut-0.0.3.dist-info → pearmut-0.0.4.dist-info}/entry_points.txt RENAMED Viewed

File without changes

{pearmut-0.0.3.dist-info → pearmut-0.0.4.dist-info}/licenses/LICENSE RENAMED Viewed

File without changes

{pearmut-0.0.3.dist-info → pearmut-0.0.4.dist-info}/top_level.txt RENAMED Viewed

File without changes

pearmut 0.0.3__py3-none-any.whl → 0.0.4__py3-none-any.whl

pearmut 0.0.3py3-none-any.whl → 0.0.4py3-none-any.whl