PyPI - EuroEval - Versions diffs - 15.8.0__tar.gz → 15.8.2__tar.gz - Mend

EuroEval 15.8.0tar.gz → 15.8.2tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Potentially problematic release.

This version of EuroEval might be problematic. Click here for more details.

Files changed (240) hide show

{euroeval-15.8.0 → euroeval-15.8.2}/.github/workflows/ci.yaml RENAMED Viewed

@@ -10,6 +10,10 @@ on:
     branches:
       - main
+concurrency:
+  group: ${{ github.workflow }}-${{ github.head_ref }}
+  cancel-in-progress: true
 jobs:
   code-check:
     if: github.event.pull_request.draft == false

{euroeval-15.8.0 → euroeval-15.8.2}/.pre-commit-config.yaml RENAMED Viewed

@@ -10,7 +10,7 @@ repos:
       - id: trailing-whitespace
       - id: debug-statements
 -   repo: https://github.com/astral-sh/ruff-pre-commit
-    rev: v0.11.8
+    rev: v0.11.9
     hooks:
       - id: ruff
         args:

{euroeval-15.8.0 → euroeval-15.8.2}/CHANGELOG.md RENAMED Viewed

@@ -10,6 +10,23 @@ and this project adheres to [Semantic Versioning](http://semver.org/spec/v2.0.0.
+## [v15.8.2] - 2025-05-12
+### Fixed
+- Catch error when caching generative model outputs, when the number of model inputs and
+  outputs do not match.
+- Disallow vLLM >=0.8.5, as it breaks generation output for several models.
+## [v15.8.1] - 2025-05-08
+### Fixed
+- NER labels were included twice in the prompt templates (which was due to there being
+  both, e.g., `B-ORG` and `I-ORG`). This caused models not using structured generation,
+  such as reasoning models, to sometimes output the wrong labels. This has been fixed
+  now.
+- If a model outputs a `\boxed{}` answer, we now extract and use that, rather than the
+  full generated answer.
 ## [v15.8.0] - 2025-05-07
 ### Added
 - Added the BeleBele datasets for Finnish, Italian and Spanish. They are listed as

{euroeval-15.8.0 → euroeval-15.8.2}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: EuroEval
-Version: 15.8.0
+Version: 15.8.2
 Summary: The robust European language model benchmark.
 Project-URL: Repository, https://github.com/EuroEval/EuroEval
 Project-URL: Issues, https://github.com/EuroEval/EuroEval/issues
@@ -62,12 +62,12 @@ Requires-Dist: bitsandbytes>=0.43.1; (platform_system == 'Linux') and extra == '
 Requires-Dist: fbgemm-gpu>=1.0.0; (platform_system == 'Linux') and extra == 'all'
 Requires-Dist: gradio>=4.26.0; extra == 'all'
 Requires-Dist: outlines>=0.1.11; extra == 'all'
-Requires-Dist: vllm>=0.8.3; (platform_system == 'Linux') and extra == 'all'
+Requires-Dist: vllm<0.8.5,>=0.8.3; (platform_system == 'Linux') and extra == 'all'
 Provides-Extra: generative
 Requires-Dist: bitsandbytes>=0.43.1; (platform_system == 'Linux') and extra == 'generative'
 Requires-Dist: fbgemm-gpu>=1.0.0; (platform_system == 'Linux') and extra == 'generative'
 Requires-Dist: outlines>=0.1.11; extra == 'generative'
-Requires-Dist: vllm>=0.8.3; (platform_system == 'Linux') and extra == 'generative'
+Requires-Dist: vllm<0.8.5,>=0.8.3; (platform_system == 'Linux') and extra == 'generative'
 Provides-Extra: human-evaluation
 Requires-Dist: gradio>=4.26.0; extra == 'human-evaluation'
 Provides-Extra: test

{euroeval-15.8.0 → euroeval-15.8.2}/pyproject.toml RENAMED Viewed

@@ -1,6 +1,6 @@
 [project]
 name = "EuroEval"
-version = "15.8.0"
+version = "15.8.2"
 description = "The robust European language model benchmark."
 readme = "README.md"
 authors = [
@@ -46,7 +46,7 @@ dependencies = [
 generative = [
     "outlines>=0.1.11",
     "bitsandbytes>=0.43.1; platform_system == 'Linux'",
-    "vllm>=0.8.3; platform_system == 'Linux'",
+    "vllm>=0.8.3,<0.8.5; platform_system == 'Linux'",
     "fbgemm-gpu>=1.0.0; platform_system == 'Linux'",
 ]
 human_evaluation = [
@@ -55,7 +55,7 @@ human_evaluation = [
 all = [
     "outlines>=0.1.11",
     "bitsandbytes>=0.43.1; platform_system == 'Linux'",
-    "vllm>=0.8.3; platform_system == 'Linux'",
+    "vllm>=0.8.3,<0.8.5; platform_system == 'Linux'",
     "fbgemm-gpu>=1.0.0; platform_system == 'Linux'",
     "gradio>=4.26.0",
 ]

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/benchmark_modules/litellm.py RENAMED Viewed

@@ -401,6 +401,12 @@ class LiteLLMModel(BenchmarkModule):
             model_responses=ordered_responses, model_id=self.model_config.model_id
         )
+        if len(messages) != len(model_output.sequences):
+            raise InvalidBenchmark(
+                f"Number of model inputs ({len(messages):,}) does not match the "
+                f"number of model outputs ({len(model_output.sequences):,})."
+            )
         return model_output
     def _handle_exception(
@@ -616,8 +622,7 @@ class LiteLLMModel(BenchmarkModule):
         scores = []
         for model_response in model_responses:
             if not model_response.choices:
-                # This happens for reasoning models, when they don't finish thinking
-                # and run out of tokens. Happens quite rarely, but we need to handle it.
+                sequences.append("")
                 logger.warning(
                     f"The model {model_id!r} did not end up "
                     "generating any text. This is likely because the model ran "

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/data_models.py RENAMED Viewed

@@ -529,12 +529,16 @@ class DatasetConfig:
         else:
             sep_word = main_language.or_separator
+        local_labels: list[str] = []
+        for label in self.labels:
+            if label not in self.prompt_label_mapping:
+                continue
+            local_label = self.prompt_label_mapping[label]
+            if local_label not in local_labels:
+                local_labels.append(local_label)
         # Convert labels to single-quoted labels - and remove duplicates
-        quoted_labels = [
-            f"'{self.prompt_label_mapping[label]}'"
-            for label in set(self.labels)
-            if label in self.prompt_label_mapping
-        ]
+        quoted_labels = [f"'{label}'" for label in local_labels]
         if not quoted_labels:
             return ""

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/model_cache.py RENAMED Viewed

@@ -168,6 +168,15 @@ class ModelCache:
         input_column = "messages" if "messages" in model_inputs else "text"
         model_inputs = model_inputs[input_column]
+        # Double check that the number of inputs and outputs match
+        if not len(model_inputs) == len(model_output.sequences):
+            logger.warning(
+                f"Number of model inputs ({len(model_inputs)}) does not match the "
+                f"number of model outputs ({len(model_output.sequences)}). We will not "
+                f"cache the model outputs."
+            )
+            return
         # Store the generated sequences in the cache, one by one
         with tqdm(
             iterable=model_inputs,

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/task_group_utils/sequence_classification.py RENAMED Viewed

@@ -144,9 +144,27 @@ def extract_labels_from_generation(
         )
         if labels is not None:
             return labels
-    return get_closest_word_edit_labels(
-        generated_sequences=model_output.sequences, dataset_config=dataset_config
-    )
+    candidate_labels = [
+        dataset_config.prompt_label_mapping[lbl]
+        for lbl in dataset_config.id2label.values()
+    ]
+    new_predicted_labels: list[str] = list()
+    for predicted_label in model_output.sequences:
+        # If the prediction includes a boxed answer, use that instead of the full
+        # generation
+        if (m := re.search(r"boxed\{(.*?)\}", predicted_label)) is not None:
+            predicted_label = m.group(1)
+        # Pick the label with the smallest word edit distance to the predicted label
+        edit_distances = [
+            Levenshtein.distance(s1=predicted_label.lower(), s2=candidate_label.lower())
+            for candidate_label in candidate_labels
+        ]
+        predicted_label = candidate_labels[np.argmin(edit_distances).item()]
+        new_predicted_labels.append(predicted_label)
+    return new_predicted_labels
 def get_closest_logprobs_labels(
@@ -305,32 +323,3 @@ def get_closest_logprobs_labels(
     assert len(output_labels) == len(generation_logprobs)
     return output_labels
-def get_closest_word_edit_labels(
-    generated_sequences: list[str], dataset_config: "DatasetConfig"
-) -> list[str]:
-    """Get the labels with the smallest edit distance to the predicted labels.
-    Args:
-        generated_sequences:
-            The generated sequences from the model.
-        dataset_config:
-            The configuration of the dataset.
-    Returns:
-        The candidate labels with the smallest edit distance to the predicted labels.
-    """
-    candidate_labels = [
-        dataset_config.prompt_label_mapping[lbl]
-        for lbl in dataset_config.id2label.values()
-    ]
-    new_predicted_labels: list[str] = list()
-    for predicted_label in generated_sequences:
-        edit_distances = [
-            Levenshtein.distance(s1=predicted_label.lower(), s2=candidate_label.lower())
-            for candidate_label in candidate_labels
-        ]
-        closest_label = candidate_labels[np.argmin(edit_distances).item()]
-        new_predicted_labels.append(closest_label)
-    return new_predicted_labels

{euroeval-15.8.0 → euroeval-15.8.2}/uv.lock RENAMED Viewed

@@ -906,7 +906,7 @@ wheels = [
 [[package]]
 name = "euroeval"
-version = "15.8.0"
+version = "15.8.2"
 source = { editable = "." }
 dependencies = [
     { name = "accelerate" },
@@ -1034,8 +1034,8 @@ requires-dist = [
     { name = "termcolor", specifier = ">=2.0.0" },
     { name = "torch", specifier = ">=2.6.0" },
     { name = "transformers", specifier = ">=4.51.0" },
-    { name = "vllm", marker = "sys_platform == 'linux' and extra == 'all'", specifier = ">=0.8.3" },
-    { name = "vllm", marker = "sys_platform == 'linux' and extra == 'generative'", specifier = ">=0.8.3" },
+    { name = "vllm", marker = "sys_platform == 'linux' and extra == 'all'", specifier = ">=0.8.3,<0.8.5" },
+    { name = "vllm", marker = "sys_platform == 'linux' and extra == 'generative'", specifier = ">=0.8.3,<0.8.5" },
 ]
 provides-extras = ["generative", "human-evaluation", "all", "test"]

{euroeval-15.8.0 → euroeval-15.8.2}/.github/ISSUE_TEMPLATE/benchmark_dataset_request.yaml RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/.github/ISSUE_TEMPLATE/bug.yaml RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/.github/ISSUE_TEMPLATE/feature_request.yaml RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/.github/ISSUE_TEMPLATE/model_evaluation_request.yaml RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/.gitignore RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/CITATION.cff RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/CODE_OF_CONDUCT.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/CONTRIBUTING.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/Dockerfile.cuda RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/LICENSE RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/NEW_DATASET_GUIDE.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/README.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/CNAME RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/README.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/README.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/danish.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/dutch.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/english.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/faroese.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/finnish.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/french.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/german.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/icelandic.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/italian.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/norwegian.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/spanish.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/datasets/swedish.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/extras/radial_plotter.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/faq.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/gfx/favicon.png RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/danish.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/dutch.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/english.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/faroese.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/french.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/german.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/icelandic.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/italian.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/norwegian.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/spanish.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Monolingual/swedish.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Multilingual/european.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Multilingual/germanic.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Multilingual/mainland-scandinavian.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/Multilingual/romance.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/leaderboards/README.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/methodology.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/python-package.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/tasks/README.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/tasks/common-sense-reasoning.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/tasks/knowledge.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/tasks/linguistic-acceptability.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/tasks/named-entity-recognition.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/tasks/reading-comprehension.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/tasks/sentiment-classification.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/tasks/speed.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/docs/tasks/summarization.md RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/gfx/euroeval.png RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/gfx/euroeval.xcf RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/gfx/scandeval.png RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/makefile RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/mkdocs.yaml RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/__init__.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/benchmark_config_factory.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/benchmark_modules/__init__.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/benchmark_modules/base.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/benchmark_modules/fresh.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/benchmark_modules/hf.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/benchmark_modules/vllm.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/benchmarker.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/callbacks.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/cli.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/constants.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/data_loading.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/__init__.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/danish.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/dutch.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/english.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/faroese.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/finnish.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/french.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/german.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/icelandic.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/italian.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/norwegian.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/spanish.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/dataset_configs/swedish.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/enums.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/exceptions.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/finetuning.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/generation.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/generation_utils.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/human_evaluation.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/languages.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/model_config.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/model_loading.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/prompt_templates/__init__.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/prompt_templates/linguistic_acceptability.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/prompt_templates/multiple_choice.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/prompt_templates/named_entity_recognition.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/prompt_templates/reading_comprehension.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/prompt_templates/sentiment_classification.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/prompt_templates/summarization.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/scores.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/speed_benchmark.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/task_group_utils/__init__.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/task_group_utils/multiple_choice_classification.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/task_group_utils/question_answering.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/task_group_utils/text_to_text.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/task_group_utils/token_classification.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/tasks.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/tokenization_utils.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/types.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/euroeval/utils.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/constants.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_allocine.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_angry_tweets.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_arc.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_arc_is.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_belebele.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_cnn_dailymail.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_conll_en.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_conll_es.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_conll_nl.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_dane.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_danish_citizen_tests.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_dansk.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_danske_talemaader.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_danske_talemaader_old.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_dbrd.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_dutch_cola.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_eltec.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_fone.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_foqa.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_fosent.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_fquad.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_germanquad.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_germeval.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_hellaswag.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_hellaswag_fi.py RENAMED Viewed

File without changes

{euroeval-15.8.0 → euroeval-15.8.2}/src/scripts/create_hotter_and_colder_sentiment.py RENAMED Viewed

File without changes

EuroEval 15.8.0__tar.gz → 15.8.2__tar.gz

Potentially problematic release.

EuroEval 15.8.0tar.gz → 15.8.2tar.gz