nkululeko 0.88.11__tar.gz → 0.89.0__tar.gz
This diff shows the content of publicly released package versions as they appear in their public registry. It is provided for informational purposes only and reflects the changes between the two versions.
- {nkululeko-0.88.11 → nkululeko-0.89.0}/CHANGELOG.md +12 -2
- {nkululeko-0.88.11/nkululeko.egg-info → nkululeko-0.89.0}/PKG-INFO +18 -11
- {nkululeko-0.88.11 → nkululeko-0.89.0}/README.md +5 -8
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/constants.py +1 -1
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_agender.py +4 -3
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_auddim.py +2 -3
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_audmodel.py +2 -3
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_opensmile.py +1 -1
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_oxbow.py +6 -10
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/featureset.py +1 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model.py +42 -35
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_cnn.py +4 -4
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_mlp.py +1 -1
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_mlp_regression.py +2 -2
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_tree.py +3 -1
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/reporting/reporter.py +18 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/utils/util.py +5 -4
- {nkululeko-0.88.11 → nkululeko-0.89.0/nkululeko.egg-info}/PKG-INFO +18 -11
- {nkululeko-0.88.11 → nkululeko-0.89.0}/LICENSE +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/aesdd/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/androids/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/ased/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/asvp-esd/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/baved/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/cafe/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/clac/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/cmu-mosei/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/demos/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/ekorpus/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/emns/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/emofilm/convert_to_16k.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/emofilm/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/emorynlp/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/emov-db/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/emovo/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/emozionalmente/create.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/enterface/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/esd/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/gerparas/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/iemocap/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/jl/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/jtes/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/meld/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/mesd/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/mess/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/mlendsnd/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/msp-improv/process_database2.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/msp-podcast/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/oreau2/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/portuguese/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/ravdess/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/ravdess/process_database_speaker.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/savee/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/shemo/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/subesco/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/tess/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/thorsten-emotional/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/urdu/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/data/vivae/process_database.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/docs/source/conf.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/meta/demos/demo_best_model.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/meta/demos/my_experiment.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/meta/demos/my_experiment_local.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/meta/demos/plot_faster_anim.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/__init__.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/aug_train.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/augment.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/augmenting/__init__.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/augmenting/augmenter.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/augmenting/randomsplicer.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/augmenting/randomsplicing.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/augmenting/resampler.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/__init__.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/ap_age.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/ap_arousal.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/ap_dominance.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/ap_gender.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/ap_mos.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/ap_pesq.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/ap_sdr.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/ap_snr.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/ap_stoi.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/ap_valence.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/autopredict/estimate_snr.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/cacheddataset.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/data/__init__.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/data/dataset.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/data/dataset_csv.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/demo.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/demo_feats.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/demo_predictor.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/ensemble.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/experiment.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/explore.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/export.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/__init__.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_agender_agender.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_analyser.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_ast.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_clap.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_hubert.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_import.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_mld.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_mos.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_praat.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_snr.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_spectra.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_spkrec.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_squim.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_trill.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_wav2vec2.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_wavlm.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_whisper.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feinberg_praat.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feature_extractor.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/file_checker.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/filter_data.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/glob_conf.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/losses/__init__.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/losses/loss_ccc.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/losses/loss_softf1loss.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/modelrunner.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/__init__.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_bayes.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_gmm.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_knn.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_knn_reg.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_lin_reg.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_svm.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_svr.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_tree_reg.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_tuned.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_xgb.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_xgr.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/multidb.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/nkuluflag.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/nkululeko.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/plots.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/predict.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/reporting/__init__.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/reporting/defines.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/reporting/latex_writer.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/reporting/report.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/reporting/report_item.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/reporting/result.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/resample.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/runmanager.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/scaler.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/segment.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/segmenting/__init__.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/segmenting/seg_inaspeechsegmenter.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/segmenting/seg_silero.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/syllable_nuclei.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/test.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/test_predictor.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/test_pretrain.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/utils/__init__.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/utils/files.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/utils/stats.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko.egg-info/SOURCES.txt +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko.egg-info/dependency_links.txt +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko.egg-info/requires.txt +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko.egg-info/top_level.txt +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/pyproject.toml +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/setup.cfg +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/setup.py +0 -0
- {nkululeko-0.88.11 → nkululeko-0.89.0}/venv/bin/activate_this.py +0 -0
{nkululeko-0.88.11 → nkululeko-0.89.0}/CHANGELOG.md

```diff
@@ -1,6 +1,16 @@
 Changelog
 =========
 
+Version 0.89.0
+--------------
+* added Roc plots and classification report on Debug
+
+
+Version 0.88.12
+---------------
+* added n_jobs for sklearn processing
+* re_named num_workers n_jobs
+
 Version 0.88.11
 --------------
 * removed hack in Praat script
@@ -470,9 +480,9 @@ Version 0.66.3
 
 Version 0.66.2
 --------------
-* enabled data-
+* enabled data-pacthes with quotes
 * enabled missing category labels
-* used
+* used tqdm for progress display
 
 Version 0.66.1
 --------------
```
{nkululeko-0.88.11/nkululeko.egg-info → nkululeko-0.89.0}/PKG-INFO

````diff
@@ -1,6 +1,6 @@
 Metadata-Version: 2.1
 Name: nkululeko
-Version: 0.88.11
+Version: 0.89.0
 Summary: Machine learning audio prediction experiments based on templates
 Home-page: https://github.com/felixbur/nkululeko
 Author: Felix Burkhardt
@@ -204,7 +204,7 @@ All of them take *--config <my_config.ini>* as an argument.
 * *configurations*: which experiments to combine
 * *--method* (optional): majority_voting, mean (default), max, sum, uncertainty, uncertainty_weighted, confidence_weighted, performance_weighted
 * *--threshold*: uncertainty threshold (1.0 means no threshold)
-* *--
+* *--weights*: weights for performance_weighted method (could be from previous UAR, ACC)
 * *--outfile* (optional): name of CSV file for output (default: ensemble_result.csv)
 * *--no_labels* (optional): indicate that no ground truth is given
 * **nkululeko.multidb**: do [multiple experiments](http://blog.syntheticspeech.de/2024/01/02/nkululeko-compare-several-databases/), comparing several databases cross and in itself
@@ -220,14 +220,11 @@ All of them take *--config <my_config.ini>* as an argument.
 * **nkululeko.predict**: [predict features](http://blog.syntheticspeech.de/2023/08/16/nkululeko-how-to-predict-labels-for-your-data-from-existing-models-and-check-them/) like SNR, MOS, arousal/valence, age/gender, with DNN models
 * **nkululeko.segment**: [segment a database](http://blog.syntheticspeech.de/2023/07/14/nkululeko-segmenting-a-database/) based on VAD (voice activity detection)
 * **nkululeko.resample**: check on all [sampling rates and change](http://blog.syntheticspeech.de/2023/08/31/how-to-fix-different-sampling-rates-in-a-dataset-with-nkululeko/) to 16kHz
-* **nkululeko.nkuluflag**: a convenient module to specify configuration parameters on the command-line.
-  * usage: nkuluflag.py [-h] [--config CONFIG] [--data [DATA ...]] [--label [LABEL ...]] [--tuning_params [TUNING_PARAMS ...]] [--layers [LAYERS ...]] [--model MODEL] [--feat FEAT] [--set SET]
-  [--with_os WITH_OS] [--target TARGET] [--epochs EPOCHS] [--runs RUNS] [--learning_rate LEARNING_RATE] [--drop DROP]
-
-
-
-
+* **nkululeko.nkuluflag**: a convenient module to specify configuration parameters on the command-line. Usage:
 
+```bash
+$ python -m nkululeko.nkuluflag.py [-h] [--config CONFIG] [--data [DATA ...]] [--label [LABEL ...]] [--tuning_params [TUNING_PARAMS ...]] [--layers [LAYERS ...]] [--model MODEL] [--feat FEAT] [--set SET] [--with_os WITH_OS] [--target TARGET] [--epochs EPOCHS] [--runs RUNS] [--learning_rate LEARNING_RATE] [--drop DROP]
+```
 
 There's my [blog](http://blog.syntheticspeech.de/?s=nkululeko) with tutorials:
 * [Introduction](http://blog.syntheticspeech.de/2021/08/04/machine-learning-experiment-framework/)
@@ -359,6 +356,16 @@ F. Burkhardt, Johannes Wagner, Hagen Wierstorf, Florian Eyben and Björn Schulle
 Changelog
 =========
 
+Version 0.89.0
+--------------
+* added Roc plots and classification report on Debug
+
+
+Version 0.88.12
+---------------
+* added n_jobs for sklearn processing
+* re_named num_workers n_jobs
+
 Version 0.88.11
 --------------
 * removed hack in Praat script
@@ -828,9 +835,9 @@ Version 0.66.3
 
 Version 0.66.2
 --------------
-* enabled data-
+* enabled data-pacthes with quotes
 * enabled missing category labels
-* used
+* used tqdm for progress display
 
 Version 0.66.1
 --------------
````
{nkululeko-0.88.11 → nkululeko-0.89.0}/README.md

````diff
@@ -160,7 +160,7 @@ All of them take *--config <my_config.ini>* as an argument.
 * *configurations*: which experiments to combine
 * *--method* (optional): majority_voting, mean (default), max, sum, uncertainty, uncertainty_weighted, confidence_weighted, performance_weighted
 * *--threshold*: uncertainty threshold (1.0 means no threshold)
-* *--
+* *--weights*: weights for performance_weighted method (could be from previous UAR, ACC)
 * *--outfile* (optional): name of CSV file for output (default: ensemble_result.csv)
 * *--no_labels* (optional): indicate that no ground truth is given
 * **nkululeko.multidb**: do [multiple experiments](http://blog.syntheticspeech.de/2024/01/02/nkululeko-compare-several-databases/), comparing several databases cross and in itself
@@ -176,14 +176,11 @@ All of them take *--config <my_config.ini>* as an argument.
 * **nkululeko.predict**: [predict features](http://blog.syntheticspeech.de/2023/08/16/nkululeko-how-to-predict-labels-for-your-data-from-existing-models-and-check-them/) like SNR, MOS, arousal/valence, age/gender, with DNN models
 * **nkululeko.segment**: [segment a database](http://blog.syntheticspeech.de/2023/07/14/nkululeko-segmenting-a-database/) based on VAD (voice activity detection)
 * **nkululeko.resample**: check on all [sampling rates and change](http://blog.syntheticspeech.de/2023/08/31/how-to-fix-different-sampling-rates-in-a-dataset-with-nkululeko/) to 16kHz
-* **nkululeko.nkuluflag**: a convenient module to specify configuration parameters on the command-line.
-  * usage: nkuluflag.py [-h] [--config CONFIG] [--data [DATA ...]] [--label [LABEL ...]] [--tuning_params [TUNING_PARAMS ...]] [--layers [LAYERS ...]] [--model MODEL] [--feat FEAT] [--set SET]
-  [--with_os WITH_OS] [--target TARGET] [--epochs EPOCHS] [--runs RUNS] [--learning_rate LEARNING_RATE] [--drop DROP]
-
-
-
-
+* **nkululeko.nkuluflag**: a convenient module to specify configuration parameters on the command-line. Usage:
 
+```bash
+$ python -m nkululeko.nkuluflag.py [-h] [--config CONFIG] [--data [DATA ...]] [--label [LABEL ...]] [--tuning_params [TUNING_PARAMS ...]] [--layers [LAYERS ...]] [--model MODEL] [--feat FEAT] [--set SET] [--with_os WITH_OS] [--target TARGET] [--epochs EPOCHS] [--runs RUNS] [--learning_rate LEARNING_RATE] [--drop DROP]
+```
 
 There's my [blog](http://blog.syntheticspeech.de/?s=nkululeko) with tutorials:
 * [Introduction](http://blog.syntheticspeech.de/2021/08/04/machine-learning-experiment-framework/)
````
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/constants.py

```diff
@@ -1,2 +1,2 @@
-VERSION="0.88.11"
+VERSION="0.89.0"
 SAMPLING_RATE = 16000
```
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_agender.py

```diff
@@ -2,6 +2,7 @@
 
 from nkululeko.feat_extract.featureset import Featureset
 import os
+
 # import pandas as pd
 import audeer
 import nkululeko.glob_conf as glob_conf
@@ -10,6 +11,7 @@ import numpy as np
 import audinterface
 import torch
 
+
 class AgenderSet(Featureset):
     """
     Embeddings from the wav2vec2. based model finetuned on agender data, described in the paper
@@ -30,8 +32,7 @@ class AgenderSet(Featureset):
         if not os.path.isdir(model_root):
             cache_root = audeer.mkdir("cache")
             model_root = audeer.mkdir(model_root)
-            archive_path = audeer.download_url(
-                model_url, cache_root, verbose=True)
+            archive_path = audeer.download_url(model_url, cache_root, verbose=True)
             audeer.extract_archive(archive_path, model_root)
         cuda = "cuda" if torch.cuda.is_available() else "cpu"
         device = self.util.config_val("MODEL", "device", cuda)
@@ -62,7 +63,7 @@ class AgenderSet(Featureset):
             },
             sampling_rate=16000,
             resample=True,
-            num_workers=
+            num_workers=self.n_jobs,
             verbose=True,
         )
         self.df = hidden_states.process_index(self.data_df.index)
```
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_auddim.py

```diff
@@ -32,8 +32,7 @@ class AuddimSet(Featureset):
         if not os.path.isdir(model_root):
             cache_root = audeer.mkdir("cache")
             model_root = audeer.mkdir(model_root)
-            archive_path = audeer.download_url(
-                model_url, cache_root, verbose=True)
+            archive_path = audeer.download_url(model_url, cache_root, verbose=True)
             audeer.extract_archive(archive_path, model_root)
         cuda = "cuda" if torch.cuda.is_available() else "cpu"
         device = self.util.config_val("MODEL", "device", cuda)
@@ -63,7 +62,7 @@ class AuddimSet(Featureset):
             },
             sampling_rate=16000,
             resample=True,
-            num_workers=
+            num_workers=self.n_jobs,
             verbose=True,
         )
         self.df = logits.process_index(self.data_df.index)
```
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_audmodel.py

```diff
@@ -30,8 +30,7 @@ class AudmodelSet(Featureset):
         if not os.path.isdir(model_root):
             cache_root = audeer.mkdir("cache")
             model_root = audeer.mkdir(model_root)
-            archive_path = audeer.download_url(
-                model_url, cache_root, verbose=True)
+            archive_path = audeer.download_url(model_url, cache_root, verbose=True)
             audeer.extract_archive(archive_path, model_root)
         cuda = "cuda" if torch.cuda.is_available() else "cpu"
         device = self.util.config_val("MODEL", "device", cuda)
@@ -61,7 +60,7 @@ class AudmodelSet(Featureset):
             },
             sampling_rate=16000,
             resample=True,
-            num_workers=
+            num_workers=self.n_jobs,
             verbose=True,
         )
         self.df = hidden_states.process_index(self.data_df.index)
```
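The three wav2vec2-based extractors above (agender, auddim, audmodel) now take their worker count from the new `n_jobs` model option and hand it to audinterface as `num_workers`. A minimal, self-contained sketch of that call pattern, with a made-up processing function and file list (`rms` and the file names are illustrative, not part of nkululeko):

```python
import audinterface
import numpy as np


# Hypothetical per-file feature: RMS energy of the signal.
def rms(signal, sampling_rate):
    return np.sqrt(np.mean(signal**2))


interface = audinterface.Process(
    process_func=rms,
    sampling_rate=16000,
    resample=True,   # resample inputs to 16 kHz, as the extractors above do
    num_workers=8,   # corresponds to the new [MODEL] n_jobs value
    verbose=True,
)

# Hypothetical file list; in nkululeko the dataframe index is processed instead,
# via interface.process_index(self.data_df.index).
files = ["audio1.wav", "audio2.wav"]
series = interface.process_files(files)
print(series)
```

In the extractors themselves the interface wraps the downloaded pretrained model and is applied to the data index, as the last context line of each hunk shows.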
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/feat_extract/feats_oxbow.py

```diff
@@ -22,17 +22,15 @@ class Openxbow(Featureset):
         self.feature_set = eval(f"opensmile.FeatureSet.{self.featset}")
         store = self.util.get_path("store")
         storage = f"{store}{self.name}_{self.featset}.pkl"
-        extract = self.util.config_val(
-            "FEATS", "needs_feature_extraction", False)
+        extract = self.util.config_val("FEATS", "needs_feature_extraction", False)
         no_reuse = eval(self.util.config_val("FEATS", "no_reuse", "False"))
         if extract or no_reuse or not os.path.isfile(storage):
             # extract smile features first
-            self.util.debug(
-                "extracting openSmile features, this might take a while...")
+            self.util.debug("extracting openSmile features, this might take a while...")
             smile = opensmile.Smile(
                 feature_set=self.feature_set,
                 feature_level=opensmile.FeatureLevel.LowLevelDescriptors,
-                num_workers=
+                num_workers=self.n_jobs,
             )
             if isinstance(self.data_df.index, pd.MultiIndex):
                 is_multi_index = True
@@ -51,13 +49,11 @@ class Openxbow(Featureset):
             # save the smile features
             smile_df.to_csv(lld_name, sep=";", header=False)
             # get the path of the xbow java jar file
-            xbow_path = self.util.config_val(
-                "FEATS", "xbow.model", "openXBOW")
+            xbow_path = self.util.config_val("FEATS", "xbow.model", "openXBOW")
             # check if JAR file exist
             if not os.path.isfile(f"{xbow_path}/openXBOW.jar"):
                 # download using wget if not exist and locate in xbow_path
-                os.system(
-                    f"git clone https://github.com/openXBOW/openXBOW")
+                os.system(f"git clone https://github.com/openXBOW/openXBOW")
             # get the size of the codebook
             size = self.util.config_val("FEATS", "size", 500)
             # get the number of assignements
@@ -87,7 +83,7 @@ class Openxbow(Featureset):
         smile = opensmile.Smile(
             feature_set=opensmile.FeatureSet.eGeMAPSv02,  # always use eGemaps for this
             feature_level=opensmile.FeatureLevel.Functionals,
-            num_workers=
+            num_workers=self.n_jobs,
         )
         if isinstance(self.data_df.index, pd.MultiIndex):
             is_multi_index = True
```
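Both `opensmile.Smile` instances in this file now receive the same worker count. For reference, a standalone sketch of the parallelised eGeMAPS extraction from the last hunk, assuming a local file `speech.wav` (the path is hypothetical):

```python
import opensmile

# eGeMAPSv02 functionals, extracted with several parallel workers.
smile = opensmile.Smile(
    feature_set=opensmile.FeatureSet.eGeMAPSv02,
    feature_level=opensmile.FeatureLevel.Functionals,
    num_workers=8,  # mirrors the n_jobs value passed in the diff above
)

# One row of 88 functionals for the whole file.
feats = smile.process_file("speech.wav")
print(feats.shape)
```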
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model.py

```diff
@@ -3,6 +3,7 @@ import ast
 import pickle
 import random
 
+from joblib import parallel_backend
 import numpy as np
 import pandas as pd
 from sklearn.model_selection import GridSearchCV
@@ -34,6 +35,7 @@ class Model:
         self.epoch = 0
         self.logo = self.util.config_val("MODEL", "logo", False)
         self.xfoldx = self.util.config_val("MODEL", "k_fold_cross", False)
+        self.n_jobs = int(self.util.config_val("MODEL", "n_jobs", "8"))
 
     def set_model_type(self, type):
         self.model_type = type
@@ -75,7 +77,8 @@ class Model:
         ):
             train_x = feats.iloc[train_index].to_numpy()
             train_y = targets[train_index]
-            self.clf.fit(train_x, train_y)
+            with parallel_backend("threading", n_jobs=self.n_jobs):
+                self.clf.fit(train_x, train_y)
             truth_x = feats.iloc[test_index].to_numpy()
             truth_y = targets[test_index]
             predict_y = self.clf.predict(truth_x)
@@ -141,7 +144,8 @@ class Model:
         ):
             train_x = feats.iloc[train_index].to_numpy()
             train_y = targets.iloc[train_index]
-            self.clf.fit(train_x, train_y)
+            with parallel_backend("threading", n_jobs=self.n_jobs):
+                self.clf.fit(train_x, train_y)
 
             truth_x = feats.iloc[test_index].to_numpy()
             truth_y = targets.iloc[test_index]
@@ -171,7 +175,7 @@ class Model:
         )
 
     def train(self):
-        """Train the model"""
+        """Train the model."""
         # # first check if the model already has been trained
         # if os.path.isfile(self.store_path):
         #     self.load(self.run, self.epoch)
@@ -204,22 +208,39 @@ class Model:
         )
 
         tuning_params = self.util.config_val("MODEL", "tuning_params", False)
-        … [16 removed lines (the previous, unparallelised tuning/fit code) whose text was not captured in this extract]
+        with parallel_backend("threading", n_jobs=self.n_jobs):
+            if tuning_params:
+                # tune the model meta parameters
+                tuning_params = ast.literal_eval(tuning_params)
+                tuned_params = {}
+                try:
+                    scoring = glob_conf.config["MODEL"]["scoring"]
+                except KeyError:
+                    self.util.error("got tuning params but no scoring")
+                for param in tuning_params:
+                    values = ast.literal_eval(glob_conf.config["MODEL"][param])
+                    tuned_params[param] = values
+                self.util.debug(f"tuning on {tuned_params}")
+                self.clf = GridSearchCV(
+                    self.clf, tuned_params, refit=True, verbose=3, scoring=scoring
+                )
+                try:
+                    class_weight = eval(
+                        self.util.config_val("MODEL", "class_weight", "False")
+                    )
+                    if class_weight:
+                        self.util.debug("using class weight")
+                        self.clf.fit(
+                            feats,
+                            self.df_train[self.target],
+                            sample_weight=self.classes_weights,
+                        )
+                    else:
+                        self.clf.fit(feats, self.df_train[self.target])
+                except KeyError:
+                    self.clf.fit(feats, self.df_train[self.target])
+                self.util.debug(f"winner parameters: {self.clf.best_params_}")
+            else:
                 class_weight = self.util.config_val("MODEL", "class_weight", False)
                 if class_weight:
                     self.util.debug("using class weight")
@@ -229,22 +250,8 @@ class Model:
                         sample_weight=self.classes_weights,
                     )
                 else:
-                … [2 removed lines whose text was not captured in this extract]
-                self.clf.fit(feats, self.df_train[self.target])
-            self.util.debug(f"winner parameters: {self.clf.best_params_}")
-        else:
-            class_weight = self.util.config_val("MODEL", "class_weight", False)
-            if class_weight:
-                self.util.debug("using class weight")
-                self.clf.fit(
-                    feats,
-                    self.df_train[self.target],
-                    sample_weight=self.classes_weights,
-                )
-            else:
-                labels = self.df_train[self.target]
-                self.clf.fit(feats, labels)
+                    labels = self.df_train[self.target]
+                    self.clf.fit(feats, labels)
 
     def get_predictions(self):
         # predictions = self.clf.predict(self.feats_test.to_numpy())
```
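The substance of the model.py change: the class now reads `n_jobs` from the `[MODEL]` config section (default 8) and wraps its scikit-learn `fit` calls, including the `GridSearchCV` tuning path, in joblib's `parallel_backend("threading", ...)` context. A self-contained sketch of that pattern with synthetic data and a generic `RandomForestClassifier` (both illustrative, not nkululeko's actual estimator):

```python
import numpy as np
from joblib import parallel_backend
from sklearn.ensemble import RandomForestClassifier

# Synthetic data standing in for the extracted feature frame and targets.
rng = np.random.default_rng(0)
feats = rng.normal(size=(200, 20))
targets = rng.integers(0, 2, size=200)

clf = RandomForestClassifier(n_estimators=100)

# Same idea as the new code path: run the estimator's internal joblib
# parallelism with a fixed number of threads.
n_jobs = 8  # corresponds to the new [MODEL] n_jobs option
with parallel_backend("threading", n_jobs=n_jobs):
    clf.fit(feats, targets)

print(clf.score(feats, targets))
```

The context manager only sets the backend and worker count for joblib-aware estimators; estimators that do not use joblib internally are simply unaffected by it.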
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_cnn.py

```diff
@@ -80,7 +80,7 @@ class CNNModel(Model):
         # batch size
         self.batch_size = int(self.util.config_val("MODEL", "batch_size", 8))
         # number of parallel processes
-        self.num_workers =
+        self.num_workers = self.n_jobs
 
         # set up the data_loaders
 
@@ -100,13 +100,13 @@ class CNNModel(Model):
             train_set,
             batch_size=self.batch_size,
             shuffle=True,
-            num_workers=self.
+            num_workers=self.n_jobs,
         )
         self.testloader = torch.utils.data.DataLoader(
             test_set,
             batch_size=self.batch_size,
             shuffle=False,
-            num_workers=self.
+            num_workers=self.n_jobs,
         )
 
     class Dataset_image(Dataset):
@@ -136,7 +136,7 @@ class CNNModel(Model):
             test_set,
             batch_size=self.batch_size,
             shuffle=False,
-            num_workers=self.
+            num_workers=self.n_jobs,
         )
 
     def reset_test(self, df_test, feats_test):
```
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_mlp.py

```diff
@@ -71,7 +71,7 @@ class MLPModel(Model):
         # batch size
         self.batch_size = int(self.util.config_val("MODEL", "batch_size", 8))
         # number of parallel processes
-        self.num_workers =
+        self.num_workers = self.n_jobs
         if feats_train.isna().to_numpy().any():
             self.util.debug(
                 f"Model, train: replacing {feats_train.isna().sum().sum()} NANs"
```
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_mlp_regression.py

```diff
@@ -64,7 +64,7 @@ class MLP_Reg_model(Model):
         # batch size
         self.batch_size = int(self.util.config_val("MODEL", "batch_size", 8))
         # number of parallel processes
-        self.num_workers =
+        self.num_workers = self.n_jobs
         # set up the data_loaders
         if feats_train.isna().to_numpy().any():
             self.util.debug(
@@ -117,7 +117,7 @@ class MLP_Reg_model(Model):
             dataset=data_set,
             batch_size=self.batch_size,
             shuffle=shuffle,
-            num_workers=self.
+            num_workers=self.n_jobs,
         )
         return loader
 
```
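In the torch-based models the renamed setting feeds the `num_workers` argument of the `DataLoader`s instead. A small sketch of what that argument controls, using a toy tensor dataset (illustrative only, not nkululeko's data pipeline):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Toy dataset standing in for the feature/label tensors used by the MLP/CNN models.
features = torch.randn(64, 10)
labels = torch.randint(0, 2, (64,))
train_set = TensorDataset(features, labels)

# num_workers > 0 prepares batches in separate worker processes,
# matching the renamed n_jobs setting in the diff above.
trainloader = DataLoader(
    train_set,
    batch_size=8,
    shuffle=True,
    num_workers=2,
)

for batch_x, batch_y in trainloader:
    pass  # a training step would go here
```

With `num_workers` greater than zero the loading runs in worker processes, so on spawn-based platforms (Windows, macOS) the iteration should sit under an `if __name__ == "__main__":` guard.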
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/models/model_tree.py

```diff
@@ -12,4 +12,6 @@ class Tree_model(Model):
     def __init__(self, df_train, df_test, feats_train, feats_test):
         super().__init__(df_train, df_test, feats_train, feats_test)
         self.name = "tree"
-        self.clf = DecisionTreeClassifier(
+        self.clf = DecisionTreeClassifier(
+            random_state=42
+        )  # set up the classifier
```
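Pinning `random_state=42` makes the decision tree deterministic, so repeated runs on identical features reproduce the same splits. A quick illustrative check on synthetic data (not nkululeko data):

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 5))
y = rng.integers(0, 2, size=100)

# Two classifiers with the same fixed seed produce identical predictions.
a = DecisionTreeClassifier(random_state=42).fit(X, y)
b = DecisionTreeClassifier(random_state=42).fit(X, y)
print(np.array_equal(a.predict(X), b.predict(X)))  # True
```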
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/reporting/reporter.py

```diff
@@ -27,6 +27,7 @@ from sklearn.metrics import (
     r2_score,
     roc_auc_score,
     roc_curve,
+    RocCurveDisplay,
 )
 
 import nkululeko.glob_conf as glob_conf
@@ -75,6 +76,7 @@ class Reporter:
         self.result = Result(0, 0, 0, 0, "unknown")
         self.run = run
         self.epoch = epoch
+        self.model_type = self.util.get_model_type()
         self._set_metric()
         self.filenameadd = ""
         self.cont_to_cat = False
@@ -387,6 +389,7 @@ class Reporter:
         epoch = self.epoch
         """Print all evaluation values to text file."""
         res_dir = self.util.get_path("res_dir")
+        fig_dir = self.util.get_path("fig_dir")
         file_name = f"{res_dir}{self.util.get_exp_name()}_{epoch}{self.filenameadd}.txt"
         if self.util.exp_is_classification():
             labels = glob_conf.labels
@@ -397,6 +400,10 @@ class Reporter:
                     target_names=labels,
                     output_dict=True,
                 )
+                # print classifcation report in console
+                self.util.debug(
+                    f"\n {classification_report(self.truths, self.preds, target_names=labels)}"
+                )
             except ValueError as e:
                 self.util.debug(
                     "Reporter: caught a ValueError when trying to get"
@@ -415,6 +422,17 @@ class Reporter:
         if len(np.unique(self.truths)) == 2:
             fpr, tpr, _ = roc_curve(self.truths, self.preds)
             auc_score = auc(fpr, tpr)
+            display = RocCurveDisplay(
+                fpr=fpr,
+                tpr=tpr,
+                roc_auc=auc_score,
+                estimator_name=f"{self.model_type} estimator",
+            )
+            # save plot
+            plot_path = f"{fig_dir}{self.util.get_exp_name()}_{epoch}{self.filenameadd}_roc.{self.format}"
+            display.plot(ax=None)
+            plt.savefig(plot_path)
+            self.util.debug(f"Saved ROC curve to {plot_path}")
             pauc_score = roc_auc_score(self.truths, self.preds, max_fpr=0.1)
             auc_pauc = f"auc: {auc_score:.3f}, pauc: {pauc_score:.3f} from epoch: {epoch}"
             self.util.debug(auc_pauc)
```
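For binary targets the reporter now turns the already computed `fpr`/`tpr` arrays into a saved ROC plot via `sklearn.metrics.RocCurveDisplay`. A standalone sketch of that pattern with synthetic truths and predictions and a hypothetical output path:

```python
import matplotlib
matplotlib.use("Agg")  # write the figure to a file without a display
import matplotlib.pyplot as plt
import numpy as np
from sklearn.metrics import RocCurveDisplay, auc, roc_curve

# Synthetic binary ground truth and (hard) predictions.
truths = np.array([0, 0, 1, 1, 0, 1, 1, 0])
preds = np.array([0, 1, 1, 1, 0, 0, 1, 0])

fpr, tpr, _ = roc_curve(truths, preds)
auc_score = auc(fpr, tpr)

display = RocCurveDisplay(
    fpr=fpr, tpr=tpr, roc_auc=auc_score, estimator_name="svm estimator"
)
display.plot(ax=None)
plt.savefig("roc.png")  # the reporter derives the real path from the experiment name
```

In the reporter the output path is built from the figure directory, experiment name and epoch, and the estimator name from the configured model type, as the diff above shows.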
{nkululeko-0.88.11 → nkululeko-0.89.0}/nkululeko/utils/util.py

```diff
@@ -27,6 +27,7 @@ class Util:
         "pkl",
         "eGeMAPSv02",
         "functionals",
+        "n_jobs",
     ]
 
     def __init__(self, caller=None, has_config=True):
@@ -150,7 +151,7 @@ class Util:
         # self.logged_configs.clear()
 
     def get_save_name(self):
-        """Return a relative path to a name to save the experiment"""
+        """Return a relative path to a name to save the experiment."""
         store = self.get_path("store")
         return f"{store}/{self.get_exp_name()}.pkl"
 
@@ -161,7 +162,7 @@ class Util:
         return f"{store}/pred_{target}_{pred_name}.csv"
 
     def is_categorical(self, pd_series):
-        """Check if a dataframe column is categorical"""
+        """Check if a dataframe column is categorical."""
         return pd_series.dtype.name == "object" or isinstance(
             pd_series.dtype, pd.CategoricalDtype
         )
@@ -174,7 +175,7 @@ class Util:
         """Get the experiment directory."""
         root = os.path.join(self.config["EXP"]["root"], "")
         name = self.config["EXP"]["name"]
-        dir_name = f"{root}{name}"
+        dir_name = f"{root}/{name}"
         audeer.mkdir(dir_name)
         return dir_name
 
@@ -307,7 +308,7 @@ class Util:
         self.config[section][key] = str(value)
 
     def check_df(self, i, df):
-        """Check a dataframe"""
+        """Check a dataframe."""
         print(f"check {i}: {df.shape}")
         print(df.head(1))
 
```
{nkululeko-0.88.11 → nkululeko-0.89.0/nkululeko.egg-info}/PKG-INFO

The changes to this copy of PKG-INFO are identical to the PKG-INFO diff shown above: the version bump to 0.89.0, the completed *--weights* ensemble option, the reworked nkululeko.nkuluflag usage block, and the new changelog entries for versions 0.88.12 and 0.89.0.
All remaining files listed above with +0 -0 are unchanged between the two versions.