guidepost 0.2.11__tar.gz → 0.2.12__tar.gz
- {guidepost-0.2.11/guidepost.egg-info → guidepost-0.2.12}/PKG-INFO +14 -15
- {guidepost-0.2.11 → guidepost-0.2.12}/README.md +13 -14
- guidepost-0.2.12/guidepost/version.py +2 -0
- {guidepost-0.2.11 → guidepost-0.2.12/guidepost.egg-info}/PKG-INFO +14 -15
- guidepost-0.2.11/guidepost/version.py +0 -2
- {guidepost-0.2.11 → guidepost-0.2.12}/LICENSE +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/figs/__init__.py +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/figs/guidepost_tutorial_info.png +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/guidepost/__init__.py +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/guidepost/guidepost.js +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/guidepost/guidepost.py +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/guidepost.egg-info/SOURCES.txt +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/guidepost.egg-info/dependency_links.txt +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/guidepost.egg-info/requires.txt +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/guidepost.egg-info/top_level.txt +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/pyproject.toml +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/setup.cfg +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/setup.py +0 -0
- {guidepost-0.2.11 → guidepost-0.2.12}/tutorials/__init__.py +0 -0
@@ -1,6 +1,6 @@
 Metadata-Version: 2.2
 Name: guidepost
-Version: 0.2.
+Version: 0.2.12
 Summary: Guidepost. An overview visualization for understanding supercomputer queue data.
 Home-page: https://github.com/cscully-allison/guidepost
 Author: Connor Scully-Allison
@@ -28,7 +28,7 @@ Dynamic: summary
 
 # Guidepost
 
-Guidepost is a Python library designed
+Guidepost is a Python library designed to visualize High Performance Computing (HPC) job data in jupyter notebooks. It simplifies the process of understanding HPC workloads by providing a single, interactive visualization that offers an intuitive overview of job performance, resource usage, and other critical metrics.
 
 ---
 
@@ -37,7 +37,6 @@ Guidepost is a Python library designed for seamless integration into Jupyter not
 - **Jupyter Notebook Integration**: Designed for your existing workflow. Load and interact with the visualization directly in your Jupyter environment.
 - **HPC Job Data Insights**: Visualize key metrics, including job runtimes, resource usage, and queue performance.
 - **Interactive Exploration**: Export selections of specific jobs or groups of jobs for deeper analysis.
-- **Lightweight and Easy to Use**: Focused on simplicity and efficiency for HPC users.
 
 ---
 
@@ -69,7 +68,7 @@ jobs_data = pd.read_parquet("data/jobs_data.parquet")
 gp.load_data(jobs_data)
 ```
 
-Guidepost supports input data in a
+Guidepost supports input data in a pandas DataFrame format.
 
 At least three numerical and 2 categorical columns are required. Datetime columns are also supported for encoding on the x axis.
 
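
The README text in this hunk requires at least three numerical and two categorical columns. A minimal sketch of such a pre-check follows. It uses plain Python dictionaries as stand-ins for DataFrame rows. The helper names `column_kind` and `meets_minimum_columns`, and the `job_id` and `status` column names, are assumptions made for illustration and are not part of Guidepost.

```python
from datetime import datetime


def column_kind(values):
    """Classify one column's values as datetime, numerical, categorical, or other."""
    if all(isinstance(v, datetime) for v in values):
        return "datetime"
    # bool is a subclass of int, so exclude it explicitly.
    if all(isinstance(v, (int, float)) and not isinstance(v, bool) for v in values):
        return "numerical"
    if all(isinstance(v, str) for v in values):
        return "categorical"
    return "other"


def meets_minimum_columns(records, min_numerical=3, min_categorical=2):
    """Check the stated minimum of numerical and categorical columns."""
    kinds = [column_kind([row[name] for row in records]) for name in records[0]]
    return (
        kinds.count("numerical") >= min_numerical
        and kinds.count("categorical") >= min_categorical
    )


# Hypothetical records modeled on the sample table rows in this diff.
sample = [
    {"job_id": 12345, "start_time": datetime(2023, 11, 1, 21, 19, 33),
     "queue_wait": 5.2, "nodes_requested": 10,
     "partition": "short", "status": "Complete", "user": "User1"},
    {"job_id": 12346, "start_time": datetime(2023, 11, 1, 21, 20, 1),
     "queue_wait": 12.0, "nodes_requested": 20,
     "partition": "long", "status": "Running", "user": "User2"},
]

print(meets_minimum_columns(sample))
```

Running a check like this before `load_data` would surface a column shortfall early. How Guidepost itself enforces the requirement is not shown in this diff.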
@@ -80,7 +79,7 @@ Here is a sample table containg jobs-related data from a supercomputer schedulin
 | 12345 | 2023-11-01 21:19:33 |5.2 | 10 | short | Complete | User1 |
 | 12346 | 2023-11-01 21:20:01 |12.0 | 20 | long | Running | User2 |
 
-In this example, the three data values we will use for our x, y and color variables are: start_time, queue_wait and nodes_requested. We would also like to use `parition` to facet
+In this example, the three data values we will use for our x, y and color variables are: start_time, queue_wait and nodes_requested. We would also like to use `parition` to facet our data and `user` as an additional categorical variable to filter on. In the [next section](#3-configure-visualization), we show how to specify which columns in your dataset correspond to parts of the visualizaiton.
 
 The `load_data()` function will format your data for json serialization and will update the visualization if it has already been run. This function will report out any columns or rows which are dropped from the original dataset due to conainting `null`/`NaN`/`None` values or unallowed datatypes like `timedelta`s.
 
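
The `load_data()` paragraph in this hunk says that rows or columns containing missing values or disallowed datatypes are dropped and reported. The sketch below shows one way such a report could be produced. It is an illustration under assumptions, not Guidepost's implementation: plain dictionaries stand in for DataFrame rows, the `runtime` column and the helper names are hypothetical, and the choice to drop whole columns for `timedelta` values but single rows for missing values is a guess.

```python
import math
from datetime import timedelta


def is_missing(value):
    """True for None and float NaN, the values treated as missing here."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def drop_unusable(records):
    """Return (kept_rows, dropped_columns, dropped_row_indices)."""
    # Assumption: a column holding any timedelta is removed entirely.
    dropped_columns = sorted(
        {name for row in records for name, v in row.items() if isinstance(v, timedelta)}
    )
    kept, dropped_rows = [], []
    for index, row in enumerate(records):
        trimmed = {n: v for n, v in row.items() if n not in dropped_columns}
        # Assumption: a row with any missing value is removed.
        if any(is_missing(v) for v in trimmed.values()):
            dropped_rows.append(index)
        else:
            kept.append(trimmed)
    return kept, dropped_columns, dropped_rows


rows = [
    {"queue_wait": 5.2, "nodes_requested": 10, "runtime": timedelta(minutes=3)},
    {"queue_wait": float("nan"), "nodes_requested": 20, "runtime": timedelta(minutes=9)},
    {"queue_wait": 12.0, "nodes_requested": None, "runtime": timedelta(minutes=1)},
]
kept, dropped_columns, dropped_rows = drop_unusable(rows)
print(f"dropped columns: {dropped_columns}; dropped rows: {dropped_rows}")
```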
@@ -102,7 +101,7 @@ gp.vis_configs = {
 - `color`: Name of the column in the dataframe which will be shown by the darkness of each square's color.
 - `color_agg`: The aggregation method used to determine the color. Can be: 'avg', 'variance', 'std', 'sum', or 'median'
 - `categorical`: Name of the column containing categorical data values which will be shown on a bar chart associated with each group of the data.
-- `facet_by`: Name of the column containing categorical data values which dictate the highest level grouping of the data
+- `facet_by`: Name of the column containing categorical data values which dictate the highest level grouping of the data and organizes the data into groups of subcharts.
 
 See the [Vis Configs Section](#vis_configs) for more details on datatype restrictions for each configuration.
 
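
The configuration keys listed in this hunk lend themselves to a quick sanity check before they are assigned to `gp.vis_configs`. The following is a rough sketch. The `config_problems` helper is hypothetical, and the `x` and `y` key names are assumed from the README's mention of x, y and color variables. Only the `color_agg` values come directly from the text above.

```python
# Aggregation names quoted in the README text.
ALLOWED_COLOR_AGGS = {"avg", "variance", "std", "sum", "median"}
# Keys expected to name a column; 'x' and 'y' are assumed key names.
COLUMN_KEYS = ("x", "y", "color", "categorical", "facet_by")


def config_problems(vis_configs, columns):
    """Return a list of human-readable problems found in a config dictionary."""
    problems = []
    for key in COLUMN_KEYS:
        name = vis_configs.get(key)
        if name is not None and name not in columns:
            problems.append(f"{key}: column {name!r} not found")
    agg = vis_configs.get("color_agg")
    if agg is not None and agg not in ALLOWED_COLOR_AGGS:
        problems.append(f"color_agg: {agg!r} is not one of {sorted(ALLOWED_COLOR_AGGS)}")
    return problems


columns = ["start_time", "queue_wait", "nodes_requested", "partition", "user"]
configs = {
    "x": "start_time",
    "y": "queue_wait",
    "color": "nodes_requested",
    "color_agg": "mean",  # deliberately wrong: the README lists 'avg', not 'mean'
    "categorical": "user",
    "facet_by": "partition",
}
print(config_problems(configs, columns))
```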
@@ -121,7 +120,7 @@ Here is an example of what the viusalization will look like:
 Here we explain some elements of the visualization:
 
 #### `Data Grouping Name`:
-This is name of the high level groups which are dictated by the `facet_by` configuration. If your data only logically contains one group, adding a synthetic column
+This is name of the high level groups which are dictated by the `facet_by` configuration. Each group of subcharts corresponds to all data associated with an instance of a value in `facet_by`. If your data only logically contains one group, adding a synthetic column and specifying that column name for the `facet_by` cofiguration is advised.
 
 #### `Main Summary View`:
 The main summary view is the primary view associated with each group of data specified by `facet_by` configuration. This view shows the data organized by the x and y axes. Data values at similar locations along the x and y axes are grouped into squares at that location. The amount of data in each row and column are shown with the histograms framing this view. The color of each square shows an aggregrate of a third numerical variable that exists on each data value.
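
The `Main Summary View` description in this hunk (grouping nearby points into squares, aggregating a third variable for color, and counting records per row and column) can be sketched in a few lines. This is an illustration only. The fixed bin widths, the function names, and the use of population variance and standard deviation are assumptions, and Guidepost's own binning may differ.

```python
import statistics
from collections import Counter, defaultdict

# Maps the README's color_agg names to functions; the population variants are assumed.
AGGREGATIONS = {
    "avg": statistics.fmean,
    "variance": statistics.pvariance,
    "std": statistics.pstdev,
    "sum": sum,
    "median": statistics.median,
}


def bin_points(points, x_width, y_width):
    """Group (x, y, color) points into grid squares keyed by (column, row)."""
    squares = defaultdict(list)
    for x, y, color in points:
        squares[(int(x // x_width), int(y // y_width))].append(color)
    return dict(squares)


def square_colors(squares, color_agg="avg"):
    """Aggregate the color values inside each square."""
    aggregate = AGGREGATIONS[color_agg]
    return {key: aggregate(values) for key, values in squares.items()}


def marginal_counts(squares):
    """Count records per grid column and per grid row, as the framing histograms do."""
    per_column, per_row = Counter(), Counter()
    for (column, row), values in squares.items():
        per_column[column] += len(values)
        per_row[row] += len(values)
    return per_column, per_row


points = [(1.0, 5.2, 10), (2.0, 6.0, 20), (11.0, 12.0, 30)]
squares = bin_points(points, x_width=10, y_width=10)
grid = square_colors(squares, color_agg="avg")
per_column, per_row = marginal_counts(squares)
print(grid, dict(per_column), dict(per_row))
```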
@@ -135,7 +134,7 @@ The bar chart in the lower right hand corner of each row of subcharts shows the
 
 
 #### `Current Seleciton of Records for Export`:
-Records can be selected for export from the visualization by brushing over the right and bottom histograms. The area of selected data is indicated by the orange coloring on the main summary view. The
+Records can be selected for export from the visualization by brushing over the right and bottom histograms. The area of selected data is indicated by the orange coloring on the main summary view. The amount of records selected is indicated at the top left for each chart. Selections can be made across multiple charts. The final selection is returned as one dataframe containg all selections.
 
 
 ### 5. Retrieve Selections from Visualization
@@ -146,7 +145,7 @@ df = gp.retrieve_selected_data()
 
 After selecting data by brushing over either the bottom or right histograms associated with a subchart, you can retrieve selected data using the above method.
 
-This will return a pandas
+This will return a pandas DataFrame containing all your subselected rows from the original dataset.
 
 
 
@@ -165,12 +164,6 @@ Below is an example of the kind of data Guidepost works with:
 
 ## API Reference
 
-### `load_data`
-- **Description**: Loads a pandas dataframe into the guidepost system for visualizaiton. Will report data dropped from the dataframe if it contains NaNs, `timedeltas`, `arrays` in cells, or other invalid values.
-- **Arguments**:
-- `in_df` (Pandas Dataframe): The dataframe containing data to be visualized.
-- `supress_warnings` (Boolean): Specifies whether to suppress warnings when loading data. Defaults to `False`
-
 ### `vis_data`
 - **Description**: Holds the vis data to passed to the visualization. Updates to this variable will automatically update the visualization.
 
@@ -187,6 +180,12 @@ Vis configurations must be specified as a python dictonary with the following fi
 - 'facet_by': A categorical variable from the dataset. Automatically looks for 'queue' or 'partition' if this config is not specified.
 
 
+### `load_data(in_df, supress_warnings)`
+- **Description**: Loads a pandas dataframe into the guidepost system for visualizaiton. Will report data dropped from the dataframe if it contains NaNs, `timedeltas`, `arrays` in cells, or other invalid values.
+- **Arguments**:
+- `in_df` (Pandas Dataframe): The dataframe containing data to be visualized.
+- `supress_warnings` (Boolean): Specifies whether to suppress warnings when loading data. Defaults to `False`
+
 
 ### `retrieve_selected_data()`
 - **Description**: Returns selected data back from the visualization.
@@ -1,6 +1,6 @@
 # Guidepost
 
-Guidepost is a Python library designed
+Guidepost is a Python library designed to visualize High Performance Computing (HPC) job data in jupyter notebooks. It simplifies the process of understanding HPC workloads by providing a single, interactive visualization that offers an intuitive overview of job performance, resource usage, and other critical metrics.
 
 ---
 
@@ -9,7 +9,6 @@ Guidepost is a Python library designed for seamless integration into Jupyter not
 - **Jupyter Notebook Integration**: Designed for your existing workflow. Load and interact with the visualization directly in your Jupyter environment.
 - **HPC Job Data Insights**: Visualize key metrics, including job runtimes, resource usage, and queue performance.
 - **Interactive Exploration**: Export selections of specific jobs or groups of jobs for deeper analysis.
-- **Lightweight and Easy to Use**: Focused on simplicity and efficiency for HPC users.
 
 ---
 
@@ -41,7 +40,7 @@ jobs_data = pd.read_parquet("data/jobs_data.parquet")
 gp.load_data(jobs_data)
 ```
 
-Guidepost supports input data in a
+Guidepost supports input data in a pandas DataFrame format.
 
 At least three numerical and 2 categorical columns are required. Datetime columns are also supported for encoding on the x axis.
 
@@ -52,7 +51,7 @@ Here is a sample table containg jobs-related data from a supercomputer schedulin
 | 12345 | 2023-11-01 21:19:33 |5.2 | 10 | short | Complete | User1 |
 | 12346 | 2023-11-01 21:20:01 |12.0 | 20 | long | Running | User2 |
 
-In this example, the three data values we will use for our x, y and color variables are: start_time, queue_wait and nodes_requested. We would also like to use `parition` to facet
+In this example, the three data values we will use for our x, y and color variables are: start_time, queue_wait and nodes_requested. We would also like to use `parition` to facet our data and `user` as an additional categorical variable to filter on. In the [next section](#3-configure-visualization), we show how to specify which columns in your dataset correspond to parts of the visualizaiton.
 
 The `load_data()` function will format your data for json serialization and will update the visualization if it has already been run. This function will report out any columns or rows which are dropped from the original dataset due to conainting `null`/`NaN`/`None` values or unallowed datatypes like `timedelta`s.
 
@@ -74,7 +73,7 @@ gp.vis_configs = {
 - `color`: Name of the column in the dataframe which will be shown by the darkness of each square's color.
 - `color_agg`: The aggregation method used to determine the color. Can be: 'avg', 'variance', 'std', 'sum', or 'median'
 - `categorical`: Name of the column containing categorical data values which will be shown on a bar chart associated with each group of the data.
-- `facet_by`: Name of the column containing categorical data values which dictate the highest level grouping of the data
+- `facet_by`: Name of the column containing categorical data values which dictate the highest level grouping of the data and organizes the data into groups of subcharts.
 
 See the [Vis Configs Section](#vis_configs) for more details on datatype restrictions for each configuration.
 
@@ -93,7 +92,7 @@ Here is an example of what the viusalization will look like:
 Here we explain some elements of the visualization:
 
 #### `Data Grouping Name`:
-This is name of the high level groups which are dictated by the `facet_by` configuration. If your data only logically contains one group, adding a synthetic column
+This is name of the high level groups which are dictated by the `facet_by` configuration. Each group of subcharts corresponds to all data associated with an instance of a value in `facet_by`. If your data only logically contains one group, adding a synthetic column and specifying that column name for the `facet_by` cofiguration is advised.
 
 #### `Main Summary View`:
 The main summary view is the primary view associated with each group of data specified by `facet_by` configuration. This view shows the data organized by the x and y axes. Data values at similar locations along the x and y axes are grouped into squares at that location. The amount of data in each row and column are shown with the histograms framing this view. The color of each square shows an aggregrate of a third numerical variable that exists on each data value.
@@ -107,7 +106,7 @@ The bar chart in the lower right hand corner of each row of subcharts shows the
 
 
 #### `Current Seleciton of Records for Export`:
-Records can be selected for export from the visualization by brushing over the right and bottom histograms. The area of selected data is indicated by the orange coloring on the main summary view. The
+Records can be selected for export from the visualization by brushing over the right and bottom histograms. The area of selected data is indicated by the orange coloring on the main summary view. The amount of records selected is indicated at the top left for each chart. Selections can be made across multiple charts. The final selection is returned as one dataframe containg all selections.
 
 
 ### 5. Retrieve Selections from Visualization
@@ -118,7 +117,7 @@ df = gp.retrieve_selected_data()
 
 After selecting data by brushing over either the bottom or right histograms associated with a subchart, you can retrieve selected data using the above method.
 
-This will return a pandas
+This will return a pandas DataFrame containing all your subselected rows from the original dataset.
 
 
 
@@ -137,12 +136,6 @@ Below is an example of the kind of data Guidepost works with:
 
 ## API Reference
 
-### `load_data`
-- **Description**: Loads a pandas dataframe into the guidepost system for visualizaiton. Will report data dropped from the dataframe if it contains NaNs, `timedeltas`, `arrays` in cells, or other invalid values.
-- **Arguments**:
-- `in_df` (Pandas Dataframe): The dataframe containing data to be visualized.
-- `supress_warnings` (Boolean): Specifies whether to suppress warnings when loading data. Defaults to `False`
-
 ### `vis_data`
 - **Description**: Holds the vis data to passed to the visualization. Updates to this variable will automatically update the visualization.
 
@@ -159,6 +152,12 @@ Vis configurations must be specified as a python dictonary with the following fi
 - 'facet_by': A categorical variable from the dataset. Automatically looks for 'queue' or 'partition' if this config is not specified.
 
 
+### `load_data(in_df, supress_warnings)`
+- **Description**: Loads a pandas dataframe into the guidepost system for visualizaiton. Will report data dropped from the dataframe if it contains NaNs, `timedeltas`, `arrays` in cells, or other invalid values.
+- **Arguments**:
+- `in_df` (Pandas Dataframe): The dataframe containing data to be visualized.
+- `supress_warnings` (Boolean): Specifies whether to suppress warnings when loading data. Defaults to `False`
+
 
 ### `retrieve_selected_data()`
 - **Description**: Returns selected data back from the visualization.
@@ -1,6 +1,6 @@
 Metadata-Version: 2.2
 Name: guidepost
-Version: 0.2.
+Version: 0.2.12
 Summary: Guidepost. An overview visualization for understanding supercomputer queue data.
 Home-page: https://github.com/cscully-allison/guidepost
 Author: Connor Scully-Allison
@@ -28,7 +28,7 @@ Dynamic: summary
 
 # Guidepost
 
-Guidepost is a Python library designed
+Guidepost is a Python library designed to visualize High Performance Computing (HPC) job data in jupyter notebooks. It simplifies the process of understanding HPC workloads by providing a single, interactive visualization that offers an intuitive overview of job performance, resource usage, and other critical metrics.
 
 ---
 
@@ -37,7 +37,6 @@ Guidepost is a Python library designed for seamless integration into Jupyter not
 - **Jupyter Notebook Integration**: Designed for your existing workflow. Load and interact with the visualization directly in your Jupyter environment.
 - **HPC Job Data Insights**: Visualize key metrics, including job runtimes, resource usage, and queue performance.
 - **Interactive Exploration**: Export selections of specific jobs or groups of jobs for deeper analysis.
-- **Lightweight and Easy to Use**: Focused on simplicity and efficiency for HPC users.
 
 ---
 
@@ -69,7 +68,7 @@ jobs_data = pd.read_parquet("data/jobs_data.parquet")
 gp.load_data(jobs_data)
 ```
 
-Guidepost supports input data in a
+Guidepost supports input data in a pandas DataFrame format.
 
 At least three numerical and 2 categorical columns are required. Datetime columns are also supported for encoding on the x axis.
 
@@ -80,7 +79,7 @@ Here is a sample table containg jobs-related data from a supercomputer schedulin
 | 12345 | 2023-11-01 21:19:33 |5.2 | 10 | short | Complete | User1 |
 | 12346 | 2023-11-01 21:20:01 |12.0 | 20 | long | Running | User2 |
 
-In this example, the three data values we will use for our x, y and color variables are: start_time, queue_wait and nodes_requested. We would also like to use `parition` to facet
+In this example, the three data values we will use for our x, y and color variables are: start_time, queue_wait and nodes_requested. We would also like to use `parition` to facet our data and `user` as an additional categorical variable to filter on. In the [next section](#3-configure-visualization), we show how to specify which columns in your dataset correspond to parts of the visualizaiton.
 
 The `load_data()` function will format your data for json serialization and will update the visualization if it has already been run. This function will report out any columns or rows which are dropped from the original dataset due to conainting `null`/`NaN`/`None` values or unallowed datatypes like `timedelta`s.
 
@@ -102,7 +101,7 @@ gp.vis_configs = {
 - `color`: Name of the column in the dataframe which will be shown by the darkness of each square's color.
 - `color_agg`: The aggregation method used to determine the color. Can be: 'avg', 'variance', 'std', 'sum', or 'median'
 - `categorical`: Name of the column containing categorical data values which will be shown on a bar chart associated with each group of the data.
-- `facet_by`: Name of the column containing categorical data values which dictate the highest level grouping of the data
+- `facet_by`: Name of the column containing categorical data values which dictate the highest level grouping of the data and organizes the data into groups of subcharts.
 
 See the [Vis Configs Section](#vis_configs) for more details on datatype restrictions for each configuration.
 
@@ -121,7 +120,7 @@ Here is an example of what the viusalization will look like:
 Here we explain some elements of the visualization:
 
 #### `Data Grouping Name`:
-This is name of the high level groups which are dictated by the `facet_by` configuration. If your data only logically contains one group, adding a synthetic column
+This is name of the high level groups which are dictated by the `facet_by` configuration. Each group of subcharts corresponds to all data associated with an instance of a value in `facet_by`. If your data only logically contains one group, adding a synthetic column and specifying that column name for the `facet_by` cofiguration is advised.
 
 #### `Main Summary View`:
 The main summary view is the primary view associated with each group of data specified by `facet_by` configuration. This view shows the data organized by the x and y axes. Data values at similar locations along the x and y axes are grouped into squares at that location. The amount of data in each row and column are shown with the histograms framing this view. The color of each square shows an aggregrate of a third numerical variable that exists on each data value.
@@ -135,7 +134,7 @@ The bar chart in the lower right hand corner of each row of subcharts shows the
 
 
 #### `Current Seleciton of Records for Export`:
-Records can be selected for export from the visualization by brushing over the right and bottom histograms. The area of selected data is indicated by the orange coloring on the main summary view. The
+Records can be selected for export from the visualization by brushing over the right and bottom histograms. The area of selected data is indicated by the orange coloring on the main summary view. The amount of records selected is indicated at the top left for each chart. Selections can be made across multiple charts. The final selection is returned as one dataframe containg all selections.
 
 
 ### 5. Retrieve Selections from Visualization
@@ -146,7 +145,7 @@ df = gp.retrieve_selected_data()
 
 After selecting data by brushing over either the bottom or right histograms associated with a subchart, you can retrieve selected data using the above method.
 
-This will return a pandas
+This will return a pandas DataFrame containing all your subselected rows from the original dataset.
 
 
 
@@ -165,12 +164,6 @@ Below is an example of the kind of data Guidepost works with:
 
 ## API Reference
 
-### `load_data`
-- **Description**: Loads a pandas dataframe into the guidepost system for visualizaiton. Will report data dropped from the dataframe if it contains NaNs, `timedeltas`, `arrays` in cells, or other invalid values.
-- **Arguments**:
-- `in_df` (Pandas Dataframe): The dataframe containing data to be visualized.
-- `supress_warnings` (Boolean): Specifies whether to suppress warnings when loading data. Defaults to `False`
-
 ### `vis_data`
 - **Description**: Holds the vis data to passed to the visualization. Updates to this variable will automatically update the visualization.
 
@@ -187,6 +180,12 @@ Vis configurations must be specified as a python dictonary with the following fi
 - 'facet_by': A categorical variable from the dataset. Automatically looks for 'queue' or 'partition' if this config is not specified.
 
 
+### `load_data(in_df, supress_warnings)`
+- **Description**: Loads a pandas dataframe into the guidepost system for visualizaiton. Will report data dropped from the dataframe if it contains NaNs, `timedeltas`, `arrays` in cells, or other invalid values.
+- **Arguments**:
+- `in_df` (Pandas Dataframe): The dataframe containing data to be visualized.
+- `supress_warnings` (Boolean): Specifies whether to suppress warnings when loading data. Defaults to `False`
+
 
 ### `retrieve_selected_data()`
 - **Description**: Returns selected data back from the visualization.
File without changes