pandas-survey-toolkit 1.0.4__py3-none-any.whl → 1.0.10__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,73 +1,77 @@
1
- Metadata-Version: 2.4
2
- Name: pandas-survey-toolkit
3
- Version: 1.0.4
4
- Summary: A pandas extension for survey analysis
5
- Home-page: https://github.com/Andy7475/pandas-survey-toolkit
6
- Author: Andy Laing
7
- Author-email: andylaing5@gmail.com
8
- License: MIT
9
- Project-URL: Bug Tracker, https://github.com/Andy7475/pandas-survey-toolkit/issues
10
- Classifier: Programming Language :: Python :: 3
11
- Classifier: License :: OSI Approved :: MIT License
12
- Classifier: Operating System :: OS Independent
13
- Requires-Python: >=3.8
14
- Description-Content-Type: text/markdown
15
- License-File: LICENSE
16
- Requires-Dist: sentence-transformers>=3.0
17
- Requires-Dist: umap-learn>=0.5
18
- Requires-Dist: scikit-learn>=1.3
19
- Requires-Dist: pandas>=2.0.3
20
- Requires-Dist: numpy==1.24.3
21
- Requires-Dist: pandas-flavor>=0.6.0
22
- Requires-Dist: spacy>=3.0.0
23
- Requires-Dist: gensim>=4.0.0
24
- Requires-Dist: altair>=5.0.0
25
- Dynamic: author
26
- Dynamic: author-email
27
- Dynamic: classifier
28
- Dynamic: description
29
- Dynamic: description-content-type
30
- Dynamic: home-page
31
- Dynamic: license
32
- Dynamic: license-file
33
- Dynamic: project-url
34
- Dynamic: requires-dist
35
- Dynamic: requires-python
36
- Dynamic: summary
37
-
38
- # Faster and more Insightful analysis of survey results
39
-
40
- This package lets you apply advanced Natural Language Processing (NLP) and Machine Learning functions on survey results directly within a dataframe.
41
-
42
- It fills a gap where many NLP packages (like spacy, genism, sentence_transformers) are not designed for data in a spreadsheet (and therefore imported into a dataframe), and many of the people who are tasked with analysing survey results are often not data scientists.
43
-
44
- For example, to extract the sentiment you can just type:
45
-
46
- df.extract_sentiment(input_column="survey-comments")
47
-
48
- It will abstract away a lot of the data transformation pipeline to give you useful functionality with minimal code.
49
-
50
- # Examples
51
- See Read-the-docs for simple example notebooks. There are more detailed notebooks in the repo under notebooks/
52
-
53
- # Functionality
54
-
55
- ## Clustering comments
56
- It will group similar free-text comments together and assign a cluster ID. This is a useful step prior to any qualitative analysis.
57
-
58
- ## Sentiment Analysis
59
- It will measure the sentiment in terms or postive / neutral / negative and assign a score for each of those parts, picking the highest scoring as the most likely overall sentiment.
60
-
61
- ## Topic analysis
62
- Involves TFIDF and word co-occurence to gain some high level insights into the likely topics
63
-
64
- ## Clustering likert questions (or other responses)
65
- For strongly disagree ... neutral ... strong agree type responses, it will groups all those questions together to identity groups of respondents within your survey data. This can be much more useful than overall averages across the survey.
66
-
67
- ## Visualisation
68
- Functions to help make sense of the clusters and topics you have identified using the above functions (in development)
69
-
70
- ## Setup
71
- If sentence transformers throws dll errors: https://stackoverflow.com/questions/78484297/c-torch-lib-fbgemm-dll-or-one-of-its-dependencies/78794748#78794748
72
-
73
-
1
+ Metadata-Version: 2.4
2
+ Name: pandas-survey-toolkit
3
+ Version: 1.0.10
4
+ Summary: A pandas extension for survey analysis
5
+ Home-page: https://github.com/Andy7475/pandas-survey-toolkit
6
+ Author: Andy Laing
7
+ Author-email: andylaing5@gmail.com
8
+ License: MIT
9
+ Project-URL: Bug Tracker, https://github.com/Andy7475/pandas-survey-toolkit/issues
10
+ Classifier: Programming Language :: Python :: 3
11
+ Classifier: License :: OSI Approved :: MIT License
12
+ Classifier: Operating System :: OS Independent
13
+ Requires-Python: >=3.11
14
+ Description-Content-Type: text/markdown
15
+ License-File: LICENSE
16
+ Requires-Dist: torch<3.0,>=2.1.0
17
+ Requires-Dist: sentence-transformers<4.0,>=3.0
18
+ Requires-Dist: umap-learn<1.0,>=0.5
19
+ Requires-Dist: scikit-learn<2.0,>=1.3
20
+ Requires-Dist: pandas<3.0,>=2.2.0
21
+ Requires-Dist: numpy<2.0,>=1.26.0
22
+ Requires-Dist: pandas-flavor<1.0,>=0.6.0
23
+ Requires-Dist: spacy<4.0,>=3.0.0
24
+ Requires-Dist: gensim<5.0,>=4.0.0
25
+ Requires-Dist: altair<5.0,>=4.0.0
26
+ Requires-Dist: matplotlib<4.0,>=3.0.0
27
+ Requires-Dist: pyvis<1.0,>=0.3.2
28
+ Dynamic: author
29
+ Dynamic: author-email
30
+ Dynamic: classifier
31
+ Dynamic: description
32
+ Dynamic: description-content-type
33
+ Dynamic: home-page
34
+ Dynamic: license
35
+ Dynamic: license-file
36
+ Dynamic: project-url
37
+ Dynamic: requires-dist
38
+ Dynamic: requires-python
39
+ Dynamic: summary
40
+
41
+ # Faster and more Insightful analysis of survey results
42
+
43
+ This package lets you apply advanced Natural Language Processing (NLP) and Machine Learning functions on survey results directly within a dataframe.
44
+
45
+ It fills a gap where many NLP packages (like spacy, genism, sentence_transformers) are not designed for data in a spreadsheet (and therefore imported into a dataframe), and many of the people who are tasked with analysing survey results are often not data scientists.
46
+
47
+ For example, to extract the sentiment you can just type:
48
+
49
+ df.extract_sentiment(input_column="survey-comments")
50
+
51
+ It will abstract away a lot of the data transformation pipeline to give you useful functionality with minimal code.
52
+
53
+ # Examples
54
+
55
+ See [ReadTheDocs](https://pandas-survey-toolkit.readthedocs.io/en/latest/) for simple example notebooks. There are more detailed notebooks in the repo under notebooks/
56
+
57
+ # Functionality
58
+
59
+ ## Clustering comments
60
+ It will group similar free-text comments together and assign a cluster ID. This is a useful step prior to any qualitative analysis.
61
+
62
+ ## Sentiment Analysis
63
+ It will measure the sentiment in terms or postive / neutral / negative and assign a score for each of those parts, picking the highest scoring as the most likely overall sentiment.
64
+
65
+ ## Topic analysis
66
+ Involves TFIDF and word co-occurence to gain some high level insights into the likely topics
67
+
68
+ ## Clustering likert questions (or other responses)
69
+ For strongly disagree ... neutral ... strong agree type responses, it will groups all those questions together to identity groups of respondents within your survey data. This can be much more useful than overall averages across the survey.
70
+
71
+ ## Visualisation
72
+ Functions to help make sense of the clusters and topics you have identified using the above functions (in development)
73
+
74
+ ## Setup
75
+ If sentence transformers throws dll errors: https://stackoverflow.com/questions/78484297/c-torch-lib-fbgemm-dll-or-one-of-its-dependencies/78794748#78794748
76
+
77
+
@@ -0,0 +1,10 @@
1
+ pandas_survey_toolkit/__init__.py,sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU,0
2
+ pandas_survey_toolkit/analytics.py,sha256=N2YWmvKYHN3x9qEw48aePLnY_sfKqqrT_w7eHMPWwps,5995
3
+ pandas_survey_toolkit/nlp.py,sha256=SLbEsw1d8aJt2HbnjHU4d87Dj62NAq2ZG7Ued9DxCgk,34279
4
+ pandas_survey_toolkit/utils.py,sha256=l-qIf_WkzavDeO9TTMy2bUs271udVoI_5oGeVCZ-te0,3978
5
+ pandas_survey_toolkit/vis.py,sha256=6XQfWwGodIhWPqLScm7h7xe8ux897sMdJ33-TSY7y2E,7349
6
+ pandas_survey_toolkit-1.0.10.dist-info/licenses/LICENSE,sha256=N1vp-xmMExKVeJ3ppjmIBJXFkBN8YXoJs-n7mJJtCyk,1065
7
+ pandas_survey_toolkit-1.0.10.dist-info/METADATA,sha256=aBfiitYfI0F21V4EKGQsDvRuwmWkEJGFvHWTL67Wvh0,3224
8
+ pandas_survey_toolkit-1.0.10.dist-info/WHEEL,sha256=_zCd3N1l69ArxyTb8rzEoP9TpbYXkqRFSNOD5OuxnTs,91
9
+ pandas_survey_toolkit-1.0.10.dist-info/top_level.txt,sha256=LBMaptAHxmWRzCE08eJUvsL21kLLHQ4lJleja3_zpYM,22
10
+ pandas_survey_toolkit-1.0.10.dist-info/RECORD,,
@@ -1,5 +1,5 @@
1
1
  Wheel-Version: 1.0
2
- Generator: setuptools (78.1.0)
2
+ Generator: setuptools (80.9.0)
3
3
  Root-Is-Purelib: true
4
4
  Tag: py3-none-any
5
5
 
@@ -1,21 +1,21 @@
1
- MIT License
2
-
3
- Copyright (c) 2024 Andy7475
4
-
5
- Permission is hereby granted, free of charge, to any person obtaining a copy
6
- of this software and associated documentation files (the "Software"), to deal
7
- in the Software without restriction, including without limitation the rights
8
- to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
- copies of the Software, and to permit persons to whom the Software is
10
- furnished to do so, subject to the following conditions:
11
-
12
- The above copyright notice and this permission notice shall be included in all
13
- copies or substantial portions of the Software.
14
-
15
- THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
- IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
- FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
- AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
- LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
- OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21
- SOFTWARE.
1
+ MIT License
2
+
3
+ Copyright (c) 2024 Andy7475
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in all
13
+ copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21
+ SOFTWARE.
@@ -1,10 +0,0 @@
1
- pandas_survey_toolkit/__init__.py,sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU,0
2
- pandas_survey_toolkit/analytics.py,sha256=UP-7SU-NJqw7Njq7jA3FaS0XaodmFf-Mnn2uMv1sy1w,5003
3
- pandas_survey_toolkit/nlp.py,sha256=myGZRnhA9RbCqUVtRBHDH4Ra0XTUeLQ1HOQoV_LRAcY,31765
4
- pandas_survey_toolkit/utils.py,sha256=WOXKN9GzcTl-ZUtXBJYxOHNQeul852ZU6F6L7jrlTjc,3554
5
- pandas_survey_toolkit/vis.py,sha256=3FimMnb3XlYz4PW-11Nbub5uMeEkzVHfjBNxQakJRsA,25801
6
- pandas_survey_toolkit-1.0.4.dist-info/licenses/LICENSE,sha256=g9daX7xNrlzl5Qj2TdJjJ1MJKKDvGsMmmudshsykG7A,1086
7
- pandas_survey_toolkit-1.0.4.dist-info/METADATA,sha256=dThNtqyxh0j6z-z2370m9FbxvXw2_d--1E49Mk2-Nhg,3088
8
- pandas_survey_toolkit-1.0.4.dist-info/WHEEL,sha256=CmyFI0kx5cdEMTLiONQRbGQwjIoR1aIYB7eCAQ4KPJ0,91
9
- pandas_survey_toolkit-1.0.4.dist-info/top_level.txt,sha256=LBMaptAHxmWRzCE08eJUvsL21kLLHQ4lJleja3_zpYM,22
10
- pandas_survey_toolkit-1.0.4.dist-info/RECORD,,