pystylometry 1.1.0__py3-none-any.whl → 1.3.0__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- pystylometry/README.md +42 -0
- pystylometry/__init__.py +17 -1
- pystylometry/_types.py +54 -0
- pystylometry/authorship/README.md +21 -0
- pystylometry/authorship/__init__.py +9 -6
- pystylometry/authorship/additional_methods.py +262 -17
- pystylometry/authorship/compression.py +175 -0
- pystylometry/authorship/kilgarriff.py +8 -1
- pystylometry/character/README.md +17 -0
- pystylometry/consistency/README.md +27 -0
- pystylometry/dialect/README.md +26 -0
- pystylometry/lexical/README.md +23 -0
- pystylometry/ngrams/README.md +18 -0
- pystylometry/ngrams/extended_ngrams.py +314 -69
- pystylometry/prosody/README.md +17 -0
- pystylometry/prosody/rhythm_prosody.py +773 -11
- pystylometry/readability/README.md +23 -0
- pystylometry/stylistic/README.md +20 -0
- pystylometry/stylistic/cohesion_coherence.py +669 -13
- pystylometry/stylistic/genre_register.py +1560 -17
- pystylometry/stylistic/markers.py +611 -17
- pystylometry/stylistic/vocabulary_overlap.py +354 -13
- pystylometry/syntactic/README.md +20 -0
- pystylometry/viz/README.md +27 -0
- pystylometry-1.3.0.dist-info/METADATA +136 -0
- {pystylometry-1.1.0.dist-info → pystylometry-1.3.0.dist-info}/RECORD +28 -15
- pystylometry-1.1.0.dist-info/METADATA +0 -278
- {pystylometry-1.1.0.dist-info → pystylometry-1.3.0.dist-info}/WHEEL +0 -0
- {pystylometry-1.1.0.dist-info → pystylometry-1.3.0.dist-info}/entry_points.txt +0 -0
|
@@ -0,0 +1,23 @@
|
|
|
1
|
+
# readability
|
|
2
|
+
|
|
3
|
+

|
|
4
|
+

|
|
5
|
+
|
|
6
|
+
Text readability scoring using established formulas from educational and linguistic research.
|
|
7
|
+
|
|
8
|
+
## Catalogue
|
|
9
|
+
|
|
10
|
+
| File | Functions | Formula |
|
|
11
|
+
|------|-----------|---------|
|
|
12
|
+
| `flesch.py` | `compute_flesch` | Flesch Reading Ease & Flesch-Kincaid Grade Level |
|
|
13
|
+
| `gunning_fog.py` | `compute_gunning_fog` | Gunning Fog Index (complex word ratio) |
|
|
14
|
+
| `coleman_liau.py` | `compute_coleman_liau` | Coleman-Liau Index (character-based) |
|
|
15
|
+
| `ari.py` | `compute_ari` | Automated Readability Index |
|
|
16
|
+
| `smog.py` | `compute_smog` | SMOG Grade (polysyllabic word count) |
|
|
17
|
+
| `additional_formulas.py` | `compute_dale_chall`, `compute_fry`, `compute_forcast`, `compute_linsear_write`, `compute_powers_sumner_kearl` | Dale-Chall, Fry Graph, FORCAST, Linsear Write, Powers-Sumner-Kearl |
|
|
18
|
+
| `syllables.py` | _(internal)_ | Syllable counting engine |
|
|
19
|
+
| `complex_words.py` | _(internal)_ | Complex word detection heuristics |
|
|
20
|
+
|
|
21
|
+
## See Also
|
|
22
|
+
|
|
23
|
+
- [`_normalize.py`](../_normalize.py) for text normalization applied before readability scoring
|
|
@@ -0,0 +1,20 @@
|
|
|
1
|
+
# stylistic
|
|
2
|
+
|
|
3
|
+

|
|
4
|
+

|
|
5
|
+
|
|
6
|
+
Style markers, vocabulary overlap, cohesion/coherence, and genre/register classification.
|
|
7
|
+
|
|
8
|
+
## Catalogue
|
|
9
|
+
|
|
10
|
+
| File | Function | What It Measures |
|
|
11
|
+
|------|----------|-----------------|
|
|
12
|
+
| `markers.py` | `compute_stylistic_markers` | Contractions, intensifiers, hedges, modals, negation, punctuation style |
|
|
13
|
+
| `vocabulary_overlap.py` | `compute_vocabulary_overlap` | Jaccard, Dice, Cosine similarity, KL divergence, overlap coefficient |
|
|
14
|
+
| `cohesion_coherence.py` | `compute_cohesion_coherence` | Referential cohesion, connectives, coherence measures |
|
|
15
|
+
| `genre_register.py` | `compute_genre_register` | Formality scoring, register classification, genre prediction |
|
|
16
|
+
|
|
17
|
+
## See Also
|
|
18
|
+
|
|
19
|
+
- [`lexical/function_words.py`](../lexical/) for function word distributions (complements marker analysis)
|
|
20
|
+
- [`dialect/`](../dialect/) for regional variant detection (British vs. American)
|