gdeltnews 1.0.0__tar.gz → 1.0.1__tar.gz
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/PKG-INFO +15 -10
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/README.md +14 -9
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/pyproject.toml +1 -1
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/src/gdeltnews.egg-info/PKG-INFO +15 -10
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/LICENSE +0 -0
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/setup.cfg +0 -0
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/src/gdeltnews/__init__.py +0 -0
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/src/gdeltnews/download.py +0 -0
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/src/gdeltnews/filtermerge.py +0 -0
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/src/gdeltnews/reconstruct.py +0 -0
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/src/gdeltnews/wordmatch.py +0 -0
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/src/gdeltnews.egg-info/SOURCES.txt +0 -0
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/src/gdeltnews.egg-info/dependency_links.txt +0 -0
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/src/gdeltnews.egg-info/requires.txt +0 -0
- {gdeltnews-1.0.0 → gdeltnews-1.0.1}/src/gdeltnews.egg-info/top_level.txt +0 -0
@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: gdeltnews
-Version: 1.0.0
+Version: 1.0.1
 Summary: Reconstruct full-text news articles from GDELT Web News NGrams 3.0
 Author: Andrea Fronzetti Colladon, Roberto Vestrelli
 License: GNU GENERAL PUBLIC LICENSE
@@ -708,15 +708,21 @@ http://data.gdeltproject.org/gdeltv3/webngrams/20250316000100.webngrams.json.gz
 
 Reconstruction quality depends on the n-gram fragments available in the dataset.
 
-##
+## Docs
+
+This package documentation is available [here](https://iandreafc.github.io/gdeltnews/), and a more detailed explanation of the functions’ logic is provided in the accompanying [paper](https://doi.org/10.3390/bdcc10020045).
+
+## GUI Version
+If you prefer to use a **software with a graphical user interface** that runs this code, you can find it [here](https://github.com/iandreafc/gdeltnews/tree/main/GUI) and read the [instructions here](https://iandreafc.github.io/gdeltnews/gui).
+
+## Python Package Quickstart
+
+### Install
 
 ```bash
 pip install gdeltnews
 ```
 
-## Quickstart and Docs
-The package is documented [here](https://iandreafc.github.io/gdeltnews/).
-
 ### Step 1: Download Web NGrams files
 
 ```bash
@@ -734,6 +740,7 @@ download(
 Multiprocessing can be problematic inside notebooks. Run this from a `.py` script.
 
 ```bash
+from multiprocessing import freeze_support
 from gdeltnews.reconstruct import reconstruct
 
 def main():
@@ -746,6 +753,7 @@ def main():
     )
 
 if __name__ == "__main__":
+    freeze_support() # important on Windows
     main()
 ```
 
@@ -763,12 +771,9 @@ filtermerge(
 
 Advanced users can pre-filter and download GDELT data via Google BigQuery, then process it directly with `wordmatch.py`.
 
-## Citation
+## Citation and Credits
 
 If you use this package for research, please cite:
-
-A. Fronzetti Colladon, R. Vestrelli (2025). “A Python Tool for Reconstructing Full News Text from GDELT.” [https://arxiv.org/abs/2504.16063](https://arxiv.org/abs/2504.16063)
-
-## Credits
+Fronzetti Colladon, A., & Vestrelli, R. (2026). Free Access to World News: Reconstructing Full-Text Articles from GDELT. Big Data and Cognitive Computing, 10(2), 45. [https://doi.org/10.3390/bdcc10020045](https://doi.org/10.3390/bdcc10020045)
 
 Code co-developed with [robves99](https://github.com/robves99).
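The hunks above add `freeze_support()` under the `if __name__ == "__main__":` guard of the README's reconstruction script. As a minimal sketch of why that pattern is used, the script below runs a small process pool behind the same guard. It does not call `gdeltnews`; `count_tokens` and the sample strings are hypothetical stand-ins for the real per-file work.

```python
from multiprocessing import Pool, freeze_support


def count_tokens(text: str) -> int:
    """Hypothetical worker standing in for any task run in a child process."""
    return len(text.split())


def main() -> list[int]:
    samples = ["full text news", "web ngrams", "gdelt"]
    # With the "spawn" start method (the default on Windows), each child
    # process re-imports this module. Keeping the pool inside main(), which is
    # only called under the guard below, stops children from starting pools
    # of their own.
    with Pool(processes=2) as pool:
        return pool.map(count_tokens, samples)


if __name__ == "__main__":
    # Has an effect only when the script is frozen into an executable;
    # when run normally by the interpreter it does nothing.
    freeze_support()
    print(main())
```

Saved as a `.py` file and run from a terminal, this follows the same layout as the README snippet: imports at the top, work inside `main()`, and process creation only behind the guard.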
@@ -15,15 +15,21 @@ http://data.gdeltproject.org/gdeltv3/webngrams/20250316000100.webngrams.json.gz
 
 Reconstruction quality depends on the n-gram fragments available in the dataset.
 
-##
+## Docs
+
+This package documentation is available [here](https://iandreafc.github.io/gdeltnews/), and a more detailed explanation of the functions’ logic is provided in the accompanying [paper](https://doi.org/10.3390/bdcc10020045).
+
+## GUI Version
+If you prefer to use a **software with a graphical user interface** that runs this code, you can find it [here](https://github.com/iandreafc/gdeltnews/tree/main/GUI) and read the [instructions here](https://iandreafc.github.io/gdeltnews/gui).
+
+## Python Package Quickstart
+
+### Install
 
 ```bash
 pip install gdeltnews
 ```
 
-## Quickstart and Docs
-The package is documented [here](https://iandreafc.github.io/gdeltnews/).
-
 ### Step 1: Download Web NGrams files
 
 ```bash
@@ -41,6 +47,7 @@ download(
 Multiprocessing can be problematic inside notebooks. Run this from a `.py` script.
 
 ```bash
+from multiprocessing import freeze_support
 from gdeltnews.reconstruct import reconstruct
 
 def main():
@@ -53,6 +60,7 @@ def main():
     )
 
 if __name__ == "__main__":
+    freeze_support() # important on Windows
     main()
 ```
 
@@ -70,12 +78,9 @@ filtermerge(
 
 Advanced users can pre-filter and download GDELT data via Google BigQuery, then process it directly with `wordmatch.py`.
 
-## Citation
+## Citation and Credits
 
 If you use this package for research, please cite:
-
-A. Fronzetti Colladon, R. Vestrelli (2025). “A Python Tool for Reconstructing Full News Text from GDELT.” [https://arxiv.org/abs/2504.16063](https://arxiv.org/abs/2504.16063)
-
-## Credits
+Fronzetti Colladon, A., & Vestrelli, R. (2026). Free Access to World News: Reconstructing Full-Text Articles from GDELT. Big Data and Cognitive Computing, 10(2), 45. [https://doi.org/10.3390/bdcc10020045](https://doi.org/10.3390/bdcc10020045)
 
 Code co-developed with [robves99](https://github.com/robves99).
@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: gdeltnews
-Version: 1.0.0
+Version: 1.0.1
 Summary: Reconstruct full-text news articles from GDELT Web News NGrams 3.0
 Author: Andrea Fronzetti Colladon, Roberto Vestrelli
 License: GNU GENERAL PUBLIC LICENSE
@@ -708,15 +708,21 @@ http://data.gdeltproject.org/gdeltv3/webngrams/20250316000100.webngrams.json.gz
 
 Reconstruction quality depends on the n-gram fragments available in the dataset.
 
-##
+## Docs
+
+This package documentation is available [here](https://iandreafc.github.io/gdeltnews/), and a more detailed explanation of the functions’ logic is provided in the accompanying [paper](https://doi.org/10.3390/bdcc10020045).
+
+## GUI Version
+If you prefer to use a **software with a graphical user interface** that runs this code, you can find it [here](https://github.com/iandreafc/gdeltnews/tree/main/GUI) and read the [instructions here](https://iandreafc.github.io/gdeltnews/gui).
+
+## Python Package Quickstart
+
+### Install
 
 ```bash
 pip install gdeltnews
 ```
 
-## Quickstart and Docs
-The package is documented [here](https://iandreafc.github.io/gdeltnews/).
-
 ### Step 1: Download Web NGrams files
 
 ```bash
@@ -734,6 +740,7 @@ download(
 Multiprocessing can be problematic inside notebooks. Run this from a `.py` script.
 
 ```bash
+from multiprocessing import freeze_support
 from gdeltnews.reconstruct import reconstruct
 
 def main():
@@ -746,6 +753,7 @@ def main():
     )
 
 if __name__ == "__main__":
+    freeze_support() # important on Windows
     main()
 ```
 
@@ -763,12 +771,9 @@ filtermerge(
 
 Advanced users can pre-filter and download GDELT data via Google BigQuery, then process it directly with `wordmatch.py`.
 
-## Citation
+## Citation and Credits
 
 If you use this package for research, please cite:
-
-A. Fronzetti Colladon, R. Vestrelli (2025). “A Python Tool for Reconstructing Full News Text from GDELT.” [https://arxiv.org/abs/2504.16063](https://arxiv.org/abs/2504.16063)
-
-## Credits
+Fronzetti Colladon, A., & Vestrelli, R. (2026). Free Access to World News: Reconstructing Full-Text Articles from GDELT. Big Data and Cognitive Computing, 10(2), 45. [https://doi.org/10.3390/bdcc10020045](https://doi.org/10.3390/bdcc10020045)
 
 Code co-developed with [robves99](https://github.com/robves99).
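The only metadata field that changes in `PKG-INFO` is `Version`. Since `PKG-INFO` uses email-style `Key: value` headers, a minimal sketch for reading that field with the standard library could look like the following. The `PKG_INFO_HEADER` string is an abbreviated copy of the header shown in the diff, and `read_version` is a hypothetical helper name, not part of `gdeltnews`.

```python
from email.parser import Parser

# Abbreviated copy of the header fields shown in the PKG-INFO diff.
PKG_INFO_HEADER = """\
Metadata-Version: 2.4
Name: gdeltnews
Version: 1.0.1
Summary: Reconstruct full-text news articles from GDELT Web News NGrams 3.0
"""


def read_version(pkg_info_text: str) -> str:
    """Return the Version field from PKG-INFO-style metadata text."""
    # headersonly=True stops parsing at the first blank line, so the long
    # description that follows the headers is not processed.
    headers = Parser().parsestr(pkg_info_text, headersonly=True)
    return headers["Version"]


if __name__ == "__main__":
    print(read_version(PKG_INFO_HEADER))
```

For an installed copy of the package, `importlib.metadata.version("gdeltnews")` reads the same field without parsing the file by hand.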
File without changes