equity-aggregator 0.1.4__py3-none-any.whl → 0.1.6__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -2,7 +2,7 @@
2
2
 
3
3
  ## Overview
4
4
 
5
- The equity aggregator is a sophisticated financial data processing system that aggregates equity information from multiple discovery sources (LSEG, SEC, Stock Analysis, TradingView, XETRA) and enriches it with supplementary data from Yahoo Finance and Intrinio.
5
+ The equity aggregator is a sophisticated financial data processing system that aggregates equity information from multiple discovery sources (LSEG, SEC, Stock Analysis, TradingView, XETRA and Intrinio) and enriches it with supplementary data from Yahoo Finance and the Global LEI Foundation.
6
6
 
7
7
  ## Architecture & Design
8
8
 
@@ -15,6 +15,7 @@ src/equity_aggregator/
15
15
  ├── cli/ # Presentation Layer - User Interface
16
16
  ├── domain/ # Business Logic Layer - Core Domain
17
17
  │ ├── pipeline/ # Aggregation pipeline orchestration
18
+ │ ├── retrieval/ # Canonical equity download and retrieval
18
19
  │ └── _utils/ # Domain-specific utilities
19
20
  ├── adapters/ # Infrastructure Layer - External Integrations
20
21
  │ └── data_sources/ # Data source adapters
@@ -39,7 +40,7 @@ Raw Data Sources → Parse → Convert → Identify → Group → Enrich → Can
39
40
 
40
41
  Orchestrates parallel data fetching from discovery feeds:
41
42
 
42
- - Fetches data from LSEG, SEC, Stock Analysis, TradingView, and XETRA concurrently
43
+ - Fetches data from LSEG, SEC, Stock Analysis, TradingView, XETRA and Intrinio concurrently
43
44
  - Combines all feed data into a single stream for processing
44
45
  - Maintains feed source metadata for downstream processing
45
46
 
@@ -64,14 +65,13 @@ Standardises financial data to USD reference currency:
64
65
  Enriches records with global identification metadata:
65
66
 
66
67
  - Queries OpenFIGI API for FIGI identifiers
67
- - Adds CUSIP, ISIN, and other standard identifiers
68
68
  - Creates globally unique equity identities
69
69
 
70
70
  #### 5. **Group**
71
71
 
72
72
  Groups equities by Share Class FIGI:
73
73
 
74
- - Groups records with identical share_class_figi values
74
+ - Groups records with identical Share Class FIGI values
75
75
  - Preserves all discovery feed source data for later merging
76
76
  - Each group represents the same equity from multiple discovery sources
77
77
  - Yields groups as `list[RawEquity]` for enrichment processing
@@ -80,11 +80,10 @@ Groups equities by Share Class FIGI:
80
80
 
81
81
  Fetches enrichment data and performs comprehensive single merge:
82
82
 
83
- - Extracts representative identifiers from discovery sources using merge algorithms
84
- - Queries enrichment feeds (Yahoo Finance, Intrinio) with these identifiers
83
+ - Fetches representative identifiers from discovery data sources
84
+ - Queries enrichment feeds (Yahoo Finance, Global LEI Foundation) using these identifiers
85
85
  - Performs single merge of all sources (discovery + enrichment) for optimal data quality
86
- - Uses same merge logic for identifiers and final merge (mode for IDs, fuzzy clustering for names, frequency for symbols)
87
- - Applies controlled concurrency to respect API limits
86
+ - Applies controlled concurrency to enrichment feeds to respect API limits
88
87
 
89
88
  #### 7. **Canonicalise**
90
89
 
@@ -102,7 +101,7 @@ The pipeline uses asynchronous operations to process thousands of equity records
102
101
 
103
102
  **Parallel Data Fetching**
104
103
 
105
- - All discovery feeds (LSEG, SEC, Stock Analysis, TradingView, XETRA) are fetched simultaneously
104
+ - All discovery feeds (LSEG, SEC, Stock Analysis, TradingView, XETRA, Intrinio) are fetched simultaneously
106
105
 
107
106
  **Streaming Pipeline**
108
107
 
@@ -110,7 +109,9 @@ The pipeline uses asynchronous operations to process thousands of equity records
110
109
 
111
110
  **Controlled Concurrency**
112
111
 
113
- - External API calls (OpenFIGI, Yahoo Finance, Intrinio) use semaphores to limit concurrent requests and respect rate limits
112
+ - External API calls (OpenFIGI, Yahoo Finance, GLEIF) use semaphores to limit concurrent requests and respect rate limits
113
+ - Each enrichment feed has a configurable concurrency limit
114
+ - Fetch operations include timeout protection to prevent indefinite blocking
114
115
 
115
116
  **Non-blocking Operations**
116
117
 
@@ -145,11 +146,14 @@ schemas/
145
146
  ├── raw.py # RawEquity - intermediate pipeline format
146
147
  ├── canonical.py # CanonicalEquity - final standardised format
147
148
  ├── types.py # Type definitions and validators
149
+ ├── validators.py # Reusable validators for identifiers and financial data
148
150
  └── feeds/ # Feed-specific data models
149
151
  ├── lseg_feed_data.py
150
152
  ├── sec_feed_data.py
151
153
  ├── stock_analysis_feed_data.py
152
154
  ├── tradingview_feed_data.py
155
+ ├── gleif_feed_data.py
156
+ ├── feed_validators.py
153
157
  ├── xetra_feed_data.py
154
158
  ├── yfinance_feed_data.py
155
159
  └── intrinio_feed_data.py
@@ -158,6 +162,7 @@ schemas/
158
162
  ### Critical Role of Schemas
159
163
 
160
164
  #### 1. **Data Validation at Boundaries**
165
+
161
166
  Each feed has a dedicated Pydantic schema that:
162
167
  - Validates incoming data structure and types
163
168
  - Normalises field names and formats
@@ -204,6 +209,7 @@ class LsegFeedData(BaseModel):
204
209
  - **Stock Analysis**: US equities with comprehensive financial metrics
205
210
  - **TradingView**: US equities with comprehensive financial metrics
206
211
  - **XETRA**: Deutsche Börse Stock Exchange
212
+ - **Intrinio**: US financial data API providing company, securities, and real-time quote data
207
213
 
208
214
  **Characteristics**:
209
215
 
@@ -213,12 +219,12 @@ class LsegFeedData(BaseModel):
213
219
  ### Enrichment Feeds (Supplementary Sources)
214
220
 
215
221
  - **Yahoo Finance**: Market data and financial metrics
216
- - **Intrinio**: Market data and financial metrics
222
+ - **Global LEI Foundation**: ISIN->LEI mapping for Legal Entity Identifier enrichment
217
223
 
218
224
  **Characteristics**
219
225
 
220
226
  - Provides additional financial metrics (market cap, analyst ratings, etc.)
221
- - Uses representative identifiers from discovery sources for lookup
227
+ - Uses representative identifiers from discovery sources for look-up
222
228
  - Applied after grouping but before final merge
223
229
 
224
230
  ## Equity Aggregator Components
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: equity-aggregator
3
- Version: 0.1.4
3
+ Version: 0.1.6
4
4
  Summary: Financial equity data aggregation toolkit
5
5
  Project-URL: Homepage, https://github.com/gregorykelleher/equity-aggregator
6
6
  Project-URL: Repository, https://github.com/gregorykelleher/equity-aggregator
@@ -45,7 +45,7 @@ Discovery feeds provide raw equity data from primary market sources:
45
45
  | 🇺🇸 SEC | United States | Securities and Exchange Commission - US-listed equities |
46
46
  | 🇺🇸 Stock Analysis | United States | Stock Analysis - Global equities |
47
47
  | 🇺🇸 TradingView | United States | TradingView - US-listed equities |
48
- | 🇩🇪 XETRA | Germany | Deutsche Börse electronic trading platform - Global listed equities |
48
+ | 🇩🇪 XETRA | International | Deutsche Börse electronic trading platform - Global listed equities |
49
49
 
50
50
  ### Enrichment Feeds
51
51
 
@@ -1,4 +1,4 @@
1
- equity_aggregator/README.md,sha256=hGekh37-V1T6bRqEewh4jTFxJ67mf4-MxgJouXRW1bY,8496
1
+ equity_aggregator/README.md,sha256=5yKFsEHyv3OJw2es3d6so410XftO8FJVuRQTTxyHXYY,8890
2
2
  equity_aggregator/__init__.py,sha256=hGyCpePTRYSkNJ3sNCV7rqi4sgcFxouSEykzJ0aq4Ek,264
3
3
  equity_aggregator/logging_config.py,sha256=fpokfpRZ-RIQaxzE53lVuszsA0zCTvUFdc3fbTW_MzQ,3339
4
4
  equity_aggregator/adapters/__init__.py,sha256=9GinsOLyA8usXVMiRtzshviV7QPnuRalwaG4aulDLEs,899
@@ -96,8 +96,8 @@ equity_aggregator/storage/cache.py,sha256=pZ2aFuBthuK3WfDjq6YnFOIMg8VqeAfTbnS5Vk
96
96
  equity_aggregator/storage/data_store.py,sha256=ZINcde7b5T2DQDbV8MiNu8vZ0VZrv0ZGhpvU5L1ujFs,5099
97
97
  equity_aggregator/storage/export.py,sha256=QhccHYUH9eDwb3eRijiJiarjoWIPwS2XMQT9yHn7XWw,5537
98
98
  equity_aggregator/storage/metadata.py,sha256=bCjPTKVBTmndhCoDJ3vUxau5Jr_cG6PEkiBbNKorYyE,3095
99
- equity_aggregator-0.1.4.dist-info/METADATA,sha256=2eqr8FiRJXypv1WidCHP0UCxCE7-M-qP-yqkgdrxWu8,27668
100
- equity_aggregator-0.1.4.dist-info/WHEEL,sha256=WLgqFyCfm_KASv4WHyYy0P3pM_m7J5L9k2skdKLirC8,87
101
- equity_aggregator-0.1.4.dist-info/entry_points.txt,sha256=pedEbEhWbz-9HIUpN8KTALxu94qKn9jxgrn08AvHb6c,70
102
- equity_aggregator-0.1.4.dist-info/licenses/LICENCE.txt,sha256=lyiWH8xJzlkFXkZpQf2_u8KtlJiPrVCczrZEZNyqSrg,1063
103
- equity_aggregator-0.1.4.dist-info/RECORD,,
99
+ equity_aggregator-0.1.6.dist-info/METADATA,sha256=_SWA_aEfWIJRhV5RIkFxhpteyOY6ogqe1qh-_ZWs-XI,27674
100
+ equity_aggregator-0.1.6.dist-info/WHEEL,sha256=WLgqFyCfm_KASv4WHyYy0P3pM_m7J5L9k2skdKLirC8,87
101
+ equity_aggregator-0.1.6.dist-info/entry_points.txt,sha256=pedEbEhWbz-9HIUpN8KTALxu94qKn9jxgrn08AvHb6c,70
102
+ equity_aggregator-0.1.6.dist-info/licenses/LICENCE.txt,sha256=lyiWH8xJzlkFXkZpQf2_u8KtlJiPrVCczrZEZNyqSrg,1063
103
+ equity_aggregator-0.1.6.dist-info/RECORD,,