@yottagraph-app/data-model-skill 0.0.33 → 0.0.35
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/package.json +1 -1
- package/skill/fjcidb/DATA_DICTIONARY.md +178 -0
- package/skill/fjcidb/schema.yaml +206 -0
- package/skill/sbaloans/DATA_DICTIONARY.md +240 -0
- package/skill/sbaloans/schema.yaml +380 -0
package/package.json
CHANGED
|
@@ -0,0 +1,178 @@
|
|
|
1
|
+
# Data Dictionary: FJC IDB (Federal Judicial Center — Civil Cases)
|
|
2
|
+
|
|
3
|
+
Last updated: 2026-05-20
|
|
4
|
+
|
|
5
|
+
## Source Overview
|
|
6
|
+
|
|
7
|
+
The **Federal Judicial Center Integrated Database (IDB)** provides administrative statistics on civil cases filed in U.S. district courts. Lovelace ingests the **civil SAS extract** published on the FJC site (annual FY files such as `cv26.sas7bdat`), not PACER dockets. Each SAS row is one civil case with coded fields for court location, docket number, party labels, nature of suit, and disposition.
|
|
8
|
+
|
|
9
|
+
The streamer uses a **diffing** pipeline: it materializes the SAS file (HTTP download or local path), parses rows in chunks, stores a normalized JSON snapshot per case under the raw store (`fjcidb/download/{case-id}.json`), and republishes atom batches only when a row’s JSON changes.
|
|
10
|
+
|
|
11
|
+
| Item | Value |
|
|
12
|
+
|------|--------|
|
|
13
|
+
| Stream source constant | `fjcidb-source` |
|
|
14
|
+
| `Record.Source` | `fjcidb` |
|
|
15
|
+
| Default dataset URL | FJC FY civil SAS (see FJC civil cases landing page) |
|
|
16
|
+
| Poll cadence | Configurable (`pollTimeMin`); typical dev runs use a large interval or one-shot via fetcheval |
|
|
17
|
+
|
|
18
|
+
**Data quality notes**
|
|
19
|
+
|
|
20
|
+
- Party fields **PLT** and **DEF** are short text labels (often truncated). They are not full party rosters; numeric values mean “count of plaintiffs/defendants” rather than a name.
|
|
21
|
+
- Person vs organization for textual parties uses **regex heuristics** on the label (see `party_regex.go`), not LLM classification in the default ingest path.
|
|
22
|
+
- Code values (district, disposition, etc.) follow the FJC codebook; this dictionary documents KG mapping, not codebook semantics.
|
|
23
|
+
|
|
24
|
+
---
|
|
25
|
+
|
|
26
|
+
## Entity Types
|
|
27
|
+
|
|
28
|
+
### `legal_case`
|
|
29
|
+
|
|
30
|
+
One civil case in the IDB extract, identified by district, office, docket, and filing year.
|
|
31
|
+
|
|
32
|
+
- **Subject name:** Human-readable label including `fjcidb_case_id`.
|
|
33
|
+
- **Strong id:** `fjcidb_case_id` on the case subject.
|
|
34
|
+
- **Resolver:** `NOT_MERGEABLE` — passive administrative case node.
|
|
35
|
+
- **Timestamp:** Atomization time (microseconds) for the ingest pass.
|
|
36
|
+
|
|
37
|
+
### `person`
|
|
38
|
+
|
|
39
|
+
An individual named on the plaintiff or defendant side when the IDB field is **textual** (not digits-only) and classified as a person by regex rules.
|
|
40
|
+
|
|
41
|
+
- **Subject name:** Normalized party display (trailing `, ET AL` removed from the raw field).
|
|
42
|
+
- **Property:** `name` (normalized party label from PLT/DEF).
|
|
43
|
+
- **Resolver:** `MERGEABLE` named entity; lawsuit-context snippet on the record (who sues whom; NOS phrase included).
|
|
44
|
+
- **Examples:** `SHANKS`, `BECERRA`, `BIDEN` (with or without `, ET AL`).
|
|
45
|
+
|
|
46
|
+
### `organization`
|
|
47
|
+
|
|
48
|
+
An institution or collective named on the plaintiff or defendant side when the field is textual and classified as organization (regex), or when classification is ambiguous and defaults to organization.
|
|
49
|
+
|
|
50
|
+
- **Subject name:** Same normalization as person parties.
|
|
51
|
+
- **Property:** `name` (normalized party label from PLT/DEF).
|
|
52
|
+
- **Resolver:** `MERGEABLE` named entity; same lawsuit snippet pattern as person parties.
|
|
53
|
+
- **Examples:** `DEPARTMENT OF DEFENSE`, `CUMMINS INC.`, `INTERNATIONAL UNION OF , ET AL`.
|
|
54
|
+
|
|
55
|
+
### `nature_of_suit`
|
|
56
|
+
|
|
57
|
+
A federal civil **nature of suit (NOS)** code from the U.S. Courts classification, linked from the case’s **NOS** field.
|
|
58
|
+
|
|
59
|
+
- **Subject name:** Short title when known, e.g. `Employment`; otherwise `Nature of suit {code} (federal civil)`.
|
|
60
|
+
- **Strong id:** `nos_code`.
|
|
61
|
+
- **Resolver:** `NOT_MERGEABLE`.
|
|
62
|
+
- **Reference:** U.S. Courts civil NOS code descriptions PDF (titles/descriptions embedded in ingest).
|
|
63
|
+
|
|
64
|
+
---
|
|
65
|
+
|
|
66
|
+
## Properties
|
|
67
|
+
|
|
68
|
+
### Legal case
|
|
69
|
+
|
|
70
|
+
* `fjcidb_case_id`
|
|
71
|
+
* Definition: Stable case identifier for this IDB row.
|
|
72
|
+
* Examples: `3-1-12345-2025`, `90-1-2303817-2023`
|
|
73
|
+
* Derivation: `{DISTRICT}-{OFFICE}-{DOCKET}-{year}` where year comes from `FILEDATE`, or from `FDATEUSE` when `FILEDATE` is absent. Row omitted if year cannot be determined.
|
|
74
|
+
|
|
75
|
+
* `district_code`
|
|
76
|
+
* Definition: FJC district court code (`DISTRICT`).
|
|
77
|
+
* Examples: `3`, `90`
|
|
78
|
+
* Derivation: SAS `DISTRICT`.
|
|
79
|
+
|
|
80
|
+
* `office_code`
|
|
81
|
+
* Definition: FJC office within the district (`OFFICE`).
|
|
82
|
+
* Examples: `1`
|
|
83
|
+
* Derivation: SAS `OFFICE`.
|
|
84
|
+
|
|
85
|
+
* `case_docket_number`
|
|
86
|
+
* Definition: Court docket number (`DOCKET`).
|
|
87
|
+
* Examples: `12345`, `2303817`
|
|
88
|
+
* Derivation: SAS `DOCKET`.
|
|
89
|
+
|
|
90
|
+
* `case_filing_date`
|
|
91
|
+
* Definition: Filing date as stored in the extract.
|
|
92
|
+
* Derivation: SAS `FILEDATE`.
|
|
93
|
+
|
|
94
|
+
* `termination_date`
|
|
95
|
+
* Definition: Termination date when present.
|
|
96
|
+
* Derivation: SAS `TERMDATE`.
|
|
97
|
+
|
|
98
|
+
* `origin_code`, `jurisdiction_code`, `disposition_code`, `class_action_code`, `procedural_progress_code`
|
|
99
|
+
* Definition: FJC codebook fields for procedural status.
|
|
100
|
+
* Derivation: SAS `ORIGIN`, `JURIS`, `DISP`, `CLASSACT`, `PROCPROG` when non-empty.
|
|
101
|
+
|
|
102
|
+
### Person and organization (shared)
|
|
103
|
+
|
|
104
|
+
* `name`
|
|
105
|
+
* Definition: Normalized party label from PLT or DEF.
|
|
106
|
+
* Examples: `SHANKS`, `INTERNATIONAL UNION OF`
|
|
107
|
+
* Derivation: Trim and remove trailing `, ET AL` from the raw IDB field; matches the record subject name.
|
|
108
|
+
|
|
109
|
+
### Nature of suit
|
|
110
|
+
|
|
111
|
+
* `nos_code`
|
|
112
|
+
* Definition: Numeric NOS code from the case row.
|
|
113
|
+
* Examples: `110`, `442`
|
|
114
|
+
* Derivation: SAS `NOS`.
|
|
115
|
+
|
|
116
|
+
* `nos_title`
|
|
117
|
+
* Definition: Short title from the U.S. Courts NOS codebook.
|
|
118
|
+
* Examples: `Insurance`, `Employment`
|
|
119
|
+
* Derivation: Lookup table when code is known; omitted if unknown.
|
|
120
|
+
|
|
121
|
+
* `nos_description`
|
|
122
|
+
* Definition: Long description from the same codebook.
|
|
123
|
+
* Derivation: Lookup table when code is known; omitted if unknown.
|
|
124
|
+
|
|
125
|
+
---
|
|
126
|
+
|
|
127
|
+
## Entity Relationships
|
|
128
|
+
|
|
129
|
+
```
|
|
130
|
+
legal_case ──[has_nature_of_suit]──→ nature_of_suit
|
|
131
|
+
|
|
132
|
+
person ──[is_plaintiff_in]────→ legal_case
|
|
133
|
+
organization ──[is_plaintiff_in]────→ legal_case
|
|
134
|
+
|
|
135
|
+
person ──[is_defendant_in]────→ legal_case
|
|
136
|
+
organization ──[is_defendant_in]────→ legal_case
|
|
137
|
+
```
|
|
138
|
+
|
|
139
|
+
- **`is_plaintiff_in` / `is_defendant_in`:** Emitted on the **party** record; target atom points at the case entity (with case strong id on the target for graph linkage). Only for non-empty, non-numeric, non-`SEALED` PLT/DEF values.
|
|
140
|
+
- **`has_nature_of_suit`:** Emitted on the **case** record when `NOS` is present; target is the `nature_of_suit` entity for that code.
|
|
141
|
+
|
|
142
|
+
---
|
|
143
|
+
|
|
144
|
+
## Records Per Case
|
|
145
|
+
|
|
146
|
+
Typical atomization for one row with textual PLT and DEF, and a NOS code:
|
|
147
|
+
|
|
148
|
+
1. One `legal_case` record (case properties + `has_nature_of_suit` target).
|
|
149
|
+
2. One `nature_of_suit` record (when NOS present).
|
|
150
|
+
3. Up to two party records (`person` or `organization` per side).
|
|
151
|
+
|
|
152
|
+
Rows with only numeric PLT/DEF produce a case record with count properties only (no party entities). `SEALED` parties produce no party entities.
|
|
153
|
+
|
|
154
|
+
---
|
|
155
|
+
|
|
156
|
+
## Party labeling (person vs organization)
|
|
157
|
+
|
|
158
|
+
Textual PLT/DEF labels are classified before flavor assignment:
|
|
159
|
+
|
|
160
|
+
1. **Skip** — empty, all digits, or withheld (`SEALED`).
|
|
161
|
+
2. **Organization** — label matches org indicators (legal suffixes, government words, `OF`/`AND`/`THE`, commas, truncated org stems) or has **three or more** name tokens after normalization.
|
|
162
|
+
3. **Person** — one or two tokens that look like personal name parts (letters, hyphen, apostrophe).
|
|
163
|
+
4. **Default** — organization.
|
|
164
|
+
|
|
165
|
+
Consumers should treat regex labels as heuristic, especially on truncated government and corporate strings.
|
|
166
|
+
|
|
167
|
+
---
|
|
168
|
+
|
|
169
|
+
## Citations
|
|
170
|
+
|
|
171
|
+
Citation text on atoms and entities: `Federal Judicial Center Integrated Database, civil SAS extract` with link to the FJC IDB research page. NOS entities additionally cite the U.S. Courts NOS descriptions document.
|
|
172
|
+
|
|
173
|
+
---
|
|
174
|
+
|
|
175
|
+
## Validation
|
|
176
|
+
|
|
177
|
+
- **Unit fixture:** `testdata/sample_fjcidb_case.pb.txt` (SHANKS vs International Union, NOS 442) — regenerate via `./run_legal_test.sh` at repo root.
|
|
178
|
+
- **recordeval:** Same script runs schema validation on `testdata/*.pb.txt`.
|
|
@@ -0,0 +1,206 @@
|
|
|
1
|
+
# Dataset schema for FJC Integrated Database — U.S. district court civil cases (SAS extracts).
|
|
2
|
+
#
|
|
3
|
+
# Source: https://www.fjc.gov/research/idb/civil-cases-filed-terminated-and-pending-sy-1988-present
|
|
4
|
+
# Codebook: Civil Codebook 1988 Forward (FJC PDF).
|
|
5
|
+
#
|
|
6
|
+
# Default ingest target is the FY civil SAS file (e.g. cv26.sas7bdat), refreshed by FJC on a schedule.
|
|
7
|
+
name: "fjcidb"
|
|
8
|
+
description: "Federal Judicial Center Integrated Database civil case records for U.S. district courts, including office and docket identity, party labels (paintif/defendant), filing and termination timing, jurisdiction, nature-of-suit entities linked from each case, and disposition codes"
|
|
9
|
+
|
|
10
|
+
extraction:
|
|
11
|
+
flavors: closed
|
|
12
|
+
properties: closed
|
|
13
|
+
relationships: closed
|
|
14
|
+
attributes: closed
|
|
15
|
+
events: closed
|
|
16
|
+
|
|
17
|
+
flavors:
|
|
18
|
+
- name: "legal_case"
|
|
19
|
+
description: "A civil case docketed in a U.S. federal district court, identified by FJC administrative codes rather than PACER"
|
|
20
|
+
display_name: "Legal case"
|
|
21
|
+
mergeability: not_mergeable
|
|
22
|
+
strong_id_properties: ["fjcidb_case_id"]
|
|
23
|
+
passive: true
|
|
24
|
+
|
|
25
|
+
- name: "organization"
|
|
26
|
+
description: "A particular business, institution, or organization such as a corporation, university, government agency, or non-profit"
|
|
27
|
+
display_name: "Organization"
|
|
28
|
+
mergeability: mergeable
|
|
29
|
+
passive: true
|
|
30
|
+
|
|
31
|
+
- name: "person"
|
|
32
|
+
description: "A real person as opposed to a fictional character, such as a CEO, politician, or public figure"
|
|
33
|
+
display_name: "Person"
|
|
34
|
+
mergeability: mergeable
|
|
35
|
+
passive: true
|
|
36
|
+
|
|
37
|
+
- name: "nature_of_suit"
|
|
38
|
+
description: "Federal civil nature-of-suit (NOS) code from the U.S. Courts classification"
|
|
39
|
+
display_name: "Nature of suit"
|
|
40
|
+
mergeability: not_mergeable
|
|
41
|
+
strong_id_properties: ["nos_code"]
|
|
42
|
+
passive: true
|
|
43
|
+
|
|
44
|
+
properties:
|
|
45
|
+
- name: "fjcidb_case_id"
|
|
46
|
+
namespace: "fjcidb"
|
|
47
|
+
type: string
|
|
48
|
+
description: "Stable FJC civil case identifier formed from district, office, docket, and filing year"
|
|
49
|
+
display_name: "FJC civil case ID"
|
|
50
|
+
mergeability: not_mergeable
|
|
51
|
+
domain_flavors: ["legal_case"]
|
|
52
|
+
passive: true
|
|
53
|
+
|
|
54
|
+
- name: "name"
|
|
55
|
+
namespace: "fjcidb"
|
|
56
|
+
type: string
|
|
57
|
+
description: "Display name of the entity"
|
|
58
|
+
display_name: "Name"
|
|
59
|
+
mergeability: not_mergeable
|
|
60
|
+
domain_flavors: ["organization", "person"]
|
|
61
|
+
passive: true
|
|
62
|
+
|
|
63
|
+
- name: "nos_code"
|
|
64
|
+
namespace: "fjcidb"
|
|
65
|
+
type: string
|
|
66
|
+
description: "Nature of suit numeric code (matches IDB NOS and the U.S. Courts civil NOS codebook)"
|
|
67
|
+
display_name: "NOS code"
|
|
68
|
+
mergeability: not_mergeable
|
|
69
|
+
domain_flavors: ["nature_of_suit"]
|
|
70
|
+
passive: true
|
|
71
|
+
|
|
72
|
+
- name: "nos_title"
|
|
73
|
+
namespace: "fjcidb"
|
|
74
|
+
type: string
|
|
75
|
+
description: "Short title for this nature of suit code from the U.S. Courts civil NOS descriptions"
|
|
76
|
+
display_name: "Nature of suit title"
|
|
77
|
+
mergeability: not_mergeable
|
|
78
|
+
domain_flavors: ["nature_of_suit"]
|
|
79
|
+
passive: true
|
|
80
|
+
|
|
81
|
+
- name: "nos_description"
|
|
82
|
+
namespace: "fjcidb"
|
|
83
|
+
type: string
|
|
84
|
+
description: "Official description for this nature of suit code from the U.S. Courts civil NOS descriptions"
|
|
85
|
+
display_name: "Nature of suit description"
|
|
86
|
+
mergeability: not_mergeable
|
|
87
|
+
domain_flavors: ["nature_of_suit"]
|
|
88
|
+
passive: true
|
|
89
|
+
|
|
90
|
+
- name: "district_code"
|
|
91
|
+
namespace: "fjcidb"
|
|
92
|
+
type: string
|
|
93
|
+
description: "FJC district court code for where the civil action was filed (IDB DISTRICT)"
|
|
94
|
+
display_name: "FJC district code"
|
|
95
|
+
mergeability: not_mergeable
|
|
96
|
+
domain_flavors: ["legal_case"]
|
|
97
|
+
passive: true
|
|
98
|
+
|
|
99
|
+
- name: "office_code"
|
|
100
|
+
namespace: "fjcidb"
|
|
101
|
+
type: string
|
|
102
|
+
description: "FJC office code within the district where the case was filed (IDB OFFICE)"
|
|
103
|
+
display_name: "FJC office code"
|
|
104
|
+
mergeability: not_mergeable
|
|
105
|
+
domain_flavors: ["legal_case"]
|
|
106
|
+
passive: true
|
|
107
|
+
|
|
108
|
+
- name: "case_docket_number"
|
|
109
|
+
namespace: "fjcidb"
|
|
110
|
+
type: string
|
|
111
|
+
description: "Docket number assigned by the filing office (IDB DOCKET)"
|
|
112
|
+
display_name: "Docket number"
|
|
113
|
+
mergeability: not_mergeable
|
|
114
|
+
domain_flavors: ["legal_case"]
|
|
115
|
+
passive: true
|
|
116
|
+
|
|
117
|
+
- name: "case_filing_date"
|
|
118
|
+
namespace: "fjcidb"
|
|
119
|
+
type: string
|
|
120
|
+
description: "Case filing date"
|
|
121
|
+
display_name: "Filing date"
|
|
122
|
+
mergeability: not_mergeable
|
|
123
|
+
domain_flavors: ["legal_case"]
|
|
124
|
+
passive: true
|
|
125
|
+
|
|
126
|
+
- name: "termination_date"
|
|
127
|
+
namespace: "fjcidb"
|
|
128
|
+
type: string
|
|
129
|
+
description: "Termination date if the case terminated in the reporting window"
|
|
130
|
+
display_name: "Termination date"
|
|
131
|
+
mergeability: not_mergeable
|
|
132
|
+
domain_flavors: ["legal_case"]
|
|
133
|
+
passive: true
|
|
134
|
+
|
|
135
|
+
- name: "origin_code"
|
|
136
|
+
namespace: "fjcidb"
|
|
137
|
+
type: string
|
|
138
|
+
description: "Procedural origin code for the filing"
|
|
139
|
+
display_name: "Origin code"
|
|
140
|
+
mergeability: not_mergeable
|
|
141
|
+
domain_flavors: ["legal_case"]
|
|
142
|
+
passive: true
|
|
143
|
+
|
|
144
|
+
- name: "jurisdiction_code"
|
|
145
|
+
namespace: "fjcidb"
|
|
146
|
+
type: string
|
|
147
|
+
description: "Jurisdiction code"
|
|
148
|
+
display_name: "Jurisdiction code"
|
|
149
|
+
mergeability: not_mergeable
|
|
150
|
+
domain_flavors: ["legal_case"]
|
|
151
|
+
passive: true
|
|
152
|
+
|
|
153
|
+
- name: "disposition_code"
|
|
154
|
+
namespace: "fjcidb"
|
|
155
|
+
type: string
|
|
156
|
+
description: "Disposition code when the case terminated"
|
|
157
|
+
display_name: "Disposition code"
|
|
158
|
+
mergeability: not_mergeable
|
|
159
|
+
domain_flavors: ["legal_case"]
|
|
160
|
+
passive: true
|
|
161
|
+
|
|
162
|
+
- name: "class_action_code"
|
|
163
|
+
namespace: "fjcidb"
|
|
164
|
+
type: string
|
|
165
|
+
description: "Class action designation code"
|
|
166
|
+
display_name: "Class action code"
|
|
167
|
+
mergeability: not_mergeable
|
|
168
|
+
domain_flavors: ["legal_case"]
|
|
169
|
+
passive: true
|
|
170
|
+
|
|
171
|
+
- name: "procedural_progress_code"
|
|
172
|
+
namespace: "fjcidb"
|
|
173
|
+
type: string
|
|
174
|
+
description: "Procedural progress code"
|
|
175
|
+
display_name: "Procedural progress code"
|
|
176
|
+
mergeability: not_mergeable
|
|
177
|
+
domain_flavors: ["legal_case"]
|
|
178
|
+
passive: true
|
|
179
|
+
|
|
180
|
+
relationships:
|
|
181
|
+
- name: "is_plaintiff_in"
|
|
182
|
+
namespace: "fjcidb"
|
|
183
|
+
description: "The party is named as the plaintiff side in this FJC civil case"
|
|
184
|
+
display_name: "Is plaintiff in"
|
|
185
|
+
mergeability: not_mergeable
|
|
186
|
+
domain_flavors: ["organization", "person"]
|
|
187
|
+
target_flavors: ["legal_case"]
|
|
188
|
+
passive: true
|
|
189
|
+
|
|
190
|
+
- name: "is_defendant_in"
|
|
191
|
+
namespace: "fjcidb"
|
|
192
|
+
description: "The party is named as the defendant side in this FJC civil case"
|
|
193
|
+
display_name: "Is defendant in"
|
|
194
|
+
mergeability: not_mergeable
|
|
195
|
+
domain_flavors: ["organization", "person"]
|
|
196
|
+
target_flavors: ["legal_case"]
|
|
197
|
+
passive: true
|
|
198
|
+
|
|
199
|
+
- name: "has_nature_of_suit"
|
|
200
|
+
namespace: "fjcidb"
|
|
201
|
+
description: "The civil case is classified under this federal nature-of-suit code (IDB NOS field)"
|
|
202
|
+
display_name: "Has nature of suit"
|
|
203
|
+
mergeability: not_mergeable
|
|
204
|
+
domain_flavors: ["legal_case"]
|
|
205
|
+
target_flavors: ["nature_of_suit"]
|
|
206
|
+
passive: true
|
|
@@ -0,0 +1,240 @@
|
|
|
1
|
+
# Data Dictionary: SBA Loans (FOIA)
|
|
2
|
+
|
|
3
|
+
## Source Overview
|
|
4
|
+
|
|
5
|
+
The SBA 7(a) & 504 FOIA dataset is published by the U.S. Small Business Administration, Office of Capital Access. It contains loan-level records for the two main SBA loan guarantee programs:
|
|
6
|
+
|
|
7
|
+
- **7(a) Loan Program** -- the SBA's primary and most flexible loan program, covering working capital, equipment, real estate, and debt refinancing. Maximum loan amount is $5 million.
|
|
8
|
+
- **504 Loan Program** -- provides long-term, fixed-rate financing for major fixed assets (real estate, heavy equipment) through Certified Development Companies (CDCs). Structured as a three-party arrangement: borrower (10%), CDC-backed SBA portion (40%), and a conventional third-party lender (50%).
|
|
9
|
+
|
|
10
|
+
The data is distributed as CSV files via the CKAN open data platform at `https://data.sba.gov/dataset/7-a-504-foia`. File URLs change with each quarterly update (contain an "as of" date); the CKAN package API is used to discover current resource URLs dynamically.
|
|
11
|
+
|
|
12
|
+
The dataset is updated quarterly, typically available one month after the quarter ends.
|
|
13
|
+
|
|
14
|
+
| Program | CSV Files | Approx Size |
|
|
15
|
+
|---------|-----------|-------------|
|
|
16
|
+
| 7(a) | 4 files by decade: FY1991-1999, FY2000-2009, FY2010-2019, FY2020-Present | ~780MB |
|
|
17
|
+
| 504 | 2 files: FY1991-2009, FY2010-Present | ~104MB |
|
|
18
|
+
|
|
19
|
+
Both programs share a common column subset (borrower info, approval details, NAICS codes, loan status). 7(a) adds bank/lender fields and interest rate details; 504 adds CDC and third-party lender fields.
|
|
20
|
+
|
|
21
|
+
| Pipeline | `Record.Source` |
|
|
22
|
+
|----------|----------------|
|
|
23
|
+
| All files | `sbaloans` |
|
|
24
|
+
|
|
25
|
+
---
|
|
26
|
+
|
|
27
|
+
## Entity Types
|
|
28
|
+
|
|
29
|
+
### `financial_instrument`
|
|
30
|
+
|
|
31
|
+
An SBA-backed loan under the 7(a) or 504 program, identified by its location ID (a unique loan identifier assigned by the SBA).
|
|
32
|
+
|
|
33
|
+
- Primary key: Composite of program and location ID (e.g., `"7a_317954"`, `"504_188194"`)
|
|
34
|
+
- Entity resolver: named entity. Strong ID = `sba_loan_id`. Not mergeable across sources.
|
|
35
|
+
|
|
36
|
+
### `organization`
|
|
37
|
+
|
|
38
|
+
A business, bank, CDC, or third-party lender involved in an SBA loan. Three roles:
|
|
39
|
+
- **Borrower** -- the small business receiving the loan
|
|
40
|
+
- **Bank/Primary Lender** (7(a)) -- the financial institution originating the loan, identified by FDIC certificate number when available
|
|
41
|
+
- **CDC** (504) -- the Certified Development Company administering the 504 portion
|
|
42
|
+
- **Third-Party Lender** (504) -- the conventional lender providing the remaining portion
|
|
43
|
+
|
|
44
|
+
- Primary key: Organization name (borrowers), or FDIC certificate number (banks)
|
|
45
|
+
- Entity resolver: named entity, mergeable. Banks with an FDIC certificate number use it as a strong ID (property `fdic_certificate_number`), enabling cross-source entity resolution with existing FDIC-sourced bank entities.
|
|
46
|
+
|
|
47
|
+
### `location`
|
|
48
|
+
|
|
49
|
+
A geographic location representing the project county and state where the loan-funded activity takes place.
|
|
50
|
+
|
|
51
|
+
- Primary key: County + state combination (e.g., `"EAST BATON ROUGE, LA"`)
|
|
52
|
+
- Entity resolver: named entity, mergeable. Disambiguation via county name + state code.
|
|
53
|
+
|
|
54
|
+
---
|
|
55
|
+
|
|
56
|
+
## Properties
|
|
57
|
+
|
|
58
|
+
### Loan Properties (`financial_instrument`)
|
|
59
|
+
|
|
60
|
+
#### Identity
|
|
61
|
+
|
|
62
|
+
* `sba_loan_id`
|
|
63
|
+
* Definition: Composite SBA loan identifier combining program and location ID.
|
|
64
|
+
* Examples: `"7a_317954"`, `"504_188194"`
|
|
65
|
+
* Derivation: Constructed from `program` (normalized) + `"_"` + `locationid` CSV column.
|
|
66
|
+
|
|
67
|
+
* `program`
|
|
68
|
+
* Definition: SBA loan program name.
|
|
69
|
+
* Examples: `"7a"`, `"504"`
|
|
70
|
+
* Derivation: `program` CSV column, normalized (leading/trailing whitespace trimmed, " 7A" becomes "7a").
|
|
71
|
+
|
|
72
|
+
#### Financial Terms
|
|
73
|
+
|
|
74
|
+
* `gross_approval`
|
|
75
|
+
* Definition: Total approved loan amount in dollars.
|
|
76
|
+
* Examples: `"450000"`, `"810000"`
|
|
77
|
+
* Derivation: `grossapproval` CSV column.
|
|
78
|
+
|
|
79
|
+
* `sba_guaranteed_approval`
|
|
80
|
+
* Definition: SBA-guaranteed portion of the loan amount in dollars. 7(a) only.
|
|
81
|
+
* Examples: `"337500"`
|
|
82
|
+
* Derivation: `sbaguaranteedapproval` CSV column. Empty for 504 loans.
|
|
83
|
+
|
|
84
|
+
* `initial_interest_rate`
|
|
85
|
+
* Definition: Initial interest rate of the loan as a percentage. 7(a) only.
|
|
86
|
+
* Examples: `"6"`, `"5.5"`
|
|
87
|
+
* Derivation: `initialinterestrate` CSV column.
|
|
88
|
+
|
|
89
|
+
* `interest_type`
|
|
90
|
+
* Definition: Whether the interest rate is fixed or variable. 7(a) only.
|
|
91
|
+
* Examples: `"Fixed"`, `"Variable"`
|
|
92
|
+
* Derivation: `fixedorvariableinterestind` CSV column, decoded: F=Fixed, V=Variable.
|
|
93
|
+
|
|
94
|
+
* `term_in_months`
|
|
95
|
+
* Definition: Loan term length in months.
|
|
96
|
+
* Examples: `"120"`, `"240"`, `"300"`
|
|
97
|
+
* Derivation: `terminmonths` CSV column.
|
|
98
|
+
|
|
99
|
+
* `third_party_dollars`
|
|
100
|
+
* Definition: Third-party lender contribution in dollars. 504 only.
|
|
101
|
+
* Examples: `"1334500"`
|
|
102
|
+
* Derivation: `thirdpartydollars` CSV column.
|
|
103
|
+
|
|
104
|
+
#### Dates
|
|
105
|
+
|
|
106
|
+
* `approval_date`
|
|
107
|
+
* Definition: Date the loan was approved.
|
|
108
|
+
* Examples: `"12/5/2020"`, `"10/2/2009"`
|
|
109
|
+
* Derivation: `approvaldate` CSV column, format M/D/YYYY.
|
|
110
|
+
|
|
111
|
+
* `approval_fiscal_year`
|
|
112
|
+
* Definition: Federal fiscal year of approval (Oct 1 - Sep 30).
|
|
113
|
+
* Examples: `"2021"`, `"2010"`
|
|
114
|
+
* Derivation: `approvalfy` CSV column.
|
|
115
|
+
|
|
116
|
+
* `first_disbursement_date`
|
|
117
|
+
* Definition: Date of first loan disbursement.
|
|
118
|
+
* Examples: `"1/1/2021"`, `"9/15/2010"`
|
|
119
|
+
* Derivation: `firstdisbursementdate` CSV column.
|
|
120
|
+
|
|
121
|
+
* `paid_in_full_date`
|
|
122
|
+
* Definition: Date the loan was paid in full.
|
|
123
|
+
* Derivation: `paidinfulldate` CSV column. Empty if loan is not paid off.
|
|
124
|
+
|
|
125
|
+
* `chargeoff_date`
|
|
126
|
+
* Definition: Date the loan was charged off.
|
|
127
|
+
* Derivation: `chargeoffdate` CSV column. Empty if loan has not been charged off.
|
|
128
|
+
|
|
129
|
+
#### Processing
|
|
130
|
+
|
|
131
|
+
* `processing_method`
|
|
132
|
+
* Definition: SBA processing method used for the loan.
|
|
133
|
+
* Examples: `"Preferred Lenders Program"`, `"504 Basic"`
|
|
134
|
+
* Derivation: `processingmethod` CSV column.
|
|
135
|
+
|
|
136
|
+
* `sub_program`
|
|
137
|
+
* Definition: SBA sub-program designation.
|
|
138
|
+
* Examples: `"Guaranty"`, `"Sec. 504 - Loan Guarantees - Private Sector Financed"`
|
|
139
|
+
* Derivation: `subprogram` CSV column.
|
|
140
|
+
|
|
141
|
+
#### Industry and Business
|
|
142
|
+
|
|
143
|
+
* `naics_code`
|
|
144
|
+
* Definition: North American Industry Classification System code for the borrower's business.
|
|
145
|
+
* Examples: `"531390"`, `"721110"`
|
|
146
|
+
* Derivation: `naicscode` CSV column.
|
|
147
|
+
|
|
148
|
+
* `naics_description`
|
|
149
|
+
* Definition: Description of the NAICS industry classification.
|
|
150
|
+
* Examples: `"Other Activities Related to Real Estate"`, `"Hotels (except Casino Hotels) and Motels"`
|
|
151
|
+
* Derivation: `naicsdescription` CSV column.
|
|
152
|
+
|
|
153
|
+
* `franchise_name`
|
|
154
|
+
* Definition: Name of the franchise if the borrower is a franchisee.
|
|
155
|
+
* Examples: `"CHOICE HOTELS INTERNATIONAL INC."`
|
|
156
|
+
* Derivation: `franchisename` CSV column. Empty if not a franchise.
|
|
157
|
+
|
|
158
|
+
* `business_type`
|
|
159
|
+
* Definition: Legal structure of the borrowing business.
|
|
160
|
+
* Examples: `"CORPORATION"`, `"LLC"`, `"PARTNERSHIP"`, `"INDIVIDUAL"`
|
|
161
|
+
* Derivation: `businesstype` CSV column.
|
|
162
|
+
|
|
163
|
+
* `business_age`
|
|
164
|
+
* Definition: Age category of the borrowing business at time of loan.
|
|
165
|
+
* Examples: `"Existing or more than 2 years old"`, `"New Business or 2 years or less"`, `"Less than 4 years old but at least 3"`
|
|
166
|
+
* Derivation: `businessage` CSV column.
|
|
167
|
+
|
|
168
|
+
#### Status
|
|
169
|
+
|
|
170
|
+
* `loan_status`
|
|
171
|
+
* Definition: Current status of the loan.
|
|
172
|
+
* Examples: `"LIQUID"` (active), `"P I F"` (paid in full), `"CHGOFF"` (charged off), `"EXEMPT"`, `"PREPAID IN FULL"`
|
|
173
|
+
* Derivation: `loanstatus` CSV column.
|
|
174
|
+
|
|
175
|
+
* `gross_chargeoff_amount`
|
|
176
|
+
* Definition: Gross charge-off amount in dollars.
|
|
177
|
+
* Examples: `"0"`, `"125000"`
|
|
178
|
+
* Derivation: `grosschargeoffamount` CSV column.
|
|
179
|
+
|
|
180
|
+
* `revolver_status`
|
|
181
|
+
* Definition: Whether the loan is a revolving line of credit. 7(a) only.
|
|
182
|
+
* Examples: `"TRUE"`, `"FALSE"`
|
|
183
|
+
* Derivation: `revolverstatus` CSV column.
|
|
184
|
+
|
|
185
|
+
* `jobs_supported`
|
|
186
|
+
* Definition: Number of jobs supported by the loan.
|
|
187
|
+
* Examples: `"9"`, `"250"`, `"0"`
|
|
188
|
+
* Derivation: `jobssupported` CSV column.
|
|
189
|
+
|
|
190
|
+
#### Geography
|
|
191
|
+
|
|
192
|
+
* `sba_district_office`
|
|
193
|
+
* Definition: SBA district office that processed the loan.
|
|
194
|
+
* Examples: `"LOUISIANA DISTRICT OFFICE"`, `"ILLINOIS DISTRICT OFFICE"`
|
|
195
|
+
* Derivation: `sbadistrictoffice` CSV column.
|
|
196
|
+
|
|
197
|
+
* `congressional_district`
|
|
198
|
+
* Definition: Congressional district of the project.
|
|
199
|
+
* Examples: `"6"`, `"13"`
|
|
200
|
+
* Derivation: `congressionaldistrict` CSV column.
|
|
201
|
+
|
|
202
|
+
### Organization Properties
|
|
203
|
+
|
|
204
|
+
* `address`
|
|
205
|
+
* Definition: Physical address of the organization formatted as "street, city, state zip".
|
|
206
|
+
* Examples: `"11851 Wentling Ave, BATON ROUGE, LA 70816"`, `"1981 Marcus Avenue, LAKE SUCCESS, NY 11042"`
|
|
207
|
+
* Derivation: Formatted from borrower address fields (`borrstreet`/`borrcity`/`borrstate`/`borrzip`) for borrowers, bank fields (`bankstreet`/`bankcity`/`bankstate`/`bankzip`) for 7(a) banks, or CDC fields (`cdc_street`/`cdc_city`/`cdc_state`/`cdc_zip`) for 504 CDCs.
|
|
208
|
+
|
|
209
|
+
* `fdic_certificate_number`
|
|
210
|
+
* Definition: FDIC certificate number identifying a bank. Used as a strong ID for entity resolution with FDIC-sourced bank entities.
|
|
211
|
+
* Examples: `"59345"`
|
|
212
|
+
* Derivation: `bankfdicnumber` CSV column. Only present for 7(a) bank entities.
|
|
213
|
+
* Note: Enables cross-source entity resolution with the FDIC dataset (`fdic` source), which also uses `fdic_certificate_number` as a strong ID property on bank organizations.
|
|
214
|
+
|
|
215
|
+
* `ncua_number`
|
|
216
|
+
* Definition: NCUA charter number identifying a credit union.
|
|
217
|
+
* Derivation: `bankncuanumber` CSV column. Only present for 7(a) entities where the lender is a credit union.
|
|
218
|
+
* Note: Sparse -- most lenders are banks, not credit unions.
|
|
219
|
+
|
|
220
|
+
---
|
|
221
|
+
|
|
222
|
+
## Entity Relationships
|
|
223
|
+
|
|
224
|
+
```
|
|
225
|
+
organization (borrower) ──[borrower_of]──→ financial_instrument (loan)
|
|
226
|
+
organization (bank/CDC) ──[lender_of]────→ financial_instrument (loan) attr: lender_role=primary|cdc
|
|
227
|
+
organization (third-party) ──[lender_of]────→ financial_instrument (loan) attr: lender_role=third_party
|
|
228
|
+
financial_instrument (loan) ──[is_located_at]─→ location (project county/state)
|
|
229
|
+
```
|
|
230
|
+
|
|
231
|
+
---
|
|
232
|
+
|
|
233
|
+
## Attributes
|
|
234
|
+
|
|
235
|
+
* `lender_role` (on `lender_of` relationship)
|
|
236
|
+
* Definition: Role of the lending organization in the loan.
|
|
237
|
+
* Values: `"primary"` (7(a) bank), `"cdc"` (504 Certified Development Company), `"third_party"` (504 conventional third-party lender)
|
|
238
|
+
* Derivation: Set based on which organization record is being emitted and the loan program type.
|
|
239
|
+
|
|
240
|
+
---
|
|
@@ -0,0 +1,380 @@
|
|
|
1
|
+
# Dataset schema for SBA 7(a) & 504 FOIA loan data.
|
|
2
|
+
#
|
|
3
|
+
# Source: https://data.sba.gov/dataset/7-a-504-foia
|
|
4
|
+
# Updated quarterly (typically one month after quarter end).
|
|
5
|
+
#
|
|
6
|
+
# Each CSV row represents a single SBA-backed loan. The atomizer produces:
|
|
7
|
+
# - A financial_instrument entity for the loan itself
|
|
8
|
+
# - An organization entity for the borrower
|
|
9
|
+
# - An organization entity for the bank/CDC/lender
|
|
10
|
+
# - Optionally, a third-party lender organization (504 loans)
|
|
11
|
+
# - A location entity for the project county/state
|
|
12
|
+
name: "sbaloans"
|
|
13
|
+
description: "SBA 7(a) and 504 FOIA loan data including loan terms, borrower and lender organizations, and project locations"
|
|
14
|
+
|
|
15
|
+
extraction:
|
|
16
|
+
flavors: closed
|
|
17
|
+
properties: closed
|
|
18
|
+
relationships: closed
|
|
19
|
+
attributes: closed
|
|
20
|
+
events: closed
|
|
21
|
+
|
|
22
|
+
flavors:
|
|
23
|
+
- name: "sba_loan"
|
|
24
|
+
description: "An SBA-backed loan under the 7(a) or 504 program"
|
|
25
|
+
display_name: "SBA Loan"
|
|
26
|
+
mergeability: not_mergeable
|
|
27
|
+
strong_id_properties: ["sba_loan_id"]
|
|
28
|
+
passive: true
|
|
29
|
+
|
|
30
|
+
- name: "organization"
|
|
31
|
+
description: "A particular business, institution, or organization such as a corporation, university, government agency, or non-profit"
|
|
32
|
+
display_name: "Organization"
|
|
33
|
+
mergeability: not_mergeable
|
|
34
|
+
strong_id_properties: ["fdic_certificate_number"]
|
|
35
|
+
passive: true
|
|
36
|
+
|
|
37
|
+
- name: "sba_program"
|
|
38
|
+
description: "An SBA loan program or sub-program (e.g., SBA 7a: Guaranty, SBA 504: Sec. 504 - Loan Guarantees)"
|
|
39
|
+
display_name: "SBA Program"
|
|
40
|
+
mergeability: not_mergeable
|
|
41
|
+
strong_id_properties: ["sba_program_id"]
|
|
42
|
+
passive: true
|
|
43
|
+
|
|
44
|
+
- name: "industry"
|
|
45
|
+
description: "An industry classification of economic activity (e.g. NAICS or SIC) identifying the line of business associated with an organization or award"
|
|
46
|
+
display_name: "Industry"
|
|
47
|
+
mergeability: not_mergeable
|
|
48
|
+
strong_id_properties: ["naics_code"]
|
|
49
|
+
passive: true
|
|
50
|
+
|
|
51
|
+
- name: "location"
|
|
52
|
+
description: "A specific named geographic location such as a city, country, region, or landmark"
|
|
53
|
+
display_name: "Location"
|
|
54
|
+
mergeability: not_mergeable
|
|
55
|
+
examples: ["EAST BATON ROUGE, LA", "LINN, IA"]
|
|
56
|
+
passive: true
|
|
57
|
+
|
|
58
|
+
properties:
|
|
59
|
+
# Loan identity
|
|
60
|
+
- name: "sba_loan_id"
|
|
61
|
+
type: string
|
|
62
|
+
description: "Composite SBA loan identifier derived from program, borrower name, lender name, and approval date"
|
|
63
|
+
display_name: "SBA Loan ID"
|
|
64
|
+
mergeability: not_mergeable
|
|
65
|
+
domain_flavors: ["sba_loan"]
|
|
66
|
+
examples: ["7a_Title2Land,_L.L.C._Newtek_Small_Business_Finance,_Inc._12_5_2020"]
|
|
67
|
+
passive: true
|
|
68
|
+
|
|
69
|
+
|
|
70
|
+
- name: "sba_program_id"
|
|
71
|
+
type: string
|
|
72
|
+
description: "Composite identifier for an SBA program/sub-program (lowercased program name)"
|
|
73
|
+
display_name: "SBA Program ID"
|
|
74
|
+
mergeability: not_mergeable
|
|
75
|
+
domain_flavors: ["sba_program"]
|
|
76
|
+
examples: ["sba 7a: guaranty", "sba 504: sec. 504 - loan guarantees - private sector financed"]
|
|
77
|
+
passive: true
|
|
78
|
+
|
|
79
|
+
- name: "program"
|
|
80
|
+
type: string
|
|
81
|
+
description: "SBA loan program name"
|
|
82
|
+
display_name: "Program"
|
|
83
|
+
mergeability: not_mergeable
|
|
84
|
+
domain_flavors: ["sba_program"]
|
|
85
|
+
examples: ["7a", "504"]
|
|
86
|
+
passive: true
|
|
87
|
+
|
|
88
|
+
- name: "gross_approval"
|
|
89
|
+
type: float
|
|
90
|
+
description: "Total approved loan amount in dollars"
|
|
91
|
+
display_name: "Gross Approval"
|
|
92
|
+
mergeability: not_mergeable
|
|
93
|
+
domain_flavors: ["sba_loan"]
|
|
94
|
+
examples: ["450000", "810000"]
|
|
95
|
+
passive: true
|
|
96
|
+
|
|
97
|
+
- name: "sba_guaranteed_approval"
|
|
98
|
+
type: float
|
|
99
|
+
description: "SBA-guaranteed portion of the loan amount in dollars (7(a) only)"
|
|
100
|
+
display_name: "SBA Guaranteed Approval"
|
|
101
|
+
mergeability: not_mergeable
|
|
102
|
+
domain_flavors: ["sba_loan"]
|
|
103
|
+
examples: ["337500"]
|
|
104
|
+
passive: true
|
|
105
|
+
|
|
106
|
+
- name: "approval_date"
|
|
107
|
+
type: string
|
|
108
|
+
description: "Date the loan was approved"
|
|
109
|
+
display_name: "Approval Date"
|
|
110
|
+
mergeability: not_mergeable
|
|
111
|
+
domain_flavors: ["sba_loan"]
|
|
112
|
+
examples: ["12/5/2020"]
|
|
113
|
+
passive: true
|
|
114
|
+
|
|
115
|
+
- name: "approval_fiscal_year"
|
|
116
|
+
type: string
|
|
117
|
+
description: "Federal fiscal year of approval"
|
|
118
|
+
display_name: "Approval Fiscal Year"
|
|
119
|
+
mergeability: not_mergeable
|
|
120
|
+
domain_flavors: ["sba_loan"]
|
|
121
|
+
examples: ["2021", "2010"]
|
|
122
|
+
passive: true
|
|
123
|
+
|
|
124
|
+
- name: "first_disbursement_date"
|
|
125
|
+
type: string
|
|
126
|
+
description: "Date of first loan disbursement"
|
|
127
|
+
display_name: "First Disbursement Date"
|
|
128
|
+
mergeability: not_mergeable
|
|
129
|
+
domain_flavors: ["sba_loan"]
|
|
130
|
+
passive: true
|
|
131
|
+
|
|
132
|
+
- name: "term_in_months"
|
|
133
|
+
type: float
|
|
134
|
+
description: "Loan term length in months"
|
|
135
|
+
display_name: "Term in Months"
|
|
136
|
+
mergeability: not_mergeable
|
|
137
|
+
domain_flavors: ["sba_loan"]
|
|
138
|
+
examples: ["120", "240", "300"]
|
|
139
|
+
passive: true
|
|
140
|
+
|
|
141
|
+
- name: "initial_interest_rate"
|
|
142
|
+
type: float
|
|
143
|
+
description: "Initial interest rate of the loan as a percentage (7(a) only)"
|
|
144
|
+
display_name: "Initial Interest Rate"
|
|
145
|
+
mergeability: not_mergeable
|
|
146
|
+
domain_flavors: ["sba_loan"]
|
|
147
|
+
examples: ["6", "5.5"]
|
|
148
|
+
passive: true
|
|
149
|
+
|
|
150
|
+
- name: "interest_type"
|
|
151
|
+
type: string
|
|
152
|
+
description: "Whether the interest rate is fixed or variable (7(a) only)"
|
|
153
|
+
display_name: "Interest Type"
|
|
154
|
+
mergeability: not_mergeable
|
|
155
|
+
domain_flavors: ["sba_loan"]
|
|
156
|
+
examples: ["Fixed", "Variable"]
|
|
157
|
+
passive: true
|
|
158
|
+
|
|
159
|
+
- name: "processing_method"
|
|
160
|
+
type: string
|
|
161
|
+
description: "SBA processing method used for the loan"
|
|
162
|
+
display_name: "Processing Method"
|
|
163
|
+
mergeability: not_mergeable
|
|
164
|
+
domain_flavors: ["sba_loan"]
|
|
165
|
+
examples: ["Preferred Lenders Program", "504 Basic"]
|
|
166
|
+
passive: true
|
|
167
|
+
|
|
168
|
+
- name: "sub_program"
|
|
169
|
+
type: string
|
|
170
|
+
description: "SBA sub-program designation"
|
|
171
|
+
display_name: "Sub-Program"
|
|
172
|
+
mergeability: not_mergeable
|
|
173
|
+
domain_flavors: ["sba_program"]
|
|
174
|
+
examples: ["Guaranty", "Sec. 504 - Loan Guarantees - Private Sector Financed"]
|
|
175
|
+
passive: true
|
|
176
|
+
|
|
177
|
+
- name: "naics_code"
|
|
178
|
+
type: string
|
|
179
|
+
description: "North American Industry Classification System code (typically 6 digits) identifying the industry of work performed under a contract (e.g., \"524114\" for Direct Health and Medical Insurance Carriers)"
|
|
180
|
+
display_name: "NAICS code"
|
|
181
|
+
mergeability: not_mergeable
|
|
182
|
+
domain_flavors: ["industry"]
|
|
183
|
+
examples: ["531390", "721110"]
|
|
184
|
+
passive: true
|
|
185
|
+
|
|
186
|
+
- name: "naics_description"
|
|
187
|
+
type: string
|
|
188
|
+
description: "Human-readable name of the NAICS industry (e.g., \"DIRECT HEALTH AND MEDICAL INSURANCE CARRIERS\")"
|
|
189
|
+
display_name: "NAICS description"
|
|
190
|
+
mergeability: not_mergeable
|
|
191
|
+
domain_flavors: ["industry"]
|
|
192
|
+
examples: ["Other Activities Related to Real Estate", "Hotels (except Casino Hotels) and Motels"]
|
|
193
|
+
passive: true
|
|
194
|
+
|
|
195
|
+
- name: "franchise_name"
|
|
196
|
+
type: string
|
|
197
|
+
description: "Name of the franchise if the borrower is a franchisee"
|
|
198
|
+
display_name: "Franchise Name"
|
|
199
|
+
mergeability: not_mergeable
|
|
200
|
+
domain_flavors: ["sba_loan"]
|
|
201
|
+
passive: true
|
|
202
|
+
|
|
203
|
+
- name: "business_type"
|
|
204
|
+
type: string
|
|
205
|
+
description: "Legal structure of the borrowing business"
|
|
206
|
+
display_name: "Business Type"
|
|
207
|
+
mergeability: not_mergeable
|
|
208
|
+
domain_flavors: ["sba_loan"]
|
|
209
|
+
examples: ["CORPORATION", "LLC", "PARTNERSHIP"]
|
|
210
|
+
passive: true
|
|
211
|
+
|
|
212
|
+
- name: "business_age"
|
|
213
|
+
type: string
|
|
214
|
+
description: "Age category of the borrowing business at time of loan"
|
|
215
|
+
display_name: "Business Age"
|
|
216
|
+
mergeability: not_mergeable
|
|
217
|
+
domain_flavors: ["sba_loan"]
|
|
218
|
+
examples: ["Existing or more than 2 years old", "New Business or 2 years or less"]
|
|
219
|
+
passive: true
|
|
220
|
+
|
|
221
|
+
- name: "loan_status"
|
|
222
|
+
type: string
|
|
223
|
+
description: "Current status of the loan"
|
|
224
|
+
display_name: "Loan Status"
|
|
225
|
+
mergeability: not_mergeable
|
|
226
|
+
domain_flavors: ["sba_loan"]
|
|
227
|
+
examples: ["LIQUID", "P I F", "CHGOFF", "EXEMPT"]
|
|
228
|
+
passive: true
|
|
229
|
+
|
|
230
|
+
- name: "paid_in_full_date"
|
|
231
|
+
type: string
|
|
232
|
+
description: "Date the loan was paid in full"
|
|
233
|
+
display_name: "Paid in Full Date"
|
|
234
|
+
mergeability: not_mergeable
|
|
235
|
+
domain_flavors: ["sba_loan"]
|
|
236
|
+
passive: true
|
|
237
|
+
|
|
238
|
+
- name: "chargeoff_date"
|
|
239
|
+
type: string
|
|
240
|
+
description: "Date the loan was charged off"
|
|
241
|
+
display_name: "Charge-Off Date"
|
|
242
|
+
mergeability: not_mergeable
|
|
243
|
+
domain_flavors: ["sba_loan"]
|
|
244
|
+
passive: true
|
|
245
|
+
|
|
246
|
+
- name: "gross_chargeoff_amount"
|
|
247
|
+
type: float
|
|
248
|
+
description: "Gross charge-off amount in dollars"
|
|
249
|
+
display_name: "Gross Charge-Off Amount"
|
|
250
|
+
mergeability: not_mergeable
|
|
251
|
+
domain_flavors: ["sba_loan"]
|
|
252
|
+
passive: true
|
|
253
|
+
|
|
254
|
+
- name: "revolver_status"
|
|
255
|
+
type: string
|
|
256
|
+
description: "Whether the loan is a revolving line of credit (7(a) only)"
|
|
257
|
+
display_name: "Revolver Status"
|
|
258
|
+
mergeability: not_mergeable
|
|
259
|
+
domain_flavors: ["sba_loan"]
|
|
260
|
+
passive: true
|
|
261
|
+
|
|
262
|
+
- name: "jobs_supported"
|
|
263
|
+
type: float
|
|
264
|
+
description: "Number of jobs supported by the loan"
|
|
265
|
+
display_name: "Jobs Supported"
|
|
266
|
+
mergeability: not_mergeable
|
|
267
|
+
domain_flavors: ["sba_loan"]
|
|
268
|
+
examples: ["9", "250"]
|
|
269
|
+
passive: true
|
|
270
|
+
|
|
271
|
+
- name: "third_party_dollars"
|
|
272
|
+
type: float
|
|
273
|
+
description: "Third-party lender contribution in dollars (504 only)"
|
|
274
|
+
display_name: "Third-Party Dollars"
|
|
275
|
+
mergeability: not_mergeable
|
|
276
|
+
domain_flavors: ["sba_loan"]
|
|
277
|
+
passive: true
|
|
278
|
+
|
|
279
|
+
- name: "sba_district_office"
|
|
280
|
+
type: string
|
|
281
|
+
description: "SBA district office that processed the loan"
|
|
282
|
+
display_name: "SBA District Office"
|
|
283
|
+
mergeability: not_mergeable
|
|
284
|
+
domain_flavors: ["sba_loan"]
|
|
285
|
+
passive: true
|
|
286
|
+
|
|
287
|
+
- name: "congressional_district"
|
|
288
|
+
type: string
|
|
289
|
+
description: "Congressional district of the project"
|
|
290
|
+
display_name: "Congressional District"
|
|
291
|
+
mergeability: not_mergeable
|
|
292
|
+
domain_flavors: ["sba_loan"]
|
|
293
|
+
passive: true
|
|
294
|
+
|
|
295
|
+
# Organization properties
|
|
296
|
+
- name: "address"
|
|
297
|
+
type: string
|
|
298
|
+
description: "Physical street address of the entity"
|
|
299
|
+
display_name: "Address"
|
|
300
|
+
mergeability: not_mergeable
|
|
301
|
+
domain_flavors: ["organization"]
|
|
302
|
+
examples: ["11851 Wentling Ave, BATON ROUGE, LA 70816"]
|
|
303
|
+
passive: true
|
|
304
|
+
|
|
305
|
+
- name: "fdic_certificate_number"
|
|
306
|
+
type: string
|
|
307
|
+
description: "FDIC Certificate Number uniquely identifying an insured institution, derived from the CERT field in the Institutions API"
|
|
308
|
+
display_name: "FDIC Certificate Number"
|
|
309
|
+
mergeability: not_mergeable
|
|
310
|
+
domain_flavors: ["organization"]
|
|
311
|
+
passive: true
|
|
312
|
+
|
|
313
|
+
- name: "ncua_number"
|
|
314
|
+
type: string
|
|
315
|
+
description: "NCUA charter number identifying a credit union"
|
|
316
|
+
display_name: "NCUA Number"
|
|
317
|
+
mergeability: not_mergeable
|
|
318
|
+
domain_flavors: ["organization"]
|
|
319
|
+
passive: true
|
|
320
|
+
|
|
321
|
+
relationships:
|
|
322
|
+
- name: "borrower_of"
|
|
323
|
+
description: "An organization is the borrower, applicant, or account party on a financial instrument such as a letter of credit, loan, or bond issue."
|
|
324
|
+
display_name: "Borrower Of"
|
|
325
|
+
mergeability: not_mergeable
|
|
326
|
+
domain_flavors: ["organization"]
|
|
327
|
+
target_flavors: ["sba_loan"]
|
|
328
|
+
examples: ["Title2Land, L.L.C. is borrower of SBA 7a Loan 317954"]
|
|
329
|
+
passive: true
|
|
330
|
+
|
|
331
|
+
- name: "lender_of"
|
|
332
|
+
description: "An organization (bank, CDC, or third-party lender) is the lender on an SBA loan"
|
|
333
|
+
display_name: "Lender Of"
|
|
334
|
+
mergeability: not_mergeable
|
|
335
|
+
domain_flavors: ["organization"]
|
|
336
|
+
target_flavors: ["sba_loan"]
|
|
337
|
+
examples: ["Newtek Small Business Finance, Inc. is lender of SBA 7a Loan 317954"]
|
|
338
|
+
passive: true
|
|
339
|
+
|
|
340
|
+
- name: "in_program"
|
|
341
|
+
description: "The SBA program/sub-program under which this loan was made"
|
|
342
|
+
display_name: "In Program"
|
|
343
|
+
mergeability: not_mergeable
|
|
344
|
+
domain_flavors: ["sba_loan"]
|
|
345
|
+
target_flavors: ["sba_program"]
|
|
346
|
+
passive: true
|
|
347
|
+
|
|
348
|
+
- name: "in_industry"
|
|
349
|
+
description: "The industry (NAICS) classification of the work performed under this contract or IDV"
|
|
350
|
+
display_name: "In industry (NAICS)"
|
|
351
|
+
mergeability: not_mergeable
|
|
352
|
+
domain_flavors: ["organization"]
|
|
353
|
+
target_flavors: ["industry"]
|
|
354
|
+
passive: true
|
|
355
|
+
|
|
356
|
+
- name: "project_is_located_at"
|
|
357
|
+
description: "Geographic location of the SBA loan's project"
|
|
358
|
+
display_name: "Project Located At"
|
|
359
|
+
mergeability: not_mergeable
|
|
360
|
+
domain_flavors: ["sba_loan"]
|
|
361
|
+
target_flavors: ["location"]
|
|
362
|
+
examples: ["SBA 7a Loan 317954 is located at EAST BATON ROUGE, LA"]
|
|
363
|
+
passive: true
|
|
364
|
+
|
|
365
|
+
attributes:
|
|
366
|
+
- properties: ["lender_of"]
|
|
367
|
+
name: "lender_type"
|
|
368
|
+
type: string
|
|
369
|
+
description: "Type of lender in the loan (primary, cdc, third_party)"
|
|
370
|
+
display_name: "Lender Type"
|
|
371
|
+
mergeability: not_mergeable
|
|
372
|
+
examples: ["primary", "cdc", "third_party"]
|
|
373
|
+
|
|
374
|
+
- properties: ["lender_of"]
|
|
375
|
+
name: "lender_amount"
|
|
376
|
+
type: float
|
|
377
|
+
description: "Dollar amount lent by this lender"
|
|
378
|
+
display_name: "Lender Amount"
|
|
379
|
+
mergeability: not_mergeable
|
|
380
|
+
examples: ["450000", "810000", "1334500"]
|