Flat File Schema
Overview
This document describes the star schema architecture for Sayari entity and relationship data modeling. The schema uses entity_id
as the universal primary key, with all data organized around entities (organizations, people, or other objects) and their relationships.
Fact Tables
The star schema contains two fact tables that serve as the central hub for analysis:
Entity Summary Table
Purpose: Primary fact table containing core entity metrics and attributes.
Key Relationships: All entity dimension tables join via entity_id
Relationship Summary Table
Purpose: Fact table capturing connections between entities with temporal tracking.
Key Relationships:
src_id
anddst_id
referenceentity_id
in Entity Summary- Relationship dimensions join via
(src_id, dst_id)
composite key
Entity Dimension Tables
Core Attributes
Name Table
Purpose: Stores name variations including translations and transliterations.
Identifier Table
Purpose: Various identifiers (tax IDs, registration numbers, etc.).
Status Table
Purpose: Entity status conditions over time.
Address Table
Purpose: Comprehensive geographic information for entities.
Business Attributes
Business Purpose Table
Purpose: Industry classifications and business activities.
Company Type Table
Purpose: Company type classification.
Country Table
Purpose: Country context and jurisdictions.
Financial Attributes
Financials Table
Purpose: Comprehensive financial metrics.
Monetary Value Table
Purpose: Monetary amounts in various contexts.
Shares Table (Entity)
Purpose: Share ownership information.
Supplementary Attributes
Contact Table
Purpose: Contact information.
Measurement Table
Purpose: Measurement information.
Additional Information Table
Purpose: Additional information data.
Edges Table
Purpose: Relationship connection summary.
Risk Attributes
Risk Intelligence Table
Purpose: Detailed risk intelligence information.
Risk Values Table
Purpose: Wide-format table with individual risk factors as columns.
Note: This table contains 200+ risk factor columns including sanctions indicators, PEP flags, export control violations, forced labor indicators, state ownership markers, and financial crime risks.
Risk Paths Table
Purpose: Risk propagation through entity networks.
Note: Similar structure to Risk Values but tracks how risks propagate through ownership and relationship chains.
Relationship Dimension Tables
Position Table
Purpose: Employment and governance positions.
Shares Table (Relationship)
Purpose: Ownership stakes between entities.
Business Purpose Table (Relationship)
Purpose: Business purpose for relationships.
Additional Information Table (Relationship)
Purpose: Additional information relationship details.