Bulk Data

Data Format

Our data is delivered in two sets of files: entities and relationships. Within each file, a row describes one entity or relationship.

Note: Any entity or relationship may have multiple attributes of the same type; for example, an entity may have multiple addresses (physical, mailing, etc.). Accordingly, all attributes are included as arrays.
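As a minimal sketch of what "all attributes are included as arrays" means for a consumer, the snippet below reads every value of one attribute type from a row. The row is modeled as a plain Python dict, and the `address` key with its inner `value` and `type` fields is a hypothetical shape chosen for illustration, not the documented schema.

```python
from typing import Any

# Hypothetical entity row. The "address" key and the inner "value"/"type"
# fields are assumptions for illustration, not the documented schema.
row: dict[str, Any] = {
    "entity_id": "abc123",
    "address": [
        {"value": "1 Main St, Springfield", "type": "physical"},
        {"value": "PO Box 42, Springfield", "type": "mailing"},
    ],
    "name": [],
}


def attribute_values(row: dict[str, Any], attribute: str) -> list[str]:
    """Return every value of one attribute type.

    Because attributes are arrays, the caller always gets a list back:
    a missing, null, or empty attribute yields an empty list.
    """
    return [item["value"] for item in (row.get(attribute) or [])]
```

Treating every attribute as a list, even when it holds a single value, avoids special cases for entities that have one address versus several.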


Like an entity profile in the Sayari suite of products, a row in an entity file describes a single entity, including its attributes, risk factors, and summary information.

Summary information includes properties that describe the entity, as outlined below:

| Field | Type | Description |
| entity_id | string | Primary key |
| type | string | Entity type, see Entities |
| label | string | Most commonly reported name |
| label_en | string | Most commonly reported American Standard Code for Information Interchange (ASCII) name |
| closed | boolean | Whether an entity is closed |
| degree | long | Number of unique neighboring entities |
| edge_counts | map | Number of neighbors per edge type |
| sanctioned | boolean | See Sanctioned |
| pep | boolean | See Politically Exposed Person (PEP) |
| sources | array[string] | List of data sources an entity was referenced in |
| num_documents | long | Number of source documents an entity was referenced in |
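The summary fields above can be modeled as a small typed record. This is only a sketch: the Python types are a best-guess mapping from the listed types (`long` to `int`, `map` to `dict`), the assumption that `edge_counts` maps an edge type name to an integer is not confirmed by this page, and every sample value (IDs, entity types, edge types, source names) is made up.

```python
from dataclasses import dataclass


@dataclass
class EntitySummary:
    """Summary fields of one entity row (types are assumed mappings)."""
    entity_id: str
    type: str
    label: str
    label_en: str
    closed: bool
    degree: int
    edge_counts: dict  # assumed shape: edge type name -> neighbor count
    sanctioned: bool
    pep: bool
    sources: list
    num_documents: int


def flagged(entities, min_degree=0):
    """Keep entities that are sanctioned or PEPs and have at least min_degree neighbors."""
    return [e for e in entities if (e.sanctioned or e.pep) and e.degree >= min_degree]


# Made-up sample rows for illustration only.
entities = [
    EntitySummary("e1", "company", "Acme Ltd", "Acme Ltd", False, 12,
                  {"shareholder_of": 12}, True, False, ["source_a"], 30),
    EntitySummary("e2", "company", "Beta LLC", "Beta LLC", True, 40,
                  {"director_of": 40}, False, False, ["source_b"], 55),
    EntitySummary("e3", "person", "Jane Doe", "Jane Doe", False, 2,
                  {"director_of": 2}, False, True, ["source_a", "source_b"], 4),
]
```

For example, `flagged(entities, min_degree=5)` keeps only `e1` here, since `e3` is a PEP with too few neighbors and `e2` carries neither flag.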


A row in a relationship file describes a single relationship, including its attributes and summary information. Each relationship connects two entities (i.e., vertices), which are identified by their entity_ids.

| Field | Type | Description |
| src | string | entity_id of the tail vertex |
| dst | string | entity_id of the head vertex |
| type | string | Relationship type, see Relationships |
| from_date | string | Start date of a relationship |
| date | string | As-of date of a relationship |
| to_date | string | End date of a relationship |
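A short sketch of how relationship rows might be consumed, under two assumptions this page does not confirm: that the date fields are ISO 8601 strings (`YYYY-MM-DD`), which compare correctly as plain strings, and that a missing date is represented as `None` and means the bound is open. The rows, entity IDs, and relationship type names are made up.

```python
from collections import defaultdict

# Made-up relationship rows; dates are assumed to be ISO 8601 strings or None.
relationships = [
    {"src": "A", "dst": "B", "type": "shareholder_of",
     "from_date": "2015-01-01", "date": None, "to_date": "2019-12-31"},
    {"src": "A", "dst": "C", "type": "director_of",
     "from_date": "2018-06-01", "date": None, "to_date": None},
    {"src": "B", "dst": "C", "type": "shareholder_of",
     "from_date": None, "date": "2021-03-15", "to_date": None},
]


def active_on(rel, day):
    """True if day falls within [from_date, to_date]; a None bound is treated as open."""
    start, end = rel.get("from_date"), rel.get("to_date")
    return (start is None or start <= day) and (end is None or day <= end)


def neighbors(rels):
    """Map each entity_id to the set of entity_ids it is connected to, ignoring direction."""
    adjacent = defaultdict(set)
    for rel in rels:
        adjacent[rel["src"]].add(rel["dst"])
        adjacent[rel["dst"]].add(rel["src"])
    return adjacent
```

Counting `len(neighbors(relationships)[entity_id])` gives the number of unique neighbors in this sample. Whether that matches the delivered `degree` value depends on how direction and relationship types are counted, which this sketch does not attempt to reproduce.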

