Attributes Data

Cortex extends the value of your business’s unique data by enabling anyone to transform that data into Machine Learning Pipelines. The types of data supported by Cortex fall into five categories: events, attributes, subscriptions, items, and ID mappings.

Dataset	Description	Example
Events *	Timestamped actions taken by your users.	Customer ABC completes a purchase event at time T.
Attributes	Characteristics or traits of your users.	Customer ABC has job title ‘Professor’, and age 49.
Subscriptions	Records of which users subscribed to which offerings and when.	Customer ABC has subscription ‘Premium’ with start time T and end time null.
Items	Metadata attributes for the items that your users interact with, via events.	Item XYZ has category ‘shoes’, and price 49.99.
ID Mappings	Associations between one set of IDs and another.	Customer ABC has cookie ID 123.

* required

In this guide, we’ll provide an overview of Attributes data and discuss the various ways that it may be ingested into Cortex.

What are attributes?

Attributes provide information about the characteristics or traits of your users. Attributes data is optional in Cortex, but recommended — the more data at Cortex’s disposal, the wider the set of predictions you can make, and the more accurate your predictions will be.

Attributes data can include any descriptor of the users tied to your events data. For example, if you’re tracking customer purchase behavior via Events data, you may supplement that data with user attributes from a Customer Data Platform (e.g. age, gender, geo, loyalty status, etc.).

The below table shows what a sample of Attributes data might look like for a set of customers. Note that Cortex must be able to match each user’s attributes with their events, so please be sure to either (a) reference the same set of IDs, or (b) send a separate ID Mapping data source into Cortex.

user_id	age	location	loyalty_status
1asdvn329	41	US-CA	gold
d189dhl99	27	US-MA	silver
as1k2f0f0	63	US-AK	none
0sdf10933	44	CN-BC	silver
kdfgf30gh	36	US-IL	gold

When you upload a file like this, Cortex stores the attributes with a timestamp indicating when each was ingested. This allows your pipelines to access a historical record of attribute values over time. For example, by recording that User 1asdvn329 was a paid member on January 1st but churned to a free member on January 8th, your pipeline will know the user’s membership type at the time of each of their events.

How do I send Attributes data into Cortex?

There are two ways that you can ingest Attributes data into Cortex:

1) Batch File Uploads

Attributes data can be batch uploaded to Cortex in files exported from, say, your data lake or CDP vendor. To do this, schedule a recurring file upload into a cloud-based directory hosted by either you or Cortex. Cortex provides flexibility around both the format of these files (CSV, JSON, Parquet, etc.) as well as the type of directory into which they’re uploaded (Amazon S3, Google Cloud, Azure, etc.).

2) Direct Integrations

Cortex integrates with several third party platforms (CDPs, data warehouses, analytics providers, etc.) commonly used for enterprise data storage. For more information about the data source integrations supported by Cortex, click here.

Attributes Data

How Can We Help?

Attributes Data

What are attributes?

How do I send Attributes data into Cortex?

1) Batch File Uploads

2) Direct Integrations

Related Links