Rich Attributes
Our Rich Attributes expand on the standard details about a Place to provide even more meaningful metadata. More granular category tags. Open hours. Closed On Dates. And much more.
Contents
- Column Definitions for definitions and in depth details
- Key Concepts relevant to our Places data
- Known Artifacts for any nuance to the dataset
Helpful Links
Column Definitions: Rich Attributes
category_tags
category_tags
category_tags
provide higher-granularity category information beyond what can be found in a NAICS code label. So instead of just "Full-Service Restaurant" (NAICS 722511), we'll also provide tags like 'Pizza', 'Lunch', 'Dinner', 'Drive Through', and 'Late Night' so that you can glean more meaningful details about that specific restaurant. Trying to find a place that serves coffee? Our 'coffee shop' tag spans more than 30 NAICS categorizations, giving you a much easier method to pinpoint specific places based on granular inputs.- Category information is conveyed as a list of descriptive words about the POI. e.g
['Mexican Food, Dinner]
- Category tags are broadly available across NAICS codes though not every NAICS code will have them. Here is the full list of possible tags for each NAICS code. SafeGraph strives to label all relevant tags and will include up to 13 tags for any given POI.
domains
domains
The top-level domain(s) for the website(s) associated with the POI as provided and maintained by the POI itself. These will be provided as the domain names relevant to the business or operation in place at a given location. If there are multiple relevant domains (e.g. a unique website for a hotel but also the umbrella hotel chain website) we will include as many as relevant. Prior to June 2023, this had been known as the websites
column.
website
website
The web URL for the POI's publicly available website. This will only be populated if a website exists specifically dedicated to the place, not just if for example there is an overarching corporate website. This attribute was introduced in the June 2023 release.
phone_number
phone_number
This is a 10 digit phone number in the US and Canada or a 12 digit phone number in Great Britain. We filter out toll-free numbers (e.g. 1-800) and strive to have POI-specific numbers (not franchise-level or corporate-level numbers).
open_hours
open_hours
The new format for open hours is a JSON string with days as keys and opening & closing times (in the POI's local time) as values.
- Each JSON string is guaranteed to have all 7 days as keys
- We indicate that a POI is closed for the day by giving it a value of "[]"
- We indicate that a POI is open the entire day by using a format like: `
- "Thu": [["0:00", "24:00"]]`
- For POI that open and close multiple times throughout the day (e.g. a restaurant open in the morning and evening but not midday), we list multiple opening/closing pairs. For example:
“Sat": [["8:00", "13:00"], ["15:00", "22:30"]]
- This indicates that a POI is open from 8 am to 1 pm and also from 3 pm to 10:30 pm on Saturday.
- For POI that open and close on different days (e.g. a bar which opens on Tuesday at 6 pm and closes on Wednesday at 2 am), we use a format like:
"Tue": [["18:00", "24:00"]], "Wed": [["0:00", "2:00"]]
opened_on
, closed_on
, tracking_closed_since
opened_on
, closed_on
, tracking_closed_since
These columns reflect the dates surrounding the known openings or closings of certain businesses. See our commentary on Determining when POI Open and Close for detailed information on our approach.
store_id
store_id
Store_id is the unique ID associated with a store as provided and maintained by the store/brand itself. This is a premium column only applicable to places with a safegraph_brand_id
. Most store_ids
can be found directly on store locators, but in some cases, the store_id
is embedded within the store locator URL for the specific store. Note that there is no single source of truth for store_ids
, and some first party datasets may not define store_id
in the same manner which SafeGraph does, but we strive to provide the most widely used concept of store_id
.
Typically, store_ids
are alphanumeric codes unique to each location. However, they are not always alphanumeric. For example the store number for this store is 1615.
store_id
is especially useful as join key when working with transaction data. For example, “TJ256Y8” may be the only location specific information within a transaction dataset. A Places dataset which also contains "TJ256Y8" as a store_id
enables a join to contextualize transaction data (or other internal, store-level data) with SafeGraph places information.
Determining POI Open/Close (opened_on
, closed_on
, tracking_closed_since
)
opened_on
, closed_on
, tracking_closed_since
)opened_on
and closed_on
dates are determined from metadata at the source level. If a new POI from an existing source repeatedly appears in our build pipeline, it is flagged as opened_on
during the month in which it first appears. Similarly, if a POI from an existing source repeatedly disappears in our build pipeline, it is flagged as closed_on
during the month in which it first disappears. opened_on
dates are only inferred for POIs with a safegraph_brand_id
whereas closed_on
dates are attempted for both branded and non-branded POIs. POIs with an opened_on
or closed_on
date have been determined to be accurate within a ~60 day margin of error.
Open/Close features are currently supported in the following 11 countries:
iso_country_code
= 'US'iso_country_code
= 'CA'iso_country_code
= 'GB'iso_country_code
= 'DE'iso_country_code
= 'FR'iso_country_code
= 'ES'iso_country_code
= 'IT'iso_country_code
= 'MX'iso_country_code
= 'JP'iso_country_code
= 'CN'iso_country_code
= 'SG'
For supported countries, if a POI has not yet been sourced consistently enough to provide the metadata needed to determine closed_on
dates, then it will have a null value in the tracking_closed_since
column. Outside of supported countries, the tracking_closed_since
column should be ignored. In general, the SafeGraph Places product tracks opened_on
and closed_on
dates from as early as 2019-07 onward, and therefore, the majority of POIs that have a tracking_closed_since
date will show a value of '2019-07.'
We know that some POIs are permanently closed but are not confident in the exact closure date. These POIs show closed_on
= '1900-01' to denote this uncertainty.
Note that temporary closures are not captured in open/close tracking. It became difficult to distinguish permanent closures from temporary closures at the onset of COVID-19, and this resulted in a relatively low count of POIs with closed_on
values between "2020-03" and "2020-06" as we erred towards the side of caution to not mistakenly mark temporarily closed businesses as permanently closed.
As we caught up on permanent closures post COVID and invested in open/close model improvements, the following spikes are reflected in closed_on
dates - which may extend past a ~60 day window of actual closure:
- March 2022 (
closed_on
= '2022-03')- Covid catch-up
- April 2022 (
closed_on
= '2022-04')- Covid catch-up
- May 2023 (
closed_on
= '2023-05')- Model improvement
- March 2024 (
closed_on
= '2024-03')- Model improvement
Place Format
The place_format
column applies to specific brands or categories that adhere to a widely adopted classification system that specifies the POI's layout or format type. Each brand or category with this feature includes a dictionary for its values.
Place Format Definitions
Shopping malls (US and Canada only; naics_code
= 531120)
Value | Description |
---|---|
Super-Regional Center | The largest format regional centers typically with 4+ anchor tenants. Super Regionals have all of the attributes of a Regional Center but are larger and more of a destination location. Sometimes Power, Community, Lifestyle or Neighborhood components are incorporated on the periphery of the development. |
Regional Center | General shopping goods merchandise, traditional mass market or fashion oriented department stores and discount department stores. Regional centers are typically enclosed with the exceptions being in year-round temperate climates. |
Power | Dominated by 5+ anchors including discount department stores, warehouse clubs and category killers. The proportion of small format tenants is less than other types of anchored centers. Components of Lifestyle and Neighborhood centers are often incorporated into Power Center layouts. |
Lifestyle | Typically associated with upscale neighborhoods or high-end retail. Lifestyle centers are characterized by a village-like layout with parking, dining, and entertainment complimentary to the upscale ambiance and conducive to casual browsing. Often anchored by upscale department stores or grocers though no anchor is required. |
Outlet | Outlet stores and quick service restaurants. |
Community | General merchandise; Convenience. |
Neighborhood | Convenience. |
Festival / Entertainment | Leisure; tourist‐oriented; retail and service. |
Strip / Convenience | Convenience, restaurant, general merchandise and services. |
Mixed | Mix of commercial office space and retail and service. |
Airport | Clustered outlets inside of regional and international airports. |
Updated 3 months ago