January-2020 Release Notes
SafeGraph's New Year's Resolution 2️⃣0️⃣2️⃣0️⃣ - Make SafeGraph data your favorite thing about coming to work.
Welcome to the January-2020 Places release notes (v2019-12-23/1577135031) shipped 2020-01-06.
Highlights
- New Column Available:
category_tags
provides enhanced, high-resolution category information on Food & Drink Services POI, taking category-based analysis, visualization, and querying to the next level. Ask your SafeGraph rep to addcategory_tags
to your deliveries, or try it out from the SafeGraph Data Bar. - SafeGraph Places now has 6,094,106 places in the United States and Canada (+14,833 from last release).
- This month SG Patterns has 1,073,278,638 visits from 40,577,097 visitors (up +117,021,790 visits, +6,747,731 visitors from last month!). 🎄 🕎kinara ❄️ 👪 🎁 📈
- 101 new brands, such as Marathon (marathonbrand.com) with 5,688 US places.
Table of Contents:
Enhancements - Core Places and Brands
- Last month SG Places had 6,079,273 points-of-interest. This month SG Places has 6,094,106 points-of-interest (net + 14,833 places). These are +14,835
US
Places and -2CA
places. - Enhanced Category Resolution. This is big. 👍
- First, we like NAICS because it is a government- and industry-standard used throughout the world to categorize businesses and points-of-interest. However, it has some shortcomings. For example, the 6-digit NAICS 722511 covers all "Full-Service Restaurants" under one code. This means that as far as NAICS is concerned, Le Bernardin 🐟, Ruth's Chris Steak House 🥩, and your local family-owned Chinese food restaurant 🍜 are all the same type of place. This is unsatisfying for SafeGraph, and for many SafeGraph customers, including anyone supporting local search, category-based visualizations, or competitive retail intelligence. Seemingly simple queries like "show me the nearest Mexican restaurants" are very difficult when the only category information available is the NAICS code.
- Today we are super excited to launch
category_tags
as a major augmentation to the category information available in SafeGraph Core Places.category_tags
is focused on providing more detailed category information for food-and-drink-related places. For POI withnaics_code
starting 722 ("Food Services and Drinking Places"), SafeGraph now provides a comma-separated list of human-readable "tags" forming a detailed description of the place. See below for some examples of the power ofcategory_tags
. - (For a full list of all possible category_tag values, see the Places Manual: Category Tags)
- (For information on how this new column affects the column-order of your new deliveries, see the Places Manual: Column Ordering)
- Examples of the new column
category_tags
:
location_name | naics_code | top_category | sub_category | category_tags (NEW) |
---|---|---|---|---|
Le Bernardin | 722511 | Restaurants and Other Eating Places | Full-Service Restaurants | Bar or Pub,Cocktail Lounge,Fine Dining,French Food,Late Night |
Ruth's Chris Steakhouse | 722511 | Restaurants and Other Eating Places | Full-Service Restaurants | Dinner,Drinks,Fine Dining,Seafood,Steak House,Wine Bar |
Shanghai Dumpling Shop | 722511 | Restaurants and Other Eating Places | Full-Service Restaurants | Chinese Food |
McDonald's | 722513 | Restaurants and Other Eating Places | Limited-Service Restaurants | Breakfast,Burgers,Counter Service,Dinner,Drive Through,Fast Food,Lunch |
Baygreens | 722513 | Restaurants and Other Eating Places | Limited-Service Restaurants | American Food,Brunch,Salad |
-
We've added net 101 new brands 🎊
New Brands Include...- Marathon (marathonbrand.com), SG_BRAND_faaaac9cb18c500a97c03eec92d6b8fc) with 5688 US places.
- ZIPS Car Wash (zipscarwash.com), SG_BRAND_79b040f298b5ac9) with 162 US places.
- American 1 Credit Union (american1cu.org), SG_BRAND_6980414328db21c5) with 113 US places.
- Tiger Rock Martial Arts (tigerrockmartialarts.com), SG_BRAND_3819b4b88d9159eb) with 96 US places.
- Cookie Cutters Haircuts for Kids (haircutsarefun.com), SG_BRAND_10e24896f0b4091) with 89 US & 5 CA places.
- Ombudsman (ombudsman.com), SG_BRAND_f36b7f9b8c409989) with 70 US places.
- One Medical (onemedical.com), SG_BRAND_00f0efce83a7ff22) with 65 US places.
- Quality Plus (qualityplusnc.com), SG_BRAND_7b084b361448eb34) with 58 US places.
- Ascend Resort Collection (choicehotels.com/ascend), SG_BRAND_3087c2c140a9cb9b54e2ce2e739c00a7) with 56 US & 8 CA places.
- Children's Lighthouse (childrenslighthouse.com), SG_BRAND_134c5fe6e47f265e) with 54 US places.
- GUESS (guess.ca), SG_BRAND_44fc6c9812b78f5e76f9b25892fe6ad9) with 57 CA places. (NB: GUESS was already a SafeGraph Brand, but previously did not include coverage of Canadian locations).
- And 90 more!! 📈
-
Improved coverage for high-value categories like Airports, Airport Terminals, Casinos, and Mini Golf We are making a concerted effort to improve coverage for many non-branded POI. Here are some of those improved categories (NAICS) in this release:
- Airports Terminals/Concourses (488119) Net POI count change: 103. ✈️
- Casinos (713210) Net POI count change: 1100. 🎰
- Miniature Golf (713910) Net POI count change: 196 (154 in US, 51 in CA). ⛳
Bug Fixes and Known Issues - Core Places and Brands
-
We found some errors involving over-labeling of POI for some brands. In other words, we were creating branded POI incorrectly at some locations. These fixes resulted in significant decreases in the total number of POI for those affected brands. The new count is correct, and for transparency we'd like to list some of these fixes as examples in no particular order.
-
Sears Hardware Stores
, (SG_BRAND_a89b31a8e736119a). Net POI count change: -6 US. Bug: We discovered that Sears Hardware Stores is not a separate brand from Sears Hometown Stores (SG_BRAND_305b0a21405390e3
), so these have been merged into the latter. -
Petco
, (SG_BRAND_c5bc0c313e3f7af2ff0291d8846671ab). Net POI count change: -68 US. Bug: Incorrectly including subsidiaryUnleashed by Petco
locations (these were duplicates because locations are already accounted for with their own brandSG_BRAND_0c506e30249aff4bd40c279d2c1daac9
). -
We found some errors where we were missing some branded POI, and these fixes resulted in significant increases in the total number of POI for those affected brands. The new count is correct, and for transparency we'd like to list some of these fixes as examples in no particular order.
-
Weight Watchers (WW)
, (SG_BRAND_b0980641a37b38460ec96d65bd20fd9d). Net POI count change: +2,608 US +318 CA. Bug: Was incorrectly filtering out locations that had "studio" in the name, but these are true Weight Watchers POI. -
Hucks
, (SG_BRAND_26d489e6f425ebba). Net POI count change: 18. Bug: Improved our data sourcing for more complete coverage. -
Bad SGPID Churn -- Bad sgpid churn are undesired failures to maintain a consistent
safegraph_place_id
(sgpid) between releases (see discussion in March 2019 release). We internally track and estimate our performance in this domain and share these numbers in our release notes for maximum transparency. In this release:- We dropped 56,704 sgpids (8,754 branded and 47,950 non-branded).
- We added 71,537 sgpids (18,201 branded and 53,336 non-branded).
- Note: We intentionally dropped many POI as part of a general cleanup from some of our most noisy and duplicative sources. Most of these drops are duplicates being removed. Some percent of these are true openings and closings (or new brands); the remainder are bad sgpid churn. We are continuing to work on better metrics to distinguish these cases.
-
Category Fill Rate We monitor category fill rate with 3 metrics: (1) category fill rate across the entire dataset, (2) category fill rate for branded POI, (3) category fill rate in the brand_info file (brand-level categories). We want all of these numbers to be 100%.
- (1) All POI category fill rate. Last month 96.25%. This month 96.27%.
- (2) Branded POI category fill rate. Last month 100%. This month 100% 💯
- (3) Brand-level category fill rate (brand_info file). Last month 100%. This month 100%. 💯
Enhancements - Geometry
- Improved and additional cartography and polygons. New or improved polygon geometries for ~ 2900 POI in US and CA focused on improving branded POI (see how we are moving the needle on the percent of branded-no-parent polygons with polygon_class = OWNED). POI :diamond-shape:
Bug Fixes and Known Issues - Geometry
- Centroid-Radius Polygons -- As discussed in March 2019 release notes. We internally track centroid-radius polygons vs precise polygons and strive for 100% precise polygons. You can measure this yourself using the
is_synthetic
column. Last release was 93.8%, this release is 93.7%. Here is how we are tracking on that metric over recent releases: Centroid-Radius Polygon Tracking. - Percent polygon_class = OWNED (as described in Oct 2019 release notes. We examine
polygon_class
for allsafegraph_place_id
that are both (i) branded and (ii) do NOT have aparent_safegraph_place_id
; we call this group "branded, no-parent". We want 100% of "branded, no-parent" POI to havepolygon_class
= OWNED_POLYGON. Last month, the percent OWNED polygons for branded, no-parent was 75.6%. This month it is 76.1%. 👍 We continue to work on this. Here is how we are tracking on this metric in recent releases: OWNED vs SHARED Polygons in SafeGraph Places Release History.
Enhancements - Patterns
- In last month's delivery SG Patterns had 3,569,113 points-of-interest (US only). This month SG Patterns has 3,714,581 points-of-interest (US only) (net + 145,468 places) . 📈 😻
- Last month SG Patterns had 956,256,848 visits from 33,829,366 visitors. This month SG Patterns has 1,073,278,638 visits from 40,577,097 visitors (delta +117,021,790 visits, +6,747,731 visitors). 🎄 👪 📈
Also check out these new ways to get SafeGraph data:
* Need some extra data on other SafeGraph products? Check out the [SafeGraph Data Bar.](https://shop.safegraph.com/)
* Heavy AWS User? Check out our [listings in the AWS Data Exchange](https://aws.amazon.com/marketplace/search/results?filters=vendor_id&vendor_id=7d5ff8ca-105f-4856-9d99-5f2f1d83223c).
* Are you an Esri or ArcGIS user? Check out our FREE data [SafeGraph Places in the Esri Marketplace](https://marketplace.arcgis.com/listing.html?id=3425348e4bee4059af2b353e52df43c2).
* Or just drop us a line! Your data needs are our data delights!
p.s. **[SafeGraph Core Places & Geometry is now available in both US & Canada](https://docs.safegraph.com/changelog/october-2020-release-notes#section-canada-places-version-1-0-available-for-core-places-and-geometry-in-october-release)**