In skincare the buying decision hinges on what's inside and what it does. Here's how to turn INCI strings and active ingredients into structured attributes and content — across a mixed delivery of GDSN brands and indie Excel.
In most retail categories, a customer buys a brand, a price and a look. In skincare, they buy an effect. A serum sells because it contains 10% niacinamide, because it suits oily and blemish-prone skin, because it targets fine lines. The product data that closes the sale is not the SKU or the price — it's the ingredients and what they do. And that's exactly the data that arrives in the worst possible shape.
Product data for skincare is data about ingredients and efficacy: INCI lists, active concentrations, skin type and effect — the attributes that actually drive the purchase. This is a focused corner of the broader beauty & cosmetics data challenge, and it has its own specific pain: the most valuable information hides inside a legally-formatted ingredient string that no shop filter can read as delivered.
The core skincare attributes are unusually rich — and unusually unstructured on delivery:
None of this is a simple column in most supplier files. The job is to parse and derive it — to turn one INCI string into a set of structured, filterable attributes plus content that explains them. That's a consolidate, normalize and enrich problem with a heavy enrichment tail.
Skincare data arrives on two very different tracks, and most retailers carry both at once:
| Data layer | What GDSN delivers | Where it stops |
|---|---|---|
| Master data & GTIN | Clean for listed brands | Nothing for indie / naturals suppliers |
| INCI list | Often present as a raw string | Not parsed into filterable ingredients |
| Active concentration | Rarely a structured field | Buried in datasheets or absent |
| Skin type & effect | Not a GDSN attribute | Must be derived, not delivered |
| Sales content | Not the job of a data pool | Descriptions, benefit copy, SEO absent |
So even where GDSN does its job, it hands you a clean record — not a shoppable one. The efficacy story, the skin-type filters and the readable content all still have to be built. And for the indie longtail, even the clean record is missing.
The throughline is a three-step job, and skincare leans hardest on the middle step — and that's exactly what Productbay is built for:
The result: an INCI string becomes a set of searchable ingredient filters, a raw indie spreadsheet becomes a structured product with skin-type and effect attributes, and every SKU carries content that actually explains what it does. Productbay is built for specialist retailers running multi-supplier, multi-channel catalogs. For the full category picture, see product data in beauty & cosmetics retail.
INCI parsing, active ingredients, skin-type filters, efficacy content — across clean GDSN records and raw indie spreadsheets. See how Productbay structures skincare attributes and drafts content in a 30-minute walkthrough.
Get started