Paper is a spec product: format, grammage and whiteness decide the sale. Here's how those attributes get structured, where BMEcat helps, and where AI takes over.
A ream of copy paper looks like the simplest product in the catalog. It isn't — not as data. The customer who buys it never reads a description; they filter for DIN A4, 80 g/m², high whiteness, FSC-certified, and buy whatever matches. The entire sale hinges on a few technical attributes being present, correct and comparable. And that is exactly where paper ranges fall apart online.
Product data for paper and forms is attribute data first: DIN format, grammage in g/m² and whiteness carry the buying decision, not brand or marketing copy. This is a sub-topic of office supplies more broadly, but paper deserves its own look because it is the most attribute-driven, most filter-dependent part of the assortment.
Almost the entire value of a paper record sits in a compact set of standardized attributes. Get these clean and the product is findable and filterable; get them wrong and it's invisible:
The trouble is never the standard — DIN and g/m² are unambiguous. The trouble is that suppliers deliver the values inconsistently: one writes "80g", another "80 g/m²", a third puts the grammage in the title and leaves the attribute field empty. Multiply that across dozens of suppliers and your filter facets fill with noise.
Office and paper supply has a genuine exchange standard: BMEcat, the B2B catalog format widely used in this sector. Where a supplier ships a clean BMEcat file with proper feature groups, format and grammage arrive as structured, typed attributes — which is a real head start. But it's worth being honest about the coverage:
| Data layer | What BMEcat / feeds deliver | Where it stops |
|---|---|---|
| Format & grammage | Structured attributes when the supplier fills the feature group | Free-text or title-buried values in weaker feeds |
| Supplier coverage | Established suppliers ship valid BMEcat | Many still send Excel / PDF price lists |
| Whiteness / certification | Sometimes present | Frequently blank or non-standardized |
| Sales content | Not the job of an exchange format | Descriptions, SEO text, benefit copy absent |
| Form-specific attributes | Basic classification | Ply count, perforation, printer compatibility thin |
So BMEcat solves the well-organized part of the range — the established suppliers who fill their feature groups properly. What it doesn't cover is the supplier who still sends a PDF, the half-populated attribute fields, the whiteness left blank, and every bit of sales content. For how classification standards fit together more broadly, see GDSN, ETIM and eCl@ss explained.
The job is the same three steps every multi-supplier retailer runs — and for paper the enrichment step is unusually high-leverage because so much value sits in a few normalizable attributes. That's exactly what Productbay is built for:
Productbay starts where BMEcat and the supplier feed stop: the messy suppliers, the empty attribute fields, the form-specific specs and the sales content no standard carries. For the broader picture, see product data in office supplies. Productbay is built for specialist retailers running multi-supplier, multi-channel catalogs — from mid-sized shops to large chains. To dig into the normalization mechanics, read how we enrich and normalize data from multiple suppliers.
Grammage in three different notations, formats hidden in titles, forms with ply counts — paper is all attributes. See how Productbay normalizes them into clean filters in a 30-minute walkthrough.
Get started