Flintmere · FlintmereBot · v1.0

Watch us read your storefront, FlintmereBot.

The bot, in plain text. The user-agent it carries. What it reads. What it ignores. The leash. The opt-out. Where the data ends up.

the passport

User-agent string

RFC 7231–compliant. The +URL resolves to this page so any admin reading their access log can verify who we are.

what flintmerebot extracts

Watch a parser read three documents.

FlintmereBot reads three public surfaces on a Shopify storefront: robots.txt, /products.json, and JSON-LD on individual product pages. From these we extract identifiers like GTIN, structured fields like brand, and prices.
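To make the JSON-LD step concrete, here is a minimal sketch of how a parser could pull a GTIN, a brand, and a price out of a product page. It is an illustration, not FlintmereBot's actual code: the names `JsonLdCollector` and `extract_product_fields` are hypothetical, and the sketch assumes the page carries a schema.org `Product` node at the top level of a JSON-LD block.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collects the text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def extract_product_fields(html):
    """Return gtin, brand and price from the first top-level Product node."""
    collector = JsonLdCollector()
    collector.feed(html)
    for raw in collector.blocks:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue  # skip malformed blocks rather than fail the whole page
        candidates = data if isinstance(data, list) else [data]
        for node in candidates:
            if not isinstance(node, dict) or node.get("@type") != "Product":
                continue
            # schema.org allows several GTIN property names; take the first present.
            gtin_keys = ("gtin", "gtin8", "gtin12", "gtin13", "gtin14")
            gtin = next((node[k] for k in gtin_keys if k in node), None)
            # brand may be a plain string or a nested Brand object.
            brand = node.get("brand")
            if isinstance(brand, dict):
                brand = brand.get("name")
            # offers may be a single Offer or a list of them.
            offers = node.get("offers")
            if isinstance(offers, list):
                offers = offers[0] if offers else None
            price = offers.get("price") if isinstance(offers, dict) else None
            return {"gtin": gtin, "brand": brand, "price": price}
    return None
```

Calling `extract_product_fields(page_html)` on a page with no product markup returns `None`, which is itself a useful signal: the storefront exposes no structured product data on that page.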

From your /robots.txt

the negative space

What we never touch.

  1. We never sign in.

  2. We never submit forms.

  3. We never read data behind authentication.

  4. We never crawl customer or order pages.

the leash

Polite by spec.

  1. between requests

  2. URLs fetched

  3. revisit

the off-switch

Two lines. Twenty-four hours.

User-agent: FlintmereBot
Disallow: /

We respect robots.txt. New directives are picked up within 24 hours. To remove existing data from the benchmark entirely, contact us via our contact form (Privacy topic) — we reply within two working days.
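If you want to confirm the two lines do what you expect before deploying them, a quick local check with Python's standard `urllib.robotparser` is one option. This is a sketch: the helper name `is_blocked` and the default path `/products.json` are our choices for illustration, and the standard-library parser's matching rules may differ in edge cases from any particular crawler's.

```python
from urllib.robotparser import RobotFileParser


def is_blocked(robots_txt, user_agent="FlintmereBot", path="/products.json"):
    """Return True if robots_txt disallows user_agent from fetching path."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, path)


rules = "User-agent: FlintmereBot\nDisallow: /\n"

print(is_blocked(rules))                          # FlintmereBot is shut out
print(is_blocked(rules, user_agent="Googlebot"))  # other crawlers are unaffected
```

Because the rule names FlintmereBot specifically, it leaves every other crawler's access unchanged.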

what we publish

We publish a vertical's median, never a store's name.

Naming individual stores in a league table would be unfair and unhelpful. The research at /research is drawn from these aggregates — vertical medians, distributions, and the single biggest catalog mistake by category. Never a list of stores by grade.
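As an illustration of what aggregating to a vertical's median means, the sketch below groups per-store scores by vertical and keeps only the median. The record fields (`store`, `vertical`, `score`), the function name `vertical_medians`, and the sample values are all hypothetical; they are not real benchmark data or our production pipeline.

```python
from collections import defaultdict
from statistics import median


def vertical_medians(records):
    """Group scores by vertical and return one median per vertical.

    The store name is read from nowhere and written nowhere: only the
    vertical label and the aggregated number survive.
    """
    by_vertical = defaultdict(list)
    for record in records:
        by_vertical[record["vertical"]].append(record["score"])
    return {vertical: median(scores) for vertical, scores in by_vertical.items()}


# Hypothetical input rows, invented for this example.
records = [
    {"store": "store-a", "vertical": "apparel", "score": 60},
    {"store": "store-b", "vertical": "apparel", "score": 80},
    {"store": "store-c", "vertical": "apparel", "score": 70},
    {"store": "store-d", "vertical": "homeware", "score": 50},
    {"store": "store-e", "vertical": "homeware", "score": 60},
]

print(vertical_medians(records))
```

The output maps each vertical to a single number, so nothing in it can be traced back to an individual store.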
