Taobao product data API
ShopAPIS extracts structured product data from Taobao — China’s largest consumer marketplace — and returns it as clean, normalized JSON. For any Taobao listing you get title, live and historical promo price, full SKU variant matrix (size/color), monthly sales volume (“月销”), seller DSR rating, review text with image reviews, shipping origin and the full image gallery, without operating your own China-based scraping stack.
Taobao is the deepest long-tail catalog on the planet, with an estimated 895M monthly app users and roughly $724B GMV in 2024 (both figures are estimates, since Alibaba does not disclose per-app numbers). Together with Tmall it accounts for ~44% of Chinese domestic e-commerce.
Why Taobao data is hard to get
Taobao runs one of the heaviest anti-bot stacks in commerce. Listings sit behind GeeTest v4 slider CAPTCHAs, behavioral device fingerprinting and adaptive challenge difficulty, while the Great Firewall (GFW) geo-blocks and throttles requests from outside mainland China. Prices and variants render client-side from obfuscated, frequently rotating endpoints with server-side token verification. Alibaba’s Open Platform exists but is gated and does not expose competitor catalogs. ShopAPIS handles this with in-region infrastructure, mobile fingerprints and slider solvers: you query a REST endpoint, and we return the JSON.
Data fields returned
- Identity — item ID, title, category path, brand, shop ID
- Price — current price, original price, promo/coupon price, currency (CNY)
- Variants — SKU matrix (color, size, spec), per-SKU price and stock
- Availability — in-stock flag, stock count where exposed
- Sales signal — monthly sales volume (“月销”), cumulative sold
- Seller — shop name, seller DSR rating (description/service/logistics), location
- Reviews — review count, average rating, review text, image reviews
- Media — main image gallery, SKU thumbnails
- Logistics — ships-from origin, shipping fee
Sample response
{
  "platform": "taobao",
  "item_id": "678901234567",
  "url": "https://item.taobao.com/item.htm?id=678901234567",
  "title": "纯棉短袖T恤 男士夏季宽松圆领",
  "brand": "MUJI风",
  "category_path": ["服饰", "男装", "T恤"],
  "price": { "current": 59.00, "original": 99.00, "coupon_price": 49.00, "currency": "CNY" },
  "monthly_sales": 8421,
  "skus": [
    { "sku_id": "5001-白色-M", "color": "白色", "size": "M", "price": 59.00, "stock": 312 },
    { "sku_id": "5001-黑色-L", "color": "黑色", "size": "L", "price": 59.00, "stock": 0 }
  ],
  "seller": { "shop_name": "优衣良品旗舰店", "shop_id": "112233445", "dsr": { "description": 4.9, "service": 4.8, "logistics": 4.9 }, "location": "浙江 杭州" },
  "reviews": { "count": 23107, "rating": 4.8, "with_images": 1840 },
  "ships_from": "浙江 杭州",
  "images": ["https://img.alicdn.com/.../1.jpg", "https://img.alicdn.com/.../2.jpg"]
}
Markets and domains covered
Taobao is China-domestic: primary domains taobao.com and item.taobao.com, served in Simplified Chinese with CNY pricing. ShopAPIS resolves listings by item ID or full URL and returns both raw Chinese fields and normalized values.
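Because a listing can be referenced either by item ID or by full URL, client code often wants to reduce both forms to one key before caching or deduplicating. The following is a minimal sketch of that step, not part of the ShopAPIS API: the helper name extract_item_id is hypothetical, and it assumes listing URLs carry the ID in an id query parameter, as the sample URL does.

```python
from urllib.parse import parse_qs, urlparse


def extract_item_id(ref: str) -> str:
    """Return the numeric item ID from a bare ID or a Taobao listing URL.

    Hypothetical client-side helper; assumes the ID lives in the `id`
    query parameter, as in https://item.taobao.com/item.htm?id=...
    """
    ref = ref.strip()
    # A bare ID is just a run of ASCII digits.
    if ref.isascii() and ref.isdigit():
        return ref

    parsed = urlparse(ref)
    host = parsed.hostname or ""
    # Accept taobao.com and its subdomains (for example item.taobao.com).
    if host != "taobao.com" and not host.endswith(".taobao.com"):
        raise ValueError(f"not a Taobao URL: {ref!r}")

    ids = parse_qs(parsed.query).get("id", [])
    if ids and ids[0].isascii() and ids[0].isdigit():
        return ids[0]
    raise ValueError(f"no item ID found in {ref!r}")


if __name__ == "__main__":
    print(extract_item_id("https://item.taobao.com/item.htm?id=678901234567"))
```

Both the bare ID and the URL form resolve to the same string, so either can serve as a cache key.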
Use cases
- Price-floor benchmarking — Taobao’s promo and coupon-stacking prices set the lowest market reference for many SKUs.
- Variant and review mining — the richest size/color matrices and image-review corpus in e-commerce, ideal for review analytics.
- Competitive intelligence — track seller assortment and sales velocity (/solutions/competitive-intelligence).
- Catalog enrichment — pull titles, images and specs to enrich your own listings.
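As an illustration of the price-floor and availability use cases, the sketch below derives a lowest visible price and a stock summary from one listing. It assumes the field names shown in the sample response (price.current, price.original, price.coupon_price, skus[].stock); the function name summarize_listing and the output keys are hypothetical, and the inline sample is a trimmed copy of the sample response.

```python
def summarize_listing(listing: dict) -> dict:
    """Derive a price floor and stock summary from one listing response."""
    price = listing["price"]
    # Consider every price the response exposes; skip absent or null fields.
    candidates = [
        price[key]
        for key in ("current", "original", "coupon_price")
        if price.get(key) is not None
    ]
    floor = min(candidates)

    original = price.get("original")
    discount_pct = (
        round((original - floor) / original * 100, 1) if original else None
    )

    skus = listing.get("skus", [])
    # Stock may be missing where the platform does not expose it; treat as 0.
    in_stock = [sku["sku_id"] for sku in skus if (sku.get("stock") or 0) > 0]

    return {
        "item_id": listing["item_id"],
        "floor_price": floor,
        "currency": price.get("currency"),
        "discount_pct": discount_pct,
        "in_stock_skus": in_stock,
        "sold_out_skus": len(skus) - len(in_stock),
    }


# Trimmed copy of the sample response, for illustration only.
sample = {
    "item_id": "678901234567",
    "price": {"current": 59.00, "original": 99.00, "coupon_price": 49.00, "currency": "CNY"},
    "skus": [
        {"sku_id": "5001-白色-M", "price": 59.00, "stock": 312},
        {"sku_id": "5001-黑色-L", "price": 59.00, "stock": 0},
    ],
}

if __name__ == "__main__":
    print(summarize_listing(sample))
```

For the sample listing the floor is the coupon price of 49.00 CNY, with one SKU in stock and one sold out.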
Taobao listings are CNY-priced and Chinese-language. ShopAPIS returns original fields plus normalized translations, but cross-border buyers should map prices to their target currency at query time.
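One way to do that mapping client-side is sketched below. It assumes the caller supplies the exchange rate from its own FX source; the function name convert_cny is hypothetical, and the 0.14 rate in the example is a placeholder rather than a real quote. Decimal is used so that rounding to minor units is explicit.

```python
from decimal import ROUND_HALF_UP, Decimal


def convert_cny(amount_cny, rate, places: str = "0.01") -> Decimal:
    """Convert a CNY amount to a target currency at a caller-supplied rate.

    `rate` is target-currency units per 1 CNY. Inputs are passed through
    str() so float prices such as 59.0 do not carry binary rounding noise.
    """
    value = Decimal(str(amount_cny)) * Decimal(str(rate))
    # Round to the target currency's minor unit (two decimals by default).
    return value.quantize(Decimal(places), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    # Placeholder rate for illustration only; fetch a real one at query time.
    print(convert_cny(59.00, "0.14"))
```

Currencies without minor units can pass places="1" to round to whole units.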