🕷️ Crawler Inspector

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 5 (from laksa099)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled

📄 INDEXABLE, CRAWLED 1 day ago
🤖 ROBOTS ALLOWED

Page Info Filters

| Filter | Status | Condition | Details |
| --- | --- | --- | --- |
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 0 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
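
The five conditions above can be sketched as a single function over one page record. This is only an illustration of the listed conditions, not the inspector's actual implementation: the function name `evaluate_filters`, the dict-based record, and the 183-day stand-in for `6 MONTH` are assumptions, and `fh_dont_index` is not shown on this page, so the example value of 0 is hypothetical.

```python
from datetime import datetime, timedelta
from typing import Optional


def evaluate_filters(page: dict, now: Optional[datetime] = None) -> dict:
    """Return a PASS/FAIL boolean per filter for one page record.

    Column names follow the Condition column; the record layout itself
    is an assumption made for this sketch.
    """
    now = now or datetime.now()
    # Assumption: "6 MONTH" is approximated as 183 days.
    cutoff = now - timedelta(days=183)
    canonical = page.get("meta_canonical")
    return {
        "HTTP status": page["download_http_code"] == 200,
        "Age cutoff": page["download_stamp"] > cutoff,
        "History drop": page.get("history_drop_reason") is None,
        "Spam/ban": page.get("fh_dont_index") != 1
        and page.get("ml_spam_score") == 0,
        "Canonical": canonical is None
        or canonical == ""
        or canonical == page["src_unparsed"],
    }


# Values taken from this page where shown; fh_dont_index is hypothetical.
example_page = {
    "download_http_code": 200,
    "download_stamp": datetime(2026, 4, 12, 18, 40, 50),
    "history_drop_reason": None,
    "fh_dont_index": 0,
    "ml_spam_score": 0,
    "meta_canonical": None,
    "src_unparsed": "cn,infoq!xie,/article/bf3c1215455491a1c741f34ac s443",
}

results = evaluate_filters(example_page, now=datetime(2026, 4, 13, 18, 40, 50))
indexable = all(results.values())
```

With the values shown on this page every filter passes, which is consistent with the INDEXABLE status reported above.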

Page Details

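| Property | Value |
| --- | --- |
| URL | https://xie.infoq.cn/article/bf3c1215455491a1c741f34ac |
| Last Crawled | 2026-04-12 18:40:50 (1 day ago) |
| First Indexed | 2025-10-02 23:35:41 (6 months ago) |
| HTTP Status Code | 200 |
| Meta Title | Architecture and Design of Large-Scale Advertising Systems_Java_加勒比海带_InfoQ Writing Community |
| Meta Description | Business background: As a digital platform grows, with the user base continuing to expand and behavioral data steadily accumulating, traffic and data have gradually become among the platform's most valuable assets. To further increase user lifetime value, strengthen the platform's operational capabilities, and at the same time build an independently controllable commercialization foundation |
| Meta Canonical | null |
| Boilerpipe Text | null |
| Markdown | ![Architecture and Design of Large-Scale Advertising Systems\_Java\_加勒比海带\_InfoQ Writing Community](https://static001.infoq.cn/static/infoq/img/logo-121-75.yuij86g.png) |
| Readable Markdown | null |
| Shard | 5 (laksa) |
| Root Hash | 3265164560527275005 |
| Unparsed URL | cn,infoq!xie,/article/bf3c1215455491a1c741f34ac s443 |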
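
The Unparsed URL value appears to encode the same address as the URL property, with the domain labels reversed. The sketch below maps that one observed shape back to a URL. The format is inferred solely from the single example on this page: the function name `unparsed_to_url` is hypothetical, reading `s443` as "HTTPS on port 443" is an assumption, and keys with other prefixes, no subdomain, or several subdomain labels are deliberately not handled.

```python
def unparsed_to_url(key: str) -> str:
    """Rebuild a URL from a key shaped like 'cn,infoq!xie,/path s443'.

    Assumed layout (inferred from one example only):
      <reversed domain labels, comma-separated>!<subdomain>,<path> s<port>
    """
    location, port_tag = key.rsplit(" ", 1)
    if not port_tag.startswith("s"):
        # Only the 's<port>' form is seen on this page; others are unknown.
        raise ValueError(f"unrecognised port tag: {port_tag!r}")
    port = int(port_tag[1:])

    domain_part, rest = location.split("!", 1)
    subdomain, path = rest.split(",", 1)

    # 'cn,infoq' -> ['infoq', 'cn'], then prepend the subdomain label.
    host = ".".join([subdomain] + domain_part.split(",")[::-1])
    netloc = host if port == 443 else f"{host}:{port}"
    return f"https://{netloc}{path}"


rebuilt = unparsed_to_url("cn,infoq!xie,/article/bf3c1215455491a1c741f34ac s443")
```

For the key on this page, `rebuilt` equals the URL property listed in Page Details.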