🕷️ Crawler Inspector

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 38 (from laksa049)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled

🚫 NOT INDEXABLE
✅ CRAWLED (1 year ago)
🤖 ROBOTS ALLOWED

Page Info Filters

| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | FAIL | download_http_code = 200 | HTTP 429 |
| Age cutoff | FAIL | download_stamp > now() - 6 MONTH | 17.1 months ago |
| History drop | FAIL | isNull(history_drop_reason) | oldunavailable |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
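
The filter conditions above can be mirrored in a short Python sketch to make the FAIL/PASS outcomes easier to follow. This is only an illustration, not the inspector's actual implementation: the function names, the dictionary layout, the `fh_dont_index = 0` value, the sample `now` date, and the rule that a page is indexable only when every filter passes are all assumptions made for the example. The field names and the remaining values are taken from the tables on this page.

```python
import calendar
from datetime import datetime


def months_before(moment, months):
    """Shift `moment` back by whole calendar months, clamping the day of month."""
    index = moment.year * 12 + (moment.month - 1) - months
    year, month_index = divmod(index, 12)
    month = month_index + 1
    day = min(moment.day, calendar.monthrange(year, month)[1])
    return moment.replace(year=year, month=month, day=day)


def evaluate_filters(page, now):
    """Return {filter name: True for PASS, False for FAIL} for one page record."""
    canonical = page["meta_canonical"]
    return {
        "HTTP status": page["download_http_code"] == 200,
        "Age cutoff": page["download_stamp"] > months_before(now, 6),
        "History drop": page["history_drop_reason"] is None,
        "Spam/ban": page["fh_dont_index"] != 1 and page["ml_spam_score"] == 0,
        "Canonical": canonical is None
        or canonical == ""
        or canonical == page["src_unparsed"],
    }


def is_indexable(results):
    """Assumed rule: a page is indexable only if every filter passes."""
    return all(results.values())


# Values copied from this page, except fh_dont_index (assumed 0, since the
# Spam/ban filter passes) and `now` (a hypothetical date about 17 months
# after the last crawl).
page = {
    "download_http_code": 429,
    "download_stamp": datetime(2024, 11, 20, 5, 36, 0),
    "history_drop_reason": "oldunavailable",
    "fh_dont_index": 0,
    "ml_spam_score": 0,
    "meta_canonical": None,
    "src_unparsed": "org,ambafrance!cn,/%E7%94%B3%E8%AF%B7%E8%B5%B4%E6%B3%95"
    "%E7%AD%BE%E8%AF%81-%E4%B8%8A%E6%B5%B7 s443",
}
now = datetime(2026, 4, 25)

results = evaluate_filters(page, now)
for name, passed in results.items():
    print(f"{name}: {'PASS' if passed else 'FAIL'}")
print("INDEXABLE" if is_indexable(results) else "NOT INDEXABLE")
```

Under these assumptions the first three filters fail and the last two pass, which agrees with the NOT INDEXABLE status reported for this URL.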

Page Details

| Property | Value |
|---|---|
| URL | https://cn.ambafrance.org/%E7%94%B3%E8%AF%B7%E8%B5%B4%E6%B3%95%E7%AD%BE%E8%AF%81-%E4%B8%8A%E6%B5%B7 |
| Last Crawled | 2024-11-20 05:36:00 (1 year ago) |
| First Indexed | 2017-03-31 07:04:51 (9 years ago) |
| HTTP Status Code | 429 |
| Meta Title | null |
| Meta Description | null |
| Meta Canonical | null |
| Boilerpipe Text | null |
| Markdown | null |
| Readable Markdown | null |
| Shard | 38 (laksa) |
| Root Hash | 10175257910872821838 |
| Unparsed URL | org,ambafrance!cn,/%E7%94%B3%E8%AF%B7%E8%B5%B4%E6%B3%95%E7%AD%BE%E8%AF%81-%E4%B8%8A%E6%B5%B7 s443 |