🕷️ Crawler Inspector

URL Lookup

Direct Parameter Lookup

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 138 (from laksa091)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled
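
The five numbered checks above run in order, and the last one is skipped once the page is known to be crawled. A minimal sketch of that control flow follows. The function `run_lookup`, the `steps` dictionary, and its keys are hypothetical names chosen for illustration; the real tool's queries are not shown on this page.

```python
from typing import Callable, Dict


def run_lookup(url: str, steps: Dict[str, Callable[[str], object]]) -> Dict[str, object]:
    """Run the inspector checks in the order listed on the page.

    `steps` maps a hypothetical step name to a callable that performs the
    underlying query; the names are assumptions, not the tool's API.
    """
    results: Dict[str, object] = {}
    results["shard"] = steps["shard"](url)        # 1. Shard Calculation
    results["crawled"] = steps["crawled"](url)    # 2. Crawled Status Check
    results["robots"] = steps["robots"](url)      # 3. Robots.txt Check
    results["spam_ban"] = steps["spam_ban"](url)  # 4. Spam/Ban Check

    # 5. Seen Status Check: only needed when the page has not been crawled.
    if results["crawled"]:
        results["seen"] = "skipped: page is already crawled"
    else:
        results["seen"] = steps["seen"](url)
    return results


# Stub callables standing in for the real queries (values mirror this page).
example_steps = {
    "shard": lambda url: 138,
    "crawled": lambda url: True,
    "robots": lambda url: "allowed",
    "spam_ban": lambda url: "pass",
    "seen": lambda url: True,
}
example = run_lookup("https://www.thepaper.cn/newsDetail_forward_6644710", example_steps)
```

Passing the queries in as callables keeps the ordering logic testable without access to the underlying data store.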

📍 LOCATION: Host 138 · Partition 13 · laksa138 · Root Hash 5865175494156842738
📄 INDEXABLE: CRAWLED 2 days ago
🤖 ROBOTS ALLOWED

Page Info Filters

Filter        Status  Condition                                         Details
HTTP status   PASS    download_http_code = 200                          HTTP 200
Age cutoff    PASS    download_stamp > now() - 6 MONTH                  0.1 months ago
History drop  PASS    isNull(history_drop_reason)                       No drop reason
Spam/ban      PASS    fh_dont_index != 1 AND ml_spam_score = 0          ml_spam_score=0
Canonical     PASS    meta_canonical IS NULL OR = '' OR = src_unparsed  Not set
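
The filter conditions listed above can be read as simple predicates over one page record. The sketch below re-expresses them in Python as an illustration only. The column names come from the Condition column; the function name `evaluate_filters`, the dictionary layout, the 183-day approximation of "6 MONTH", the value `fh_dont_index = 0`, and the chosen `now` are assumptions.

```python
from datetime import datetime, timedelta
from typing import Dict


def evaluate_filters(row: dict, now: datetime) -> Dict[str, bool]:
    """Return PASS (True) / FAIL (False) for each page info filter."""
    # Assumption: "now() - 6 MONTH" is approximated as 183 days.
    cutoff = now - timedelta(days=183)
    canonical = row.get("meta_canonical")
    return {
        "HTTP status": row.get("download_http_code") == 200,
        "Age cutoff": row["download_stamp"] > cutoff,
        "History drop": row.get("history_drop_reason") is None,
        "Spam/ban": row.get("fh_dont_index") != 1 and row.get("ml_spam_score") == 0,
        "Canonical": canonical is None
        or canonical == ""
        or canonical == row.get("src_unparsed"),
    }


# Values taken from this page where shown; fh_dont_index is assumed to be 0.
page = {
    "download_http_code": 200,
    "download_stamp": datetime(2026, 6, 1, 2, 29, 7),
    "history_drop_reason": None,
    "fh_dont_index": 0,
    "ml_spam_score": 0,
    "meta_canonical": None,
    "src_unparsed": "cn,thepaper!www,/newsDetail_forward_6644710 s443",
}
# Assumption: "now" is two days after the last crawl, matching "2 days ago".
statuses = evaluate_filters(page, now=datetime(2026, 6, 3, 2, 29, 7))
```

With these inputs every filter evaluates to True, which matches the PASS column.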

Page Details

Property                Value
URL                     https://www.thepaper.cn/newsDetail_forward_6644710
Last Crawled            2026-06-01 02:29:07 (2 days ago)
First Indexed           2023-07-03 14:12:05 (2 years ago)
HTTP Status Code        200

Content
Meta Title              Don't misuse cough medicine when your child has phlegm: 7 ways to clear it_澎湃号·湃客_澎湃新闻-The Paper
Meta Description        Young infants have a weak cough and narrow airways, so they usually do not cough up phlegm on their own. Parents therefore sometimes need to help the baby clear phlegm so the baby can breathe more easily.
Meta Canonical          null
Boilerpipe Text         heavy column, fetched on demand
Markdown                heavy column, fetched on demand
Readable Markdown       heavy column, fetched on demand

ML Classification
ML Categories           null
ML Page Types           null
ML Intent Types         null

Content Metadata
Language                null
Author                  null
Publish Time            not set
Original Publish Time   2023-07-03 14:12:05 (2 years ago)
Republished             No
Word Count (Total)      133
Word Count (Content)    69

Links
External Links          15
Internal Links          7

Technical SEO
Meta Nofollow           No
Meta Noarchive          No
JS Rendered             No
Redirect Target         null

Performance
Download Time (ms)      573
TTFB (ms)               570
Download Size (bytes)   9,999

Location
Host ID                 138 (laksa138)
Partition ID            13
Root Hash               5865175494156842738
Unparsed URL            cn,thepaper!www,/newsDetail_forward_6644710 s443
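
The Unparsed URL value appears to be a reversed-host key for the URL shown at the top of Page Details. The sketch below converts such a key back to a URL. The format rules are inferred from this single example and are assumptions: host labels are stored in reverse order and separated by `,`, the `!` marks the end of the registered domain, and the trailing `s443` means HTTPS on port 443. The function name `unparsed_key_to_url` is hypothetical.

```python
def unparsed_key_to_url(key: str) -> str:
    """Rebuild a URL from a reversed-host key (format inferred, not documented)."""
    # Split off the trailing scheme/port token, e.g. "s443".
    body, _, suffix = key.rpartition(" ")
    # Assumption: a leading "s" means HTTPS; anything else is treated as HTTP.
    scheme = "https" if suffix.startswith("s") else "http"
    port = "".join(ch for ch in suffix if ch.isdigit())

    # Everything before the first "/" is the host part; the rest is the path.
    slash = body.find("/")
    host_part, path = (body, "/") if slash == -1 else (body[:slash], body[slash:])

    # Treat "!" like "," and reverse the labels: cn,thepaper,www -> www.thepaper.cn
    labels = [label for label in host_part.replace("!", ",").split(",") if label]
    host = ".".join(reversed(labels))

    # Omit the port when it is the default for the scheme.
    default_port = {"https": "443", "http": "80"}[scheme]
    netloc = host if port in ("", default_port) else f"{host}:{port}"
    return f"{scheme}://{netloc}{path}"


url = unparsed_key_to_url("cn,thepaper!www,/newsDetail_forward_6644710 s443")
```

For the key on this page the result equals the URL row of Page Details; other keys may follow rules this sketch does not cover.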