🕷️ Crawler Inspector

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 55 (from laksa167)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled

📍 LOCATION: Host 55 · Partition 0 · laksa055 · 10153935897910600055
📄 INDEXABLE: CRAWLED 3 days ago
🤖 ROBOTS ALLOWED

Page Info Filters

Filter | Status | Condition | Details
HTTP status | PASS | download_http_code = 200 | HTTP 200
Age cutoff | PASS | download_stamp > now() - 6 MONTH | 0.1 months ago
History drop | PASS | isNull(history_drop_reason) | No drop reason
Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0
Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set
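
The five filter conditions above can be restated as plain predicates over a single page record. The following is a minimal sketch, not the tool's actual implementation: the `evaluate_filters` function, the dictionary layout of `row`, the 183-day approximation of "6 MONTH", the UTC timezone, the `fh_dont_index` value of 0, and the assumed inspection date are all assumptions made for illustration. Only the column names and comparison values come from the table.

```python
from datetime import datetime, timedelta, timezone


def evaluate_filters(row, now=None):
    """Return a mapping of filter name -> True (PASS) / False (FAIL).

    `row` is a hypothetical dict keyed by the column names shown in the
    Condition column of the table; it is not a real API of the tool.
    """
    now = now or datetime.now(timezone.utc)
    # Assumption: "6 MONTH" is approximated as 183 days.
    cutoff = now - timedelta(days=183)
    canonical = row.get("meta_canonical")
    return {
        "HTTP status": row.get("download_http_code") == 200,
        "Age cutoff": row["download_stamp"] > cutoff,
        "History drop": row.get("history_drop_reason") is None,
        "Spam/ban": row.get("fh_dont_index") != 1
        and row.get("ml_spam_score") == 0,
        "Canonical": canonical is None
        or canonical == ""
        or canonical == row.get("src_unparsed"),
    }


# Values taken from this page where shown; fh_dont_index and the
# timezone are assumed because the page does not display them.
row = {
    "download_http_code": 200,
    "download_stamp": datetime(2026, 5, 31, 5, 21, 13, tzinfo=timezone.utc),
    "history_drop_reason": None,
    "fh_dont_index": 0,
    "ml_spam_score": 0,
    "meta_canonical": None,
    "src_unparsed": "com,newsweek!www,/fukushima-nuclear-plant-radioactive-water-leaking-months-674434 s443",
}
# Assumed inspection time, roughly three days after the crawl.
results = evaluate_filters(row, now=datetime(2026, 6, 3, tzinfo=timezone.utc))
for name, passed in results.items():
    print(f"{name}: {'PASS' if passed else 'FAIL'}")
```

Under these assumptions every filter evaluates to PASS, which agrees with the Status column.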

Page Details

Property | Value
URL | https://www.newsweek.com/fukushima-nuclear-plant-radioactive-water-leaking-months-674434
Last Crawled | 2026-05-31 05:21:13 (3 days ago)
First Indexed | 2018-07-21 00:21:41 (7 years ago)
HTTP Status Code | 200
Content
Meta Title | Fukushima Nuclear Disaster: Radioactive Water May Have Been Leaking From Reactors for Months - Newsweek
Meta Description | The estimated cost of the long-term cleanup following the nuclear disaster is $192 billion.
Meta Canonical | null
Boilerpipe Text | heavy column, fetched on demand
Markdown | heavy column, fetched on demand
Readable Markdown | heavy column, fetched on demand
ML Classification
ML Categories
/News | 85.1%
/News/World_News | 75.5%
/Science | 56.7%
/Science/Physics | 24.6%
Raw JSON
{
    "/News": 851,
    "/News/World_News": 755,
    "/Science": 567,
    "/Science/Physics": 246
}
ML Page Types
/Article | 99.5%
/Article/News_Update | 99.4%
Raw JSON
{
    "/Article": 995,
    "/Article/News_Update": 994
}
ML Intent Types
Informational | 99.9%
Raw JSON
{
    "Informational": 999
}
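
The Raw JSON blocks appear to store each score as an integer in tenths of a percent (for example, 851 is shown as 85.1%). A minimal sketch of that conversion follows; the `scores_to_percent` helper is hypothetical, and the per-mille interpretation is inferred only from the values displayed on this page.

```python
import json


def scores_to_percent(raw_json):
    """Convert per-mille integer scores into display strings like '85.1%'.

    Integer arithmetic is used so no floating-point rounding is involved.
    """
    scores = json.loads(raw_json)
    return {
        label: f"{value // 10}.{value % 10}%"
        for label, value in scores.items()
    }


raw = '{"/News": 851, "/News/World_News": 755, "/Science": 567, "/Science/Physics": 246}'
for label, percent in scores_to_percent(raw).items():
    print(label, percent)
```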
Content Metadata
Language | en
Author | Tom O'Connor
Publish Time | not set
Original Publish Time | 2018-07-21 00:21:41 (7 years ago)
Republished | No
Word Count (Total) | 1,474
Word Count (Content) | 642
Links
External Links | 22
Internal Links | 94
Technical SEO
Meta Nofollow | No
Meta Noarchive | No
JS Rendered | Yes
Redirect Target | null
Performance
Download Time (ms) | 1,401
TTFB (ms) | 1,391
Download Size (bytes) | 47,043
Location
Host ID | 55 (laksa055)
Partition ID | 0
Root Hash | 10153935897910600055
Unparsed URL | com,newsweek!www,/fukushima-nuclear-plant-radioactive-water-leaking-months-674434 s443
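
The Unparsed URL value looks like a reversed-host key: the registered domain labels in reverse order, a `!`, the remaining subdomain labels, the path, and a scheme/port suffix. The sketch below reproduces the single example on this page and nothing more. The `to_unparsed_key` function, the rule that the last two host labels form the registered domain, and the reading of `s443` as "https on port 443" are all guesses from that one example, not documented behavior of the crawler.

```python
from urllib.parse import urlsplit


def to_unparsed_key(url, registered_labels=2):
    """Build a key shaped like 'com,newsweek!www,/path s443'.

    Assumptions (inferred from one example only):
    - the last `registered_labels` host labels are the registered domain;
    - 's' marks https; other schemes are not handled because the page
      shows no example of them;
    - the query string, if any, is ignored.
    """
    parts = urlsplit(url)
    if parts.scheme != "https":
        raise ValueError("only https URLs are covered by this sketch")
    labels = parts.hostname.split(".")
    domain = labels[-registered_labels:]
    subdomain = labels[:-registered_labels]
    port = parts.port or 443
    host_key = ",".join(reversed(domain)) + "!" + ",".join(reversed(subdomain))
    return f"{host_key},{parts.path} s{port}"


key = to_unparsed_key(
    "https://www.newsweek.com/fukushima-nuclear-plant-radioactive-water-leaking-months-674434"
)
print(key)
```

A real implementation would likely need a public suffix list to find the registered domain for hosts such as `example.co.uk`; the fixed two-label rule here is only a placeholder.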