🕷️ Crawler Inspector

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 37 (from laksa067)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled

📍 LOCATION: Host 37 · Partition 41 (laksa037), root hash 13831140440916828237
📄 INDEXABLE: CRAWLED 6 days ago
🤖 ROBOTS ALLOWED

Page Info Filters

Filter | Status | Condition | Details
HTTP status | PASS | download_http_code = 200 | HTTP 200
Age cutoff | PASS | download_stamp > now() - 6 MONTH | 0.2 months ago
History drop | PASS | isNull(history_drop_reason) | No drop reason
Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0
Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set
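
The five filter conditions above can be restated as a small predicate. The following is only a sketch, not the inspector's actual implementation: the function name `page_info_filters`, the dictionary-shaped row, the UTC timezone, the 30-day approximation of a month, and the `fh_dont_index` value of 0 are all assumptions made for illustration. The column names are taken from the Condition column.

```python
from datetime import datetime, timedelta, timezone


def page_info_filters(row, now=None):
    """Evaluate the five page-info filters for one row (hypothetical helper)."""
    now = now or datetime.now(timezone.utc)
    # Assumption: "6 MONTH" is approximated here as 6 * 30 days.
    cutoff = now - timedelta(days=6 * 30)
    canonical = row.get("meta_canonical")
    return {
        "HTTP status": row["download_http_code"] == 200,
        "Age cutoff": row["download_stamp"] > cutoff,
        "History drop": row.get("history_drop_reason") is None,
        "Spam/ban": row.get("fh_dont_index") != 1 and row.get("ml_spam_score") == 0,
        # Passes when no canonical is set or it points back at the page itself.
        "Canonical": canonical is None or canonical == "" or canonical == row["src_unparsed"],
    }


# Values copied from the page details; fh_dont_index and the timezone are assumed.
row = {
    "download_http_code": 200,
    "download_stamp": datetime(2026, 5, 27, 21, 39, 10, tzinfo=timezone.utc),
    "history_drop_reason": None,
    "fh_dont_index": 0,
    "ml_spam_score": 0,
    "meta_canonical": None,
    "src_unparsed": "com,self!www,/story/morning-headaches s443",
}
# "now" is pinned six days after the crawl so the result is reproducible.
results = page_info_filters(row, now=datetime(2026, 6, 2, 21, 39, 10, tzinfo=timezone.utc))
indexable = all(results.values())
```

Under these assumptions every filter passes, which is consistent with the INDEXABLE status reported for this page.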

Page Details

Property: Value
URL: https://www.self.com/story/morning-headaches
Last Crawled: 2026-05-27 21:39:10 (6 days ago)
First Indexed: 2017-08-24 15:47:26 (8 years ago)
HTTP Status Code: 200
Content
Meta Title: 11 Reasons Why You Are Waking Up With a Headache | SELF
Meta Description: Waking up with a headache could be due to caffeine withdrawal, migraine, insomnia, and more. Learn what causes morning headaches and how to find relief here.
Meta Canonical: null
Boilerpipe Text: heavy column, fetched on demand
Markdown: heavy column, fetched on demand
Readable Markdown: heavy column, fetched on demand
ML Classification
ML Categories
/Health: 99.8%
/Health/Health_Conditions: 95.5%
/Health/Health_Conditions/Neurological_Conditions: 92.4%
Raw JSON
{
    "/Health": 998,
    "/Health/Health_Conditions": 955,
    "/Health/Health_Conditions/Neurological_Conditions": 924
}
ML Page Types
/Article: 99.8%
/Article/How_to: 57.9%
Raw JSON
{
    "/Article": 998,
    "/Article/How_to": 579
}
ML Intent Types
Informational: 99.9%
Raw JSON
{
    "Informational": 999
}
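
The Raw JSON blocks above store each score as an integer that appears to be a per-mille value (for example, 998 is displayed as 99.8%). A minimal sketch of that conversion follows; the function name `scores_to_percent` is hypothetical, and the per-mille interpretation is inferred from the displayed percentages rather than documented.

```python
import json


def scores_to_percent(raw_json):
    """Convert integer per-mille scores into percentage strings (hypothetical helper)."""
    scores = json.loads(raw_json)
    # Assumption: each stored integer is the displayed percentage times 10.
    return {label: f"{value / 10:.1f}%" for label, value in scores.items()}


raw = """
{
    "/Health": 998,
    "/Health/Health_Conditions": 955,
    "/Health/Health_Conditions/Neurological_Conditions": 924
}
"""
percentages = scores_to_percent(raw)
for label, pct in percentages.items():
    print(f"{label}: {pct}")
```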
Content Metadata
Language: en-us
Author: Rebecca Joy Stanborough
Publish Time: 2017-08-24 15:38:16 (8 years ago)
Original Publish Time: 2017-08-24 15:38:16 (8 years ago)
Republished: No
Word Count (Total): 3,045
Word Count (Content): 2,657
Links
External Links: 40
Internal Links: 44
Technical SEO
Meta Nofollow: No
Meta Noarchive: Yes
JS Rendered: Yes
Redirect Target: null
Performance
Download Time (ms): 149
TTFB (ms): 140
Download Size (bytes): 98,514
Location
Host ID: 37 (laksa037)
Partition ID: 41
Root Hash: 13831140440916828237
Unparsed URL: com,self!www,/story/morning-headaches s443
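
The Unparsed URL value looks like a reversed-host key: registrable domain labels in reverse order, then `!`, the subdomain, the path, and a scheme/port suffix. The encoding is not documented on this page, so the sketch below only reproduces the single example shown. The function name `unparsed_key`, the rule that the registrable domain is the last two host labels, and the ordering of multi-label subdomains are all assumptions; only HTTPS URLs are handled because no other suffix appears in the source.

```python
from urllib.parse import urlsplit


def unparsed_key(url):
    """Build a reversed-host key resembling the Unparsed URL field (hypothetical helper)."""
    parts = urlsplit(url)
    if parts.scheme != "https":
        # Only the "s443" form is visible in the source; other schemes are not guessed.
        raise ValueError("only https URLs are covered by this sketch")
    labels = parts.hostname.split(".")
    # Assumption: the registrable domain is the last two labels (no public suffix list).
    domain = ",".join(reversed(labels[-2:]))
    # Assumption: remaining subdomain labels are also reversed and comma-joined.
    subdomain = ",".join(reversed(labels[:-2]))
    port = parts.port or 443
    return f"{domain}!{subdomain},{parts.path} s{port}"


key = unparsed_key("https://www.self.com/story/morning-headaches")
print(key)
```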