🕷️ Crawler Inspector

URL Lookup

Direct Parameter Lookup

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 186 (from laksa056)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled

📍
LOCATION
Host 186 · Partition 6
laksa186
5725827201641901386
📄
INDEXABLE
CRAWLED
1 day ago
🤖
ROBOTS ALLOWED

Page Info Filters

FilterStatusConditionDetails
HTTP statusPASSdownload_http_code = 200HTTP 200
Age cutoffPASSdownload_stamp > now() - 6 MONTH0.1 months ago
History dropPASSisNull(history_drop_reason)No drop reason
Spam/banPASSfh_dont_index != 1 AND ml_spam_score = 0ml_spam_score=0
CanonicalPASSmeta_canonical IS NULL OR = '' OR = src_unparsedNot set

Page Details

PropertyValue
URLhttps://www.wheresleep.com/paris.htm
Last Crawled2026-06-01 17:51:33 (1 day ago)
First Indexed2022-03-11 23:14:56 (4 years ago)
HTTP Status Code200
Content
Meta TitleWhere to stay in Paris: best areas and neighborhoods
Meta DescriptionWhere to stay in Paris: neighborhoods and the best area to stay in Paris. A useful guide with map and accommodation for families, cheap hotels and hostels for young people in Paris and surroundings
Meta Canonicalnull
Boilerpipe Text
heavy column, fetched on demand
Markdown
heavy column, fetched on demand
Readable Markdown
heavy column, fetched on demand
ML Classification
ML Categories
/Travel_and_Transportation
99.2%
/Travel_and_Transportation/Hotels_and_Accommodations
96.5%
/Travel_and_Transportation/Hotels_and_Accommodations/Vacation_Rentals_and_Short-Term_Stays
82.3%
Raw JSON
{
    "/Travel_and_Transportation": 992,
    "/Travel_and_Transportation/Hotels_and_Accommodations": 965,
    "/Travel_and_Transportation/Hotels_and_Accommodations/Vacation_Rentals_and_Short-Term_Stays": 823
}
ML Page Types
/Article
94.7%
/Article/Tutorial_or_Guide
46.8%
Raw JSON
{
    "/Article": 947,
    "/Article/Tutorial_or_Guide": 468
}
ML Intent Types
Informational
83.9%
Commercial
63.2%
Raw JSON
{
    "Informational": 839,
    "Commercial": 632
}
Content Metadata
Languageen
Authornull
Publish Timenot set
Original Publish Time2022-03-11 23:14:56 (4 years ago)
RepublishedNo
Word Count (Total)5,828
Word Count (Content)3,993
Links
External Links121
Internal Links3
Technical SEO
Meta NofollowNo
Meta NoarchiveNo
JS RenderedNo
Redirect Targetnull
Performance
Download Time (ms)33
TTFB (ms)32
Download Size (bytes)26,108
Location
Host ID186 (laksa186)
Partition ID6
Root Hash5725827201641901386
Unparsed URLcom,wheresleep!www,/paris.htm s443