🕷️ Crawler Inspector

URL Lookup

Direct Parameter Lookup

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 49 (from laksa124)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled

📍
LOCATION
Host 49 · Partition 42
laksa049
10886942705093948449
📄
INDEXABLE
CRAWLED
1 day ago
🤖
ROBOTS ALLOWED

Page Info Filters

FilterStatusConditionDetails
HTTP statusPASSdownload_http_code = 200HTTP 200
Age cutoffPASSdownload_stamp > now() - 6 MONTH0.1 months ago
History dropPASSisNull(history_drop_reason)No drop reason
Spam/banPASSfh_dont_index != 1 AND ml_spam_score = 0ml_spam_score=0
CanonicalPASSmeta_canonical IS NULL OR = '' OR = src_unparsedNot set

Page Details

PropertyValue
URLhttps://www.green-card.com/
Last Crawled2026-06-01 13:45:32 (1 day ago)
First Indexed2016-11-28 05:52:12 (9 years ago)
HTTP Status Code200
Content
Meta TitleGreen Card: living and working in the USA » Greencard
Meta DescriptionThe Green Card grants permanent residence and an unlimited work permit for the USA. Learn how to get the popular immigrant visa for America!
Meta Canonicalnull
Boilerpipe Text
heavy column, fetched on demand
Markdown
heavy column, fetched on demand
Readable Markdown
heavy column, fetched on demand
ML Classification
ML Categories
/Law_and_Government
82.4%
/Law_and_Government/Legal
70.7%
/Law_and_Government/Legal/Legal_Services
52.4%
/Jobs_and_Education
34.2%
/Jobs_and_Education/Jobs
33.6%
/Jobs_and_Education/Jobs/Career_Resources_and_Planning
25.9%
Raw JSON
{
    "/Law_and_Government": 824,
    "/Law_and_Government/Legal": 707,
    "/Law_and_Government/Legal/Legal_Services": 524,
    "/Jobs_and_Education": 342,
    "/Jobs_and_Education/Jobs": 336,
    "/Jobs_and_Education/Jobs/Career_Resources_and_Planning": 259
}
ML Page Types
/Article
85.4%
/Article/FAQ
41.1%
Raw JSON
{
    "/Article": 854,
    "/Article/FAQ": 411
}
ML Intent Types
Informational
99.3%
Commercial
22.4%
Raw JSON
{
    "Informational": 993,
    "Commercial": 224
}
Content Metadata
Languagede
Authornull
Publish Timenot set
Original Publish Time2016-11-28 05:52:12 (9 years ago)
RepublishedNo
Word Count (Total)3,526
Word Count (Content)3,269
Links
External Links11
Internal Links9
Technical SEO
Meta NofollowNo
Meta NoarchiveNo
JS RenderedNo
Redirect Targetnull
Performance
Download Time (ms)798
TTFB (ms)796
Download Size (bytes)17,479
Location
Host ID49 (laksa049)
Partition ID42
Root Hash10886942705093948449
Unparsed URLcom,green-card!www,/ s443