🕷️ Crawler Inspector

URL Lookup

Direct Parameter Lookup

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 155 (from laksa107)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled

📍
LOCATION
Host 155 · Partition 36
laksa155
6314941129111807355
📄
INDEXABLE
CRAWLED
8 days ago
🤖
ROBOTS ALLOWED

Page Info Filters

FilterStatusConditionDetails
HTTP statusPASSdownload_http_code = 200HTTP 200
Age cutoffPASSdownload_stamp > now() - 6 MONTH0.3 months ago
History dropPASSisNull(history_drop_reason)No drop reason
Spam/banPASSfh_dont_index != 1 AND ml_spam_score = 0ml_spam_score=0
CanonicalPASSmeta_canonical IS NULL OR = '' OR = src_unparsedNot set

Page Details

PropertyValue
URLhttps://www.ucas.com/careers-advice/how-to-become/pilot
Last Crawled2026-05-26 13:23:32 (8 days ago)
First Indexed2023-06-08 18:36:50 (2 years ago)
HTTP Status Code200
Content
Meta TitleHow to become a pilot | UCAS
Meta DescriptionBeing a pilot is a fantastic job for people who enjoy responsibility, technology, meeting people and the excitement of flying a commercial aircraft.
Meta Canonicalnull
Boilerpipe Text
heavy column, fetched on demand
Markdown
heavy column, fetched on demand
Readable Markdown
heavy column, fetched on demand
ML Classification
ML Categories
/Jobs_and_Education
96.5%
/Jobs_and_Education/Education
85.4%
/Jobs_and_Education/Education/Vocational_and_Continuing_Education
67.0%
Raw JSON
{
    "/Jobs_and_Education": 965,
    "/Jobs_and_Education/Education": 854,
    "/Jobs_and_Education/Education/Vocational_and_Continuing_Education": 670
}
ML Page Types
/Article
94.5%
/Article/How_to
89.8%
Raw JSON
{
    "/Article": 945,
    "/Article/How_to": 898
}
ML Intent Types
Informational
99.8%
Raw JSON
{
    "Informational": 998
}
Content Metadata
Languageen
Authornull
Publish Timenot set
Original Publish Time2023-06-08 18:36:50 (2 years ago)
RepublishedNo
Word Count (Total)3,838
Word Count (Content)2,524
Links
External Links26
Internal Links320
Technical SEO
Meta NofollowNo
Meta NoarchiveNo
JS RenderedNo
Redirect Targetnull
Performance
Download Time (ms)90
TTFB (ms)90
Download Size (bytes)23,205
Location
Host ID155 (laksa155)
Partition ID36
Root Hash6314941129111807355
Unparsed URLcom,ucas!www,/careers-advice/how-to-become/pilot s443