đŸ•ˇī¸ Crawler Inspector

URL Lookup

Direct Parameter Lookup

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 183 (from laksa007)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

â„šī¸ Skipped - page is already crawled

đŸšĢ
NOT INDEXABLE
✅
CRAWLED
8 months ago
🤖
ROBOTS ALLOWED

Page Info Filters

FilterStatusConditionDetails
HTTP statusPASSdownload_http_code = 200HTTP 200
Age cutoffFAILdownload_stamp > now() - 6 MONTH8.4 months ago
History dropPASSisNull(history_drop_reason)No drop reason
Spam/banPASSfh_dont_index != 1 AND ml_spam_score = 0ml_spam_score=0
CanonicalPASSmeta_canonical IS NULL OR = '' OR = src_unparsedNot set

Page Details

PropertyValue
URLhttps://www.gla.ac.uk/media/Media_141049_smxx.pdf
Last Crawled2025-08-06 14:01:07 (8 months ago)
First Indexednot set
HTTP Status Code200
Meta Titlenull
Meta Descriptionnull
Meta Canonicalnull
Boilerpipe Textnull
Markdownnull
Readable Markdownnull
Shard183 (laksa)
Root Hash13615915157601171583
Unparsed URLuk,ac,gla!www,/media/Media_141049_smxx.pdf s443