âšī¸ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | FAIL | download_http_code = 200 | HTTP 403 |
| Age cutoff | FAIL | download_stamp > now() - 6 MONTH | 7.7 months ago |
| History drop | FAIL | isNull(history_drop_reason) | oldunavailable |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value |
|---|---|
| URL | https://www.repository.cam.ac.uk/bitstreams/98a85f6a-0751-4e66-85e5-b2c371d54edb/download |
| Last Crawled | 2025-08-30 02:34:02 (7 months ago) |
| First Indexed | not set |
| HTTP Status Code | 403 |
| Meta Title | null |
| Meta Description | null |
| Meta Canonical | null |
| Boilerpipe Text | null |
| Markdown | null |
| Readable Markdown | null |
| Shard | 64 (laksa) |
| Root Hash | 4147139263072824664 |
| Unparsed URL | uk,ac,cam!repository,www,/bitstreams/98a85f6a-0751-4e66-85e5-b2c371d54edb/download s443 |