âšī¸ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | FAIL | download_http_code = 200 | HTTP 403 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 1.5 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value |
|---|---|
| URL | https://www.cnbc.com/brexit/ |
| Last Crawled | 2026-02-20 18:14:57 (1 month ago) |
| First Indexed | not set |
| HTTP Status Code | 403 |
| Meta Title | null |
| Meta Description | null |
| Meta Canonical | null |
| Boilerpipe Text | null |
| Markdown | null |
| Readable Markdown | null |
| Shard | 110 (laksa) |
| Root Hash | 11604498698149289310 |
| Unparsed URL | com,cnbc!www,/brexit/ s443 |