ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | FAIL | download_stamp > now() - 6 MONTH | 6.1 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
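
The five conditions above can be read as independent boolean checks on one crawl record. The following is a minimal sketch of that reading, not the tool's actual implementation. The function name `evaluate_filters`, the dict-based row, the 183-day approximation of six months, the sample `now` value, and the sample `fh_dont_index` value are all assumptions made for illustration. The column names come from the Condition column.

```python
from datetime import datetime, timedelta

def evaluate_filters(row: dict, now: datetime) -> dict:
    """Return PASS/FAIL (True/False) per filter for one crawl record."""
    # Assumption: "6 MONTH" is approximated as 183 days for this sketch.
    cutoff = now - timedelta(days=183)
    canonical = row.get("meta_canonical")
    return {
        "HTTP status": row["download_http_code"] == 200,
        "Age cutoff": row["download_stamp"] > cutoff,
        "History drop": row.get("history_drop_reason") is None,
        "Spam/ban": row.get("fh_dont_index") != 1 and row.get("ml_spam_score") == 0,
        "Canonical": canonical is None or canonical == "" or canonical == row["src_unparsed"],
    }

# Sample record built from the values in this report.
# fh_dont_index=0 and the value of `now` are assumed, not shown in the report.
row = {
    "download_http_code": 200,
    "download_stamp": datetime(2025, 10, 19, 20, 50, 26),
    "history_drop_reason": None,
    "fh_dont_index": 0,
    "ml_spam_score": 0,
    "meta_canonical": None,
    "src_unparsed": "cn,gov,guangshui!www,/ywdt/gsxw/202108/t20210809_905085.shtml h80",
}
results = evaluate_filters(row, now=datetime(2026, 4, 23))
for name, passed in results.items():
    print(f"{name}: {'PASS' if passed else 'FAIL'}")
```

With these assumed inputs only the age check fails, which matches the Status column: the last crawl is slightly older than the six-month cutoff.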

| Property | Value |
|---|---|
| URL | http://www.guangshui.gov.cn/ywdt/gsxw/202108/t20210809_905085.shtml |
| Last Crawled | 2025-10-19 20:50:26 (6 months ago) |
| First Indexed | 2022-09-05 21:31:25 (3 years ago) |
| HTTP Status Code | 200 |
| Meta Title | Warning |
| Meta Description | null |
| Meta Canonical | null |
| Boilerpipe Text | The page you visited is not compliant and has been banned! If you think this is and error, please contact your network administrator. |
| Markdown | # 403<br>您访问的网页不符合公司规定,已被禁止!<br>The page you visited is not compliant and has been banned!<br>如果您认为这是一个错误,请联系网络管理员!<br>If you think this is and error, please contact your network administrator. |
| Readable Markdown | null |
| Shard | 58 (laksa) |
| Root Hash | 16231658633611224458 |
| Unparsed URL | cn,gov,guangshui!www,/ywdt/gsxw/202108/t20210809_905085.shtml h80 |
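
The Unparsed URL value looks like a reversed-host key: the registrable domain's labels in reverse order, a `!`, the remaining subdomain labels, a comma, the path, and a scheme/port suffix. The sketch below is inferred from this single row only and is not the crawler's actual normalization code. The function name `to_unparsed_key`, the `registrable_labels` parameter (used here in place of a real public-suffix lookup), and the reading of `h80` as "http, port 80" are assumptions.

```python
from urllib.parse import urlsplit

def to_unparsed_key(url: str, registrable_labels: int = 3) -> str:
    """Build a reversed-host key in the style of the Unparsed URL row (hypothetical)."""
    parts = urlsplit(url)
    if parts.scheme != "http":
        # Only the http form appears in the report; other schemes are not guessed.
        raise ValueError("only http URLs are covered by this sketch")
    labels = parts.hostname.split(".")
    # Assumption: the last `registrable_labels` labels form the registrable
    # domain (guangshui.gov.cn here); everything before them is the subdomain.
    domain = labels[-registrable_labels:][::-1]
    subdomain = labels[:-registrable_labels][::-1]
    port = parts.port or 80
    return f"{','.join(domain)}!{','.join(subdomain)},{parts.path} h{port}"

key = to_unparsed_key(
    "http://www.guangshui.gov.cn/ywdt/gsxw/202108/t20210809_905085.shtml"
)
print(key)
```

For the URL in this report, the function reproduces the string shown in the Unparsed URL row. Hosts with a different public suffix depth would need a different `registrable_labels` value.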