ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 0.3 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | FAIL | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=785 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value |
|---|---|
| URL | https://www.siu.com.cn/show-638.html |
| Last Crawled | 2026-05-26 05:15:23 (7 days ago) |
| First Indexed | 2025-08-02 19:57:10 (10 months ago) |
| HTTP Status Code | 200 |
| Content | |
| Meta Title | 重要通知!驻日大使馆发布赴华健康码最新政策_出国资讯_新闻中心_山东国际|国际联合|出国劳务|出国打工 |
| Meta Description | 中国驻日本大使馆提醒您“非必要,不旅行”。 |
| Meta Canonical | null |
| Boilerpipe Text | heavy column, fetched on demand |
| Markdown | heavy column, fetched on demand |
| Readable Markdown | heavy column, fetched on demand |
| ML Classification | |
| ML Categories | null |
| ML Page Types | null |
| ML Intent Types | null |
| Content Metadata | |
| Language | null |
| Author | null |
| Publish Time | not set |
| Original Publish Time | 2025-08-02 19:57:10 (10 months ago) |
| Republished | No |
| Word Count (Total) | 193 |
| Word Count (Content) | 109 |
| Links | |
| External Links | 12 |
| Internal Links | 34 |
| Technical SEO | |
| Meta Nofollow | No |
| Meta Noarchive | No |
| JS Rendered | No |
| Redirect Target | null |
| Performance | |
| Download Time (ms) | 2,094 |
| TTFB (ms) | 2,093 |
| Download Size (bytes) | 10,059 |
| Location | |
| Host ID | 95 (laksa095) |
| Partition ID | 54 |
| Root Hash | 270786207286870895 |
| Unparsed URL | cn,com,siu!www,/show-638.html s443 |