โน๏ธ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 0 months ago (distributed domain, exempt) |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value | ||||||
|---|---|---|---|---|---|---|---|
| URL | https://hk.news.yahoo.com/ | ||||||
| Last Crawled | 2026-06-02 04:01:58 (23 hours ago) | ||||||
| First Indexed | 2014-03-15 13:46:53 (12 years ago) | ||||||
| HTTP Status Code | 200 | ||||||
| Content | |||||||
| Meta Title | Yahooๆฐ่ | ||||||
| Meta Description | Yahooๆฐ่ๆไพๅ้กๆๆฐ็ฆ้ปๅ็ฑ้ๆฐ่ใ้ฑ่ฎๆทฑๅ ฅ็็ธ้ๅ ฑ้ใๆฐ่ๅฝฑ็ๅๅ็ใ | ||||||
| Meta Canonical | null | ||||||
| Boilerpipe Text | heavy column, fetched on demand | ||||||
| Markdown | heavy column, fetched on demand | ||||||
| Readable Markdown | heavy column, fetched on demand | ||||||
| ML Classification | |||||||
| ML Categories |
Raw JSON{
"/News": 948,
"/News/Local_News": 691
} | ||||||
| ML Page Types |
Raw JSON{
"/Article": 872,
"/Article/News_Update": 871
} | ||||||
| ML Intent Types |
Raw JSON{
"Informational": 999
} | ||||||
| Content Metadata | |||||||
| Language | zh-hant-hk | ||||||
| Author | null | ||||||
| Publish Time | not set | ||||||
| Original Publish Time | 2014-03-15 13:46:53 (12 years ago) | ||||||
| Republished | No | ||||||
| Word Count (Total) | 889 | ||||||
| Word Count (Content) | 532 | ||||||
| Links | |||||||
| External Links | 9 | ||||||
| Internal Links | 217 | ||||||
| Technical SEO | |||||||
| Meta Nofollow | No | ||||||
| Meta Noarchive | No | ||||||
| JS Rendered | Yes | ||||||
| Redirect Target | null | ||||||
| Performance | |||||||
| Download Time (ms) | 1,008 | ||||||
| TTFB (ms) | 583 | ||||||
| Download Size (bytes) | 65,996 | ||||||
| Location | |||||||
| Host ID | 192 (laksa192) | ||||||
| Partition ID | 63 | ||||||
| Root Hash | 3401813433841292792 | ||||||
| Unparsed URL | com,yahoo!news,hk,/ s443 | ||||||