ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 9.5 months ago (distributed domain, exempt) |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| URL | https://ko.wikipedia.org/wiki/%EB%82%A8%EB%8C%80%EB%AC%B8%EC%8B%9C%EC%9E%A5 | ||||||||||||
| Last Crawled | 2025-08-24 08:49:45 (9 months ago) | ||||||||||||
| First Indexed | 2015-06-18 17:49:38 (10 years ago) | ||||||||||||
| HTTP Status Code | 200 | ||||||||||||
| Content | |||||||||||||
| Meta Title | 남대문시장 | ||||||||||||
| Meta Description | null | ||||||||||||
| Meta Canonical | null | ||||||||||||
| Boilerpipe Text | heavy column, fetched on demand | ||||||||||||
| Markdown | heavy column, fetched on demand | ||||||||||||
| Readable Markdown | heavy column, fetched on demand | ||||||||||||
| ML Classification | |||||||||||||
| ML Categories |
Raw JSON{
"/Shopping": 731,
"/Shopping/Swap_Meets_and_Outdoor_Markets": 551,
"/Reference": 290,
"/Reference/Geographic_Reference": 240
} | ||||||||||||
| ML Page Types |
Raw JSON{
"/Article": 992,
"/Article/Wiki": 889
} | ||||||||||||
| ML Intent Types |
Raw JSON{
"Informational": 998
} | ||||||||||||
| Content Metadata | |||||||||||||
| Language | null | ||||||||||||
| Author | null | ||||||||||||
| Publish Time | not set | ||||||||||||
| Original Publish Time | 2015-06-18 17:49:38 (10 years ago) | ||||||||||||
| Republished | No | ||||||||||||
| Word Count (Total) | 630 | ||||||||||||
| Word Count (Content) | 158 | ||||||||||||
| Links | |||||||||||||
| External Links | 14 | ||||||||||||
| Internal Links | 230 | ||||||||||||
| Technical SEO | |||||||||||||
| Meta Nofollow | No | ||||||||||||
| Meta Noarchive | No | ||||||||||||
| JS Rendered | No | ||||||||||||
| Redirect Target | null | ||||||||||||
| Performance | |||||||||||||
| Download Time (ms) | 0 | ||||||||||||
| TTFB (ms) | 0 | ||||||||||||
| Download Size (bytes) | 81,396 | ||||||||||||
| Location | |||||||||||||
| Host ID | 152 (laksa152) | ||||||||||||
| Partition ID | 74 | ||||||||||||
| Root Hash | 17790707453426894952 | ||||||||||||
| Unparsed URL | org,wikipedia!ko,/wiki/%EB%82%A8%EB%8C%80%EB%AC%B8%EC%8B%9C%EC%9E%A5 s443 | ||||||||||||