ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 0.1 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| URL | https://www.cnn.com/2020/11/16/tech/spacex-nasa-iss-docking-scn | ||||||||||||
| Last Crawled | 2026-05-31 01:16:15 (3 days ago) | ||||||||||||
| First Indexed | 2025-05-01 14:35:53 (1 year ago) | ||||||||||||
| HTTP Status Code | 200 | ||||||||||||
| Content | |||||||||||||
| Meta Title | SpaceX-NASA mission: Four astronauts arrive at International Space Station | CNN Business | ||||||||||||
| Meta Description | The SpaceX Crew Dragon spacecraft that launched from Florida’s Kennedy Space Center with four astronauts on board Sunday night safely docked with the International Space Station around 11 p.m. ET Monday. | ||||||||||||
| Meta Canonical | null | ||||||||||||
| Boilerpipe Text | heavy column, fetched on demand | ||||||||||||
| Markdown | heavy column, fetched on demand | ||||||||||||
| Readable Markdown | heavy column, fetched on demand | ||||||||||||
| ML Classification | |||||||||||||
| ML Categories |
Raw JSON{
"/Science": 945,
"/Science/Astronomy": 939,
"/News": 201,
"/News/Technology_News": 182
} | ||||||||||||
| ML Page Types |
Raw JSON{
"/Article": 986,
"/Article/News_Update": 985
} | ||||||||||||
| ML Intent Types |
Raw JSON{
"Informational": 988
} | ||||||||||||
| Content Metadata | |||||||||||||
| Language | en | ||||||||||||
| Author | Jackie Wattles | ||||||||||||
| Publish Time | 2020-11-16 23:45:00 (5 years ago) | ||||||||||||
| Original Publish Time | 2020-11-16 23:45:00 (5 years ago) | ||||||||||||
| Republished | No | ||||||||||||
| Word Count (Total) | 2,317 | ||||||||||||
| Word Count (Content) | 0 | ||||||||||||
| Links | |||||||||||||
| External Links | 18 | ||||||||||||
| Internal Links | 270 | ||||||||||||
| Technical SEO | |||||||||||||
| Meta Nofollow | No | ||||||||||||
| Meta Noarchive | No | ||||||||||||
| JS Rendered | Yes | ||||||||||||
| Redirect Target | null | ||||||||||||
| Performance | |||||||||||||
| Download Time (ms) | 1,363 | ||||||||||||
| TTFB (ms) | 1,262 | ||||||||||||
| Download Size (bytes) | 798,044 | ||||||||||||
| Location | |||||||||||||
| Host ID | 51 (laksa051) | ||||||||||||
| Partition ID | 20 | ||||||||||||
| Root Hash | 2312100192101524051 | ||||||||||||
| Unparsed URL | com,cnn!www,/2020/11/16/tech/spacex-nasa-iss-docking-scn s443 | ||||||||||||