ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 1.2 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| URL | https://docs.pytorch.org/tutorials/intermediate/nlp_from_scratch_index.html | |||||||||
| Last Crawled | 2026-04-28 00:00:28 (1 month ago) | |||||||||
| First Indexed | 2025-06-30 23:25:20 (11 months ago) | |||||||||
| HTTP Status Code | 200 | |||||||||
| Content | ||||||||||
| Meta Title | NLP from Scratch — PyTorch Tutorials 2.11.0+cu130 documentation | |||||||||
| Meta Description | null | |||||||||
| Meta Canonical | null | |||||||||
| Boilerpipe Text | heavy column, fetched on demand | |||||||||
| Markdown | heavy column, fetched on demand | |||||||||
| Readable Markdown | heavy column, fetched on demand | |||||||||
| ML Classification | ||||||||||
| ML Categories |
Raw JSON{
"/Computers_and_Electronics": 980,
"/Computers_and_Electronics/Programming": 548,
"/Computers_and_Electronics/Programming/Development_Tools": 297
} | |||||||||
| ML Page Types |
Raw JSON{
"/Article": 790,
"/Article/Tutorial_or_Guide": 786
} | |||||||||
| ML Intent Types |
Raw JSON{
"Informational": 999
} | |||||||||
| Content Metadata | ||||||||||
| Language | en | |||||||||
| Author | null | |||||||||
| Publish Time | 2022-07-20 23:02:43 (3 years ago) | |||||||||
| Original Publish Time | 2022-07-20 23:02:43 (3 years ago) | |||||||||
| Republished | No | |||||||||
| Word Count (Total) | 2,312 | |||||||||
| Word Count (Content) | 102 | |||||||||
| Links | ||||||||||
| External Links | 25 | |||||||||
| Internal Links | 170 | |||||||||
| Technical SEO | ||||||||||
| Meta Nofollow | No | |||||||||
| Meta Noarchive | No | |||||||||
| JS Rendered | Yes | |||||||||
| Redirect Target | null | |||||||||
| Performance | ||||||||||
| Download Time (ms) | 92 | |||||||||
| TTFB (ms) | 85 | |||||||||
| Download Size (bytes) | 27,930 | |||||||||
| Location | ||||||||||
| Host ID | 114 (laksa114) | |||||||||
| Partition ID | 47 | |||||||||
| Root Hash | 14416670112284949514 | |||||||||
| Unparsed URL | org,pytorch!docs,/tutorials/intermediate/nlp_from_scratch_index.html s443 | |||||||||