ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 0.2 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| URL | https://realpython.com/nltk-nlp-python/ | |||||||||
| Last Crawled | 2026-05-29 03:17:57 (5 days ago) | |||||||||
| First Indexed | 2021-04-22 08:54:15 (5 years ago) | |||||||||
| HTTP Status Code | 200 | |||||||||
| Content | ||||||||||
| Meta Title | Natural Language Processing With Python's NLTK Package – Real Python | |||||||||
| Meta Description | In this beginner-friendly tutorial, you'll take your first steps with Natural Language Processing (NLP) and Python's Natural Language Toolkit (NLTK). You'll learn how to process unstructured data in order to be able to analyze it and draw conclusions from it. | |||||||||
| Meta Canonical | null | |||||||||
| Boilerpipe Text | heavy column, fetched on demand | |||||||||
| Markdown | heavy column, fetched on demand | |||||||||
| Readable Markdown | heavy column, fetched on demand | |||||||||
| ML Classification | ||||||||||
| ML Categories |
Raw JSON{
"/Computers_and_Electronics": 981,
"/Computers_and_Electronics/Programming": 904,
"/Computers_and_Electronics/Programming/Scripting_Languages": 870
} | |||||||||
| ML Page Types |
Raw JSON{
"/Article": 998,
"/Article/Tutorial_or_Guide": 994
} | |||||||||
| ML Intent Types |
Raw JSON{
"Informational": 999
} | |||||||||
| Content Metadata | ||||||||||
| Language | en | |||||||||
| Author | Real Python | |||||||||
| Publish Time | not set | |||||||||
| Original Publish Time | 2021-04-22 08:54:15 (5 years ago) | |||||||||
| Republished | No | |||||||||
| Word Count (Total) | 9,057 | |||||||||
| Word Count (Content) | 4,996 | |||||||||
| Links | ||||||||||
| External Links | 55 | |||||||||
| Internal Links | 119 | |||||||||
| Technical SEO | ||||||||||
| Meta Nofollow | No | |||||||||
| Meta Noarchive | No | |||||||||
| JS Rendered | Yes | |||||||||
| Redirect Target | null | |||||||||
| Performance | ||||||||||
| Download Time (ms) | 249 | |||||||||
| TTFB (ms) | 208 | |||||||||
| Download Size (bytes) | 40,700 | |||||||||
| Location | ||||||||||
| Host ID | 71 (laksa071) | |||||||||
| Partition ID | 28 | |||||||||
| Root Hash | 13351397557425671 | |||||||||
| Unparsed URL | com,realpython!/nltk-nlp-python/ s443 | |||||||||