ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | FAIL | download_http_code = 200 | HTTP 403 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 3.8 months ago |
| History drop | FAIL | isNull(history_drop_reason) | oldunavailable |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |

| Property | Value |
|---|---|
| URL | https://direct.mit.edu/dint/article/5/3/707/115133/The-State-of-the-Art-of-Natural-Language |
| Last Crawled | 2025-12-20 08:45:00 (3 months ago) |
| First Indexed | not set |
| HTTP Status Code | 403 |
| Meta Title | null |
| Meta Description | null |
| Meta Canonical | null |
| Boilerpipe Text | null |
| Markdown | null |
| Readable Markdown | null |
| Shard | 180 (laksa) |
| Root Hash | 10722954425220430980 |
| Unparsed URL | edu,mit!direct,/dint/article/5/3/707/115133/The-State-of-the-Art-of-Natural-Language s443 |