ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | FAIL | download_stamp > now() - 6 MONTH | 9.5 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value |
|---|---|
| URL | https://medium.com/data-science/building-a-question-answering-system-part-1-9388aadff507 |
| Last Crawled | 2025-08-23 05:05:11 (9 months ago) |
| First Indexed | not set |
| HTTP Status Code | 200 |
| Content | |
| Meta Title | Building a Question-Answering System from Scratch— Part 1 | by Alvira Swalin | TDS Archive | Medium |
| Meta Description | As my Masters is coming to an end, I wanted to work on an interesting NLP project where I can use all the techniques(not exactly) I have learned at USF. With the help of my professors and discussions… |
| Meta Canonical | null |
| Boilerpipe Text | heavy column, fetched on demand |
| Markdown | heavy column, fetched on demand |
| Readable Markdown | heavy column, fetched on demand |
| ML Classification | |
| ML Categories | null |
| ML Page Types | null |
| ML Intent Types | null |
| Content Metadata | |
| Language | en |
| Author | Alvira Swalin |
| Publish Time | 2018-06-01 22:45:11 (8 years ago) |
| Original Publish Time | 2018-06-01 22:45:11 (8 years ago) |
| Republished | No |
| Word Count (Total) | 2,226 |
| Word Count (Content) | 1,574 |
| Links | |
| External Links | 21 |
| Internal Links | 34 |
| Technical SEO | |
| Meta Nofollow | No |
| Meta Noarchive | Yes |
| JS Rendered | No |
| Redirect Target | null |
| Performance | |
| Download Time (ms) | 801 |
| TTFB (ms) | 800 |
| Download Size (bytes) | 43,812 |
| Location | |
| Host ID | 77 (laksa077) |
| Partition ID | 31 |
| Root Hash | 13179037029838926277 |
| Unparsed URL | com,medium!/data-science/building-a-question-answering-system-part-1-9388aadff507 s443 |