ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 1.6 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value |
|---|---|
| URL | https://www.cnblogs.com/ysngki/p/17814160.html |
| Last Crawled | 2026-04-17 19:36:45 (1 month ago) |
| First Indexed | 2023-11-30 05:29:46 (2 years ago) |
| HTTP Status Code | 200 |
| Content | |
| Meta Title | Fairseq 机器翻译全流程一文速通 (NMT, WMT, translation) - ysngki - 博客园 |
| Meta Description | 最新编辑于:2024年8月30日 一、摘要 fairseq 是个常用的机器翻译项目。它的优化很好,但代码晦涩难懂,限制了我们的使用。 本文旨在梳理如下流程:1)准备 WMT23 的数据 (其余生成任务皆可类比),2)训练模型,3)用 sacrebleu、COMET-22 评测模型。 不想要 wmt |
| Meta Canonical | null |
| Boilerpipe Text | heavy column, fetched on demand |
| Markdown | heavy column, fetched on demand |
| Readable Markdown | heavy column, fetched on demand |
| ML Classification | |
| ML Categories | null |
| ML Page Types | null |
| ML Intent Types | null |
| Content Metadata | |
| Language | zh-cn |
| Author | null |
| Publish Time | not set |
| Original Publish Time | 2023-11-30 05:29:46 (2 years ago) |
| Republished | No |
| Word Count (Total) | 1,767 |
| Word Count (Content) | 1,703 |
| Links | |
| External Links | 15 |
| Internal Links | 27 |
| Technical SEO | |
| Meta Nofollow | No |
| Meta Noarchive | No |
| JS Rendered | No |
| Redirect Target | null |
| Performance | |
| Download Time (ms) | 1,434 |
| TTFB (ms) | 1,317 |
| Download Size (bytes) | 17,516 |
| Location | |
| Host ID | 46 (laksa046) |
| Partition ID | 26 |
| Root Hash | 10938660598884985246 |
| Unparsed URL | com,cnblogs!www,/ysngki/p/17814160.html s443 |