🕷️ Crawler Inspector

URL Lookup

Direct Parameter Lookup

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 111 (from laksa155)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled

📍
LOCATION
Host 111 · Partition 68
laksa111
1761839161268513711
🚫
NOT INDEXABLE
CRAWLED
2 months ago
🤖
ROBOTS ALLOWED

Page Info Filters

FilterStatusConditionDetails
HTTP statusPASSdownload_http_code = 200HTTP 200
Age cutoffPASSdownload_stamp > now() - 6 MONTH2.2 months ago
History dropPASSisNull(history_drop_reason)No drop reason
Spam/banPASSfh_dont_index != 1 AND ml_spam_score = 0ml_spam_score=0
CanonicalFAILmeta_canonical IS NULL OR = '' OR = src_unparsedru,kommersant!www,/doc/8120995 s443

Page Details

PropertyValue
URLhttps://www.kommersant.ru/doc/8021948
Last Crawled2026-03-29 04:03:32 (2 months ago)
First Indexed2025-09-28 03:41:00 (8 months ago)
HTTP Status Code200
Content
Meta TitleЛионеля Месси привлекли к суду за громкие слова - Коммерсантъ
Meta DescriptionВосьмикратного обладателя «Золотого мяча» обвинили в «недобросовестной торговой практике»
Meta Canonicalru,kommersant!www,/doc/8120995 s443
Boilerpipe Text
heavy column, fetched on demand
Markdown
heavy column, fetched on demand
Readable Markdown
heavy column, fetched on demand
ML Classification
ML Categories
/Law_and_Government
60.3%
/Law_and_Government/Legal
59.7%
/Law_and_Government/Legal/Business_and_Corporate_Law
51.2%
/News
40.7%
/News/Politics
29.5%
/News/Politics/Campaigns_and_Elections
25.4%
Raw JSON
{
    "/Law_and_Government": 603,
    "/Law_and_Government/Legal": 597,
    "/Law_and_Government/Legal/Business_and_Corporate_Law": 512,
    "/News": 407,
    "/News/Politics": 295,
    "/News/Politics/Campaigns_and_Elections": 254
}
ML Page Types
/Article
95.3%
/Article/News_Update
95.2%
Raw JSON
{
    "/Article": 953,
    "/Article/News_Update": 952
}
ML Intent Types
Informational
99.8%
Raw JSON
{
    "Informational": 998
}
Content Metadata
Languageru
AuthorНаталия Портякова
Publish Time2025-09-05 17:20:54 (9 months ago)
Original Publish Time2025-09-05 17:20:54 (9 months ago)
RepublishedNo
Word Count (Total)16,607
Word Count (Content)3,397
Links
External Links0
Internal Links2
Technical SEO
Meta NofollowNo
Meta NoarchiveYes
JS RenderedYes
Redirect Targetnull
Performance
Download Time (ms)727
TTFB (ms)652
Download Size (bytes)52,283
Location
Host ID111 (laksa111)
Partition ID68
Root Hash1761839161268513711
Unparsed URLru,kommersant!www,/doc/8021948 s443