🕷️ Crawler Inspector

URL Lookup

Direct Parameter Lookup

Raw Queries and Responses

1. Shard Calculation

Query:
Response:
Calculated Shard: 143 (from laksa110)

2. Crawled Status Check

Query:
Response:

3. Robots.txt Check

Query:
Response:

4. Spam/Ban Check

Query:
Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled

📍
LOCATION
Host 143 · Partition 15
laksa143
7740706156215403143
📄
INDEXABLE
CRAWLED
15 days ago
🤖
ROBOTS ALLOWED

Page Info Filters

FilterStatusConditionDetails
HTTP statusPASSdownload_http_code = 200HTTP 200
Age cutoffPASSdownload_stamp > now() - 6 MONTH0.5 months ago
History dropPASSisNull(history_drop_reason)No drop reason
Spam/banPASSfh_dont_index != 1 AND ml_spam_score = 0ml_spam_score=0
CanonicalPASSmeta_canonical IS NULL OR = '' OR = src_unparsedNot set

Page Details

PropertyValue
URLhttps://www.insurancejournal.com/news/international/2024/06/20/780456.htm
Last Crawled2026-05-19 12:21:19 (15 days ago)
First Indexed2024-06-20 10:04:09 (1 year ago)
HTTP Status Code200
Content
Meta TitleSingapore Highlights Banks as Posing Highest Money Laundering Risk
Meta DescriptionSingapore's banking sector, including wealth management, poses the highest money laundering risk in the city-state, the government said in a money
Meta Canonicalnull
Boilerpipe Text
heavy column, fetched on demand
Markdown
heavy column, fetched on demand
Readable Markdown
heavy column, fetched on demand
ML Classification
ML Categories
/Finance
95.6%
/Finance/Banking
95.1%
/Law_and_Government
58.8%
/Finance/Banking/Other
53.0%
/Law_and_Government/Legal
48.2%
/Law_and_Government/Legal/Business_and_Corporate_Law
34.3%
/News
12.0%
/News/Business_News
11.0%
Raw JSON
{
    "/Finance": 956,
    "/Finance/Banking": 951,
    "/Law_and_Government": 588,
    "/Finance/Banking/Other": 530,
    "/Law_and_Government/Legal": 482,
    "/Law_and_Government/Legal/Business_and_Corporate_Law": 343,
    "/News": 120,
    "/News/Business_News": 110
}
ML Page Types
/Article
99.8%
/Article/News_Update
99.7%
Raw JSON
{
    "/Article": 998,
    "/Article/News_Update": 997
}
ML Intent Types
Informational
99.9%
Raw JSON
{
    "Informational": 999
}
Content Metadata
Languageen-us
AuthorYantoultra Ngui
Publish Time2024-06-20 08:44:17 (1 year ago)
Original Publish Time2024-06-20 08:44:17 (1 year ago)
RepublishedNo
Word Count (Total)1,062
Word Count (Content)400
Links
External Links25
Internal Links75
Technical SEO
Meta NofollowNo
Meta NoarchiveNo
JS RenderedYes
Redirect Targetnull
Performance
Download Time (ms)348
TTFB (ms)348
Download Size (bytes)17,320
Location
Host ID143 (laksa143)
Partition ID15
Root Hash7740706156215403143
Unparsed URLcom,insurancejournal!www,/news/international/2024/06/20/780456.htm s443