🕷️ Crawler Inspector

URL Lookup

Direct Parameter Lookup

Raw Queries and Responses

1. Shard Calculation

Query:

Response:

Calculated Shard: 23 (from laksa090)

2. Crawled Status Check

Query:

curl -X POST \
  'http://laksa023.int.ahrefs:8124/' \
  -H 'Content-Type: text/plain' \
  -H 'X-ClickHouse-Database: crawler3' \
  -H 'Authorization: Basic YXBpOg==' \
  -d 'SELECT getAhrefsURLFromUnparsed(src_unparsed) AS found_url, ifNull(toUnixTimestamp(download_stamp), 0) AS crawl_time, ifNull(toUnixTimestamp(props_url_first_seen), 0) AS first_indexed_time, download_http_code AS http_code, src_unparsed AS src_unparsed, src_root_hash AS src_root_hash, history_drop_reason AS history_drop_reason, meta_title AS meta_title, meta_descriptions AS meta_descriptions, attrs_boilerpipe_text AS attrs_boilerpipe_text, attrs_markdown AS attrs_markdown, attrs_readable_markdown AS attrs_readable_markdown, meta_canonical AS meta_canonical FROM crawler3.page_info_local FINAL PREWHERE (src_root_hash, src_unparsed) IN ((getAhrefsRootHashFromUnparsed(getAhrefsUnparsedNoserviceFromURL(\'https://www.altcademy.com/blog/how-to-drop-nan-values-in-pandas/\')), getAhrefsUnparsedNoserviceFromURL(\'https://www.altcademy.com/blog/how-to-drop-nan-values-in-pandas/\'))) FORMAT JSONEachRow'

Response:

{"found_url":"https:\/\/www.altcademy.com\/blog\/how-to-drop-nan-values-in-pandas\/","crawl_time":1774943841,"first_indexed_time":1705030028,"http_code":200,"src_unparsed":"com,altcademy!www,\/blog\/how-to-drop-nan-values-in-pandas\/ s443","src_root_hash":"13523176219864139623","history_drop_reason":null,"meta_title":"How to drop nan values in Pandas","meta_descriptions":[],"attrs_boilerpipe_text":"Understanding NaN Values in Pandas\nWhen you're working with data in Python, using the Pandas library is like having a Swiss Army knife for data manipulation. However, sometimes your data isn't perfect. It might contain gaps or \"holes\", known as missing values. In Pandas, these missing pieces are often represented as\nNaN\n, which stands for \"Not a Number\". It's a special floating-point value recognized by all systems that use the standard IEEE floating-point representation.\nThink of\nNaN\nlike a placeholder for something that is supposed to be a number but isn't there. Imagine you have a basket of fruits with labels on each fruit, but some labels have fallen off. Those fruits without labels could be thought of as\nNaN\nbecause, like the missing information, we know there's supposed to be something there, but it's just not.\nWhy Drop NaN Values?\nBefore we dive into how to drop\nNaN\nvalues, let's discuss why you might want to do this.\nNaN\nvalues can be problematic because they can distort statistical calculations and cause errors in machine learning models. It's like trying to make a fruit salad with some fruits missing; your salad won't be complete, and it won't taste as expected.\nSometimes, you can fill in these missing values with estimates or other data, but other times it's better to just remove them. Removing\nNaN\nvalues simplifies the dataset and can make your analysis more straightforward.\nDropping NaN Values with\ndropna()\nPandas provides a powerful method called\ndropna()\nto deal with missing values. This method scans through your DataFrame (a kind of data table in Pandas), finds the\nNaN\nvalues, and drops the rows or columns that contain them.\nHere's a basic example:\nimport pandas as pd\n\n# Creating a DataFrame with NaN values\ndata = {'Name': ['Anna', 'Bob', 'Charles', None],\n        'Age': [28, None, 30, 22],\n        'Gender': ['F', 'M', None, 'M']}\ndf = pd.DataFrame(data)\n\n# Dropping rows with any NaN values\ncleaned_df = df.dropna()\nprint(cleaned_df)\nThis code will output a DataFrame without any rows that had\nNaN\nvalues:\nName   Age Gender\n0    Anna  28.0      F\n2  Charles  30.0   None\nNotice that Charles's gender is still\nNone\n. That's because\ndropna()\nby default drops entire rows where any\nNaN\nis present. If we want to be more specific, we can use parameters.\nParameters of\ndropna()\nThe\ndropna()\nmethod can be fine-tuned with parameters. Two commonly used parameters are\naxis\nand\nhow\n.\naxis\n: Determines whether to drop rows or columns.\naxis=0\nor\naxis='index'\n(default): Drop rows with\nNaN\n.\naxis=1\nor\naxis='columns'\n: Drop columns with\nNaN\n.\nhow\n: Determines if a row or column should be dropped when it has at least one\nNaN\nor only if all values are\nNaN\n.\nhow='any'\n(default): Drop if any\nNaN\nvalues are present.\nhow='all'\n: Drop if all values are\nNaN\n.\nLet's see\naxis\nand\nhow\nin action:\n# Dropping columns with any NaN values\ncleaned_df_columns = df.dropna(axis='columns')\nprint(cleaned_df_columns)\n\n# Dropping rows where all values are NaN\ncleaned_df_all = df.dropna(how='all')\nprint(cleaned_df_all)\nThe first print statement will give you a DataFrame without the 'Age' column since it's the only one with\nNaN\nvalues. The second print statement won't change anything in our example because there's no row where all values are\nNaN\n.\nHandling NaN Values in a Series\nA Series is like a single column in your DataFrame, a list of data with an index. Dropping\nNaN\nvalues from a Series is similar to dropping them from a DataFrame:\n# Creating a Series with NaN values\nseries = pd.Series([1, 2, None, 4, None])\n\n# Dropping NaN values\ncleaned_series = series.dropna()\nprint(cleaned_series)\nThis will output a Series without the\nNone\nvalues:\n0    1.0\n1    2.0\n3    4.0\ndtype: float64\nFilling NaN Values Instead of Dropping\nSometimes, instead of dropping\nNaN\nvalues, you might want to replace them with a specific value. This is known as imputation. Pandas provides the\nfillna()\nmethod to do this. For example, you might want to replace all\nNaN\nvalues with the average of the non-missing values:\n# Replace NaN with the mean of the 'Age' column\ndf['Age'].fillna(df['Age'].mean(), inplace=True)\nprint(df)\nThis will fill the\nNaN\nvalue in the 'Age' column with the average age of Anna and Charles.\nA Real-World Example\nLet's consider a more realistic scenario where you have a dataset of survey responses, and not all questions were answered by every respondent. You might want to drop rows where crucial information is missing, like the respondent's age or gender, but keep rows where less important information is missing.\n# A more complex DataFrame\nsurvey_data = {\n    'Age': [25, None, 37, 22],\n    'Gender': ['F', 'M', 'F', None],\n    'Income': [50000, None, 80000, 75000],\n    'Satisfaction': [4, 3, None, 5]\n}\n\nsurvey_df = pd.DataFrame(survey_data)\n\n# Dropping rows where 'Age' or 'Gender' is NaN\nimportant_info_df = survey_df.dropna(subset=['Age', 'Gender'])\nprint(important_info_df)\nThis will keep rows where 'Income' or 'Satisfaction' might be\nNaN\n, but drop rows where 'Age' or 'Gender' is\nNaN\n.\nConclusion: Keeping Your Data Clean\nDropping\nNaN\nvalues in Pandas is like weeding a garden. You remove the unwanted elements to allow the rest of your data to flourish without interference. By using the\ndropna()\nmethod, you can ensure that your analyses are performed on complete cases, leading to more reliable results.\nRemember, though, that dropping data should not be done carelessly. Always consider the context of your data and whether dropping or imputing makes more sense for your specific situation. With the tools Pandas provides, you have the flexibility to handle missing data in a way that best suits your garden of information, helping it grow into a bountiful harvest of insights.","attrs_markdown":"[![Altcademy Blog](https:\/\/www.altcademy.com\/blog\/content\/images\/2022\/07\/for-enterprise--600---72-px--4.png)](https:\/\/www.altcademy.com\/?ref=blog)\n\n- [Blog Home](https:\/\/www.altcademy.com\/blog\/)\n- [Featured](https:\/\/www.altcademy.com\/blog\/tag\/featured\/)\n- [Career](https:\/\/www.altcademy.com\/blog\/tag\/career\/)\n- [How-To](https:\/\/www.altcademy.com\/blog\/tag\/how-to\/)\n- [Glossary](https:\/\/www.altcademy.com\/blog\/tag\/programming-glossary\/)\n- [Enroll Now](https:\/\/www.altcademy.com\/programs?ref=blog)\n#### [Altcademy](https:\/\/www.altcademy.com\/?ref=blog) - a  [![Forbes magazine logo](https:\/\/www.altcademy.com\/blog\/assets\/images\/forbes-logo-min.png?v=cb7a50b602) Best Coding Bootcamp 2023](https:\/\/www.forbes.com\/advisor\/education\/best-coding-bootcamps\/?award=best-coding-bootcamps-2023-altcademy)\n[How To](https:\/\/www.altcademy.com\/blog\/tag\/how-to\/)\n\n# How to drop nan values in Pandas\n![Altcademy Team](https:\/\/www.gravatar.com\/avatar\/5a8fd729d773c26ed96e4d39bdd0bbc6?s=250&r=x&d=mp)\n\n#### [Altcademy Team](https:\/\/www.altcademy.com\/blog\/author\/altcademy\/)\nJan 11, 2024\n\n4 min\n\n## Understanding NaN Values in Pandas\nWhen you're working with data in Python, using the Pandas library is like having a Swiss Army knife for data manipulation. However, sometimes your data isn't perfect. It might contain gaps or \"holes\", known as missing values. In Pandas, these missing pieces are often represented as `NaN`, which stands for \"Not a Number\". It's a special floating-point value recognized by all systems that use the standard IEEE floating-point representation.\n\nThink of `NaN` like a placeholder for something that is supposed to be a number but isn't there. Imagine you have a basket of fruits with labels on each fruit, but some labels have fallen off. Those fruits without labels could be thought of as `NaN` because, like the missing information, we know there's supposed to be something there, but it's just not.\n\n## Why Drop NaN Values?\nBefore we dive into how to drop `NaN` values, let's discuss why you might want to do this. `NaN` values can be problematic because they can distort statistical calculations and cause errors in machine learning models. It's like trying to make a fruit salad with some fruits missing; your salad won't be complete, and it won't taste as expected.\n\nSometimes, you can fill in these missing values with estimates or other data, but other times it's better to just remove them. Removing `NaN` values simplifies the dataset and can make your analysis more straightforward.\n\n## Dropping NaN Values with `dropna()`\nPandas provides a powerful method called `dropna()` to deal with missing values. This method scans through your DataFrame (a kind of data table in Pandas), finds the `NaN` values, and drops the rows or columns that contain them.\n\nHere's a basic example:\n```\nimport pandas as pd\n\n# Creating a DataFrame with NaN values\ndata = {'Name': ['Anna', 'Bob', 'Charles', None],\n        'Age': [28, None, 30, 22],\n        'Gender': ['F', 'M', None, 'M']}\ndf = pd.DataFrame(data)\n\n# Dropping rows with any NaN values\ncleaned_df = df.dropna()\nprint(cleaned_df)\n```\nThis code will output a DataFrame without any rows that had `NaN` values:\n```\n     Name   Age Gender\n0    Anna  28.0      F\n2  Charles  30.0   None\n```\nNotice that Charles's gender is still `None`. That's because `dropna()` by default drops entire rows where any `NaN` is present. If we want to be more specific, we can use parameters.\n\n## Parameters of `dropna()`\nThe `dropna()` method can be fine-tuned with parameters. Two commonly used parameters are `axis` and `how`.\n\n- `axis`: Determines whether to drop rows or columns.\n- `axis=0` or `axis='index'` (default): Drop rows with `NaN`.\n\n`axis=1` or `axis='columns'`: Drop columns with `NaN`.\n\n`how`: Determines if a row or column should be dropped when it has at least one `NaN` or only if all values are `NaN`.\n\n- `how='any'` (default): Drop if any `NaN` values are present.\n- `how='all'`: Drop if all values are `NaN`.\n\nLet's see `axis` and `how` in action:\n```\n# Dropping columns with any NaN values\ncleaned_df_columns = df.dropna(axis='columns')\nprint(cleaned_df_columns)\n\n# Dropping rows where all values are NaN\ncleaned_df_all = df.dropna(how='all')\nprint(cleaned_df_all)\n```\nThe first print statement will give you a DataFrame without the 'Age' column since it's the only one with `NaN` values. The second print statement won't change anything in our example because there's no row where all values are `NaN`.\n\n## Handling NaN Values in a Series\nA Series is like a single column in your DataFrame, a list of data with an index. Dropping `NaN` values from a Series is similar to dropping them from a DataFrame:\n```\n# Creating a Series with NaN values\nseries = pd.Series([1, 2, None, 4, None])\n\n# Dropping NaN values\ncleaned_series = series.dropna()\nprint(cleaned_series)\n```\nThis will output a Series without the `None` values:\n```\n0    1.0\n1    2.0\n3    4.0\ndtype: float64\n```\n## Filling NaN Values Instead of Dropping\nSometimes, instead of dropping `NaN` values, you might want to replace them with a specific value. This is known as imputation. Pandas provides the `fillna()` method to do this. For example, you might want to replace all `NaN` values with the average of the non-missing values:\n```\n# Replace NaN with the mean of the 'Age' column\ndf['Age'].fillna(df['Age'].mean(), inplace=True)\nprint(df)\n```\nThis will fill the `NaN` value in the 'Age' column with the average age of Anna and Charles.\n\n## A Real-World Example\nLet's consider a more realistic scenario where you have a dataset of survey responses, and not all questions were answered by every respondent. You might want to drop rows where crucial information is missing, like the respondent's age or gender, but keep rows where less important information is missing.\n```\n# A more complex DataFrame\nsurvey_data = {\n    'Age': [25, None, 37, 22],\n    'Gender': ['F', 'M', 'F', None],\n    'Income': [50000, None, 80000, 75000],\n    'Satisfaction': [4, 3, None, 5]\n}\n\nsurvey_df = pd.DataFrame(survey_data)\n\n# Dropping rows where 'Age' or 'Gender' is NaN\nimportant_info_df = survey_df.dropna(subset=['Age', 'Gender'])\nprint(important_info_df)\n```\nThis will keep rows where 'Income' or 'Satisfaction' might be `NaN`, but drop rows where 'Age' or 'Gender' is `NaN`.\n\n## Conclusion: Keeping Your Data Clean\nDropping `NaN` values in Pandas is like weeding a garden. You remove the unwanted elements to allow the rest of your data to flourish without interference. By using the `dropna()` method, you can ensure that your analyses are performed on complete cases, leading to more reliable results.\n\nRemember, though, that dropping data should not be done carelessly. Always consider the context of your data and whether dropping or imputing makes more sense for your specific situation. With the tools Pandas provides, you have the flexibility to handle missing data in a way that best suits your garden of information, helping it grow into a bountiful harvest of insights.\n\n#### Read next\n[How to style two classes in ReactJS as under each other Getting Started Welcome to another tutorial, dear reader! Today, we'll be diving into the world of ReactJS, a popular library used for building interactive user interfaces. Specifically, we're going to explore how to style two classes in ReactJS as under each other. Now, you might be wondering, \"What does it By Altcademy Team Nov 12, 2023](https:\/\/www.altcademy.com\/blog\/how-to-style-two-classes-in-reactjs-as-under-each-other\/)\n\n[How to set options as values from a json object in ReactJS Understanding JSON and its Role in ReactJS Before diving into the main topic, let's quickly understand what JSON is. JSON, an acronym for JavaScript Object Notation, is a lightweight format for storing and transferring data. It's often used when data is sent from a server to a web page. It's By Altcademy Team Nov 12, 2023](https:\/\/www.altcademy.com\/blog\/how-to-set-options-as-values-from-a-json-object-in-reactjs\/)\n\n[How to use ReactJS in atom Getting Started with ReactJS in Atom First and foremost, we need to understand what ReactJS and Atom are. ReactJS is a JavaScript library that helps us to build user interfaces (the parts of a website you interact with). Atom, on the other hand, is a text editor where we write By Altcademy Team Nov 12, 2023](https:\/\/www.altcademy.com\/blog\/how-to-use-reactjs-in-atom\/)\n\n## Learn to code in our 100% online programs\nAltcademy coding bootcamp offers **beginner-friendly, online programs** designed by **industry experts** to help you become a coder. **85%+** of [Altcademy alumni](https:\/\/www.altcademy.com\/alumni?ref=blog) are hired within 6 months after graduation. See [how we teach](https:\/\/www.altcademy.com\/how?ref=blog), or click on one of the following programs to find out more.\n\n[Most Popular Most Popular7 Courses FSWD Front-end Back-end Full-stack Web Development Learn full-stack development with HTML, CSS, JavaScript, React, Ruby and Rails, Computer science fundamentals & programming skills. **VIEW DETAILS**](https:\/\/www.altcademy.com\/programs\/fswd?ref=blog)\n\n[**Upgrade** FSWD to include Python, Data Science, AI Application, TypeScript and more.](https:\/\/www.altcademy.com\/programs\/fsdsai?ref=blog)\n\n[3 Courses FEWD HTML CSS JavaScript Front-end Web Development Learn front-end development with HTML, CSS, JavaScript, and jQuery. Computer science fundamentals & programming skills. **VIEW DETAILS**](https:\/\/www.altcademy.com\/programs\/fewd?ref=blog)\n\n[2 Courses BEWD Database API Testing Back-end Web Development Learn back-end development with Ruby and Rails, M-V-C. Computer science fundamentals with practical programming skills. **VIEW DETAILS**](https:\/\/www.altcademy.com\/programs\/bewd?ref=blog)\n\n## Join the upcoming Cohort and learn web development online\\!\n\n#### Altcademy\nOnline Coding Bootcamp - Become a professional coder\n\n[Enroll now](https:\/\/www.altcademy.com\/enroll?ref=blog)\n\n- [Back to Altcademy.com](https:\/\/www.altcademy.com\/?ref=blog)\n- [Featured](https:\/\/www.altcademy.com\/blog\/tag\/featured\/)\n- [Career](https:\/\/www.altcademy.com\/blog\/tag\/career\/)\n- [Glossary](https:\/\/www.altcademy.com\/blog\/tag\/programming-glossary\/)\n- [JavaScript](https:\/\/www.altcademy.com\/blog\/tag\/javascript\/)\n- [React](https:\/\/www.altcademy.com\/blog\/tag\/react\/)\n- [Python](https:\/\/www.altcademy.com\/blog\/tag\/python\/)\n- [TypeScript](https:\/\/www.altcademy.com\/blog\/tag\/typescript\/)\n- [Enroll in Altcademy](https:\/\/www.altcademy.com\/programs?ref=blog)\n\nAltcademy Blog © 2026. Powered by [Ghost](https:\/\/ghost.org\/)","attrs_readable_markdown":"## Understanding NaN Values in Pandas\nWhen you're working with data in Python, using the Pandas library is like having a Swiss Army knife for data manipulation. However, sometimes your data isn't perfect. It might contain gaps or \"holes\", known as missing values. In Pandas, these missing pieces are often represented as `NaN`, which stands for \"Not a Number\". It's a special floating-point value recognized by all systems that use the standard IEEE floating-point representation.\n\nThink of `NaN` like a placeholder for something that is supposed to be a number but isn't there. Imagine you have a basket of fruits with labels on each fruit, but some labels have fallen off. Those fruits without labels could be thought of as `NaN` because, like the missing information, we know there's supposed to be something there, but it's just not.\n\n## Why Drop NaN Values?\nBefore we dive into how to drop `NaN` values, let's discuss why you might want to do this. `NaN` values can be problematic because they can distort statistical calculations and cause errors in machine learning models. It's like trying to make a fruit salad with some fruits missing; your salad won't be complete, and it won't taste as expected.\n\nSometimes, you can fill in these missing values with estimates or other data, but other times it's better to just remove them. Removing `NaN` values simplifies the dataset and can make your analysis more straightforward.\n\n## Dropping NaN Values with `dropna()`\nPandas provides a powerful method called `dropna()` to deal with missing values. This method scans through your DataFrame (a kind of data table in Pandas), finds the `NaN` values, and drops the rows or columns that contain them.\n\nHere's a basic example:\n```\nimport pandas as pd\n\n# Creating a DataFrame with NaN values\ndata = {'Name': ['Anna', 'Bob', 'Charles', None],\n        'Age': [28, None, 30, 22],\n        'Gender': ['F', 'M', None, 'M']}\ndf = pd.DataFrame(data)\n\n# Dropping rows with any NaN values\ncleaned_df = df.dropna()\nprint(cleaned_df)\n```\nThis code will output a DataFrame without any rows that had `NaN` values:\n```\n     Name   Age Gender\n0    Anna  28.0      F\n2  Charles  30.0   None\n```\nNotice that Charles's gender is still `None`. That's because `dropna()` by default drops entire rows where any `NaN` is present. If we want to be more specific, we can use parameters.\n\n## Parameters of `dropna()`\nThe `dropna()` method can be fine-tuned with parameters. Two commonly used parameters are `axis` and `how`.\n\n- `axis`: Determines whether to drop rows or columns.\n- `axis=0` or `axis='index'` (default): Drop rows with `NaN`.\n\n`axis=1` or `axis='columns'`: Drop columns with `NaN`.\n\n`how`: Determines if a row or column should be dropped when it has at least one `NaN` or only if all values are `NaN`.\n\n- `how='any'` (default): Drop if any `NaN` values are present.\n- `how='all'`: Drop if all values are `NaN`.\n\nLet's see `axis` and `how` in action:\n```\n# Dropping columns with any NaN values\ncleaned_df_columns = df.dropna(axis='columns')\nprint(cleaned_df_columns)\n\n# Dropping rows where all values are NaN\ncleaned_df_all = df.dropna(how='all')\nprint(cleaned_df_all)\n```\nThe first print statement will give you a DataFrame without the 'Age' column since it's the only one with `NaN` values. The second print statement won't change anything in our example because there's no row where all values are `NaN`.\n\n## Handling NaN Values in a Series\nA Series is like a single column in your DataFrame, a list of data with an index. Dropping `NaN` values from a Series is similar to dropping them from a DataFrame:\n```\n# Creating a Series with NaN values\nseries = pd.Series([1, 2, None, 4, None])\n\n# Dropping NaN values\ncleaned_series = series.dropna()\nprint(cleaned_series)\n```\nThis will output a Series without the `None` values:\n```\n0    1.0\n1    2.0\n3    4.0\ndtype: float64\n```\n## Filling NaN Values Instead of Dropping\nSometimes, instead of dropping `NaN` values, you might want to replace them with a specific value. This is known as imputation. Pandas provides the `fillna()` method to do this. For example, you might want to replace all `NaN` values with the average of the non-missing values:\n```\n# Replace NaN with the mean of the 'Age' column\ndf['Age'].fillna(df['Age'].mean(), inplace=True)\nprint(df)\n```\nThis will fill the `NaN` value in the 'Age' column with the average age of Anna and Charles.\n\n## A Real-World Example\nLet's consider a more realistic scenario where you have a dataset of survey responses, and not all questions were answered by every respondent. You might want to drop rows where crucial information is missing, like the respondent's age or gender, but keep rows where less important information is missing.\n```\n# A more complex DataFrame\nsurvey_data = {\n    'Age': [25, None, 37, 22],\n    'Gender': ['F', 'M', 'F', None],\n    'Income': [50000, None, 80000, 75000],\n    'Satisfaction': [4, 3, None, 5]\n}\n\nsurvey_df = pd.DataFrame(survey_data)\n\n# Dropping rows where 'Age' or 'Gender' is NaN\nimportant_info_df = survey_df.dropna(subset=['Age', 'Gender'])\nprint(important_info_df)\n```\nThis will keep rows where 'Income' or 'Satisfaction' might be `NaN`, but drop rows where 'Age' or 'Gender' is `NaN`.\n\n## Conclusion: Keeping Your Data Clean\nDropping `NaN` values in Pandas is like weeding a garden. You remove the unwanted elements to allow the rest of your data to flourish without interference. By using the `dropna()` method, you can ensure that your analyses are performed on complete cases, leading to more reliable results.\n\nRemember, though, that dropping data should not be done carelessly. Always consider the context of your data and whether dropping or imputing makes more sense for your specific situation. With the tools Pandas provides, you have the flexibility to handle missing data in a way that best suits your garden of information, helping it grow into a bountiful harvest of insights.","meta_canonical":null}

3. Robots.txt Check

Query:

Response:

4. Spam/Ban Check

Query:

Response:

5. Seen Status Check

ℹ️ Skipped - page is already crawled

📄

INDEXABLE

✅

CRAWLED

11 days ago

🤖

ROBOTS ALLOWED

Page Info Filters

Filter	Status	Condition	Details
HTTP status	PASS	`download_http_code = 200`	HTTP 200
Age cutoff	PASS	`download_stamp > now() - 6 MONTH`	0.4 months ago
History drop	PASS	`isNull(history_drop_reason)`	No drop reason
Spam/ban	PASS	`fh_dont_index != 1 AND ml_spam_score = 0`	ml_spam_score=0
Canonical	PASS	`meta_canonical IS NULL OR = '' OR = src_unparsed`	Not set

Page Details

Property	Value
URL	https://www.altcademy.com/blog/how-to-drop-nan-values-in-pandas/
Last Crawled	2026-03-31 07:57:21 (11 days ago)
First Indexed	2024-01-12 03:27:08 (2 years ago)
HTTP Status Code	200
Meta Title	How to drop nan values in Pandas
Meta Description	null
Meta Canonical	null
Boilerpipe Text	Understanding NaN Values in Pandas When you're working with data in Python, using the Pandas library is like having a Swiss Army knife for data manipulation. However, sometimes your data isn't perfect. It might contain gaps or "holes", known as missing values. In Pandas, these missing pieces are often represented as NaN , which stands for "Not a Number". It's a special floating-point value recognized by all systems that use the standard IEEE floating-point representation. Think of NaN like a placeholder for something that is supposed to be a number but isn't there. Imagine you have a basket of fruits with labels on each fruit, but some labels have fallen off. Those fruits without labels could be thought of as NaN because, like the missing information, we know there's supposed to be something there, but it's just not. Why Drop NaN Values? Before we dive into how to drop NaN values, let's discuss why you might want to do this. NaN values can be problematic because they can distort statistical calculations and cause errors in machine learning models. It's like trying to make a fruit salad with some fruits missing; your salad won't be complete, and it won't taste as expected. Sometimes, you can fill in these missing values with estimates or other data, but other times it's better to just remove them. Removing NaN values simplifies the dataset and can make your analysis more straightforward. Dropping NaN Values with dropna() Pandas provides a powerful method called dropna() to deal with missing values. This method scans through your DataFrame (a kind of data table in Pandas), finds the NaN values, and drops the rows or columns that contain them. Here's a basic example: import pandas as pd # Creating a DataFrame with NaN values data = {'Name': ['Anna', 'Bob', 'Charles', None], 'Age': [28, None, 30, 22], 'Gender': ['F', 'M', None, 'M']} df = pd.DataFrame(data) # Dropping rows with any NaN values cleaned_df = df.dropna() print(cleaned_df) This code will output a DataFrame without any rows that had NaN values: Name Age Gender 0 Anna 28.0 F 2 Charles 30.0 None Notice that Charles's gender is still None . That's because dropna() by default drops entire rows where any NaN is present. If we want to be more specific, we can use parameters. Parameters of dropna() The dropna() method can be fine-tuned with parameters. Two commonly used parameters are axis and how . axis : Determines whether to drop rows or columns. axis=0 or axis='index' (default): Drop rows with NaN . axis=1 or axis='columns' : Drop columns with NaN . how : Determines if a row or column should be dropped when it has at least one NaN or only if all values are NaN . how='any' (default): Drop if any NaN values are present. how='all' : Drop if all values are NaN . Let's see axis and how in action: # Dropping columns with any NaN values cleaned_df_columns = df.dropna(axis='columns') print(cleaned_df_columns) # Dropping rows where all values are NaN cleaned_df_all = df.dropna(how='all') print(cleaned_df_all) The first print statement will give you a DataFrame without the 'Age' column since it's the only one with NaN values. The second print statement won't change anything in our example because there's no row where all values are NaN . Handling NaN Values in a Series A Series is like a single column in your DataFrame, a list of data with an index. Dropping NaN values from a Series is similar to dropping them from a DataFrame: # Creating a Series with NaN values series = pd.Series([1, 2, None, 4, None]) # Dropping NaN values cleaned_series = series.dropna() print(cleaned_series) This will output a Series without the None values: 0 1.0 1 2.0 3 4.0 dtype: float64 Filling NaN Values Instead of Dropping Sometimes, instead of dropping NaN values, you might want to replace them with a specific value. This is known as imputation. Pandas provides the fillna() method to do this. For example, you might want to replace all NaN values with the average of the non-missing values: # Replace NaN with the mean of the 'Age' column df['Age'].fillna(df['Age'].mean(), inplace=True) print(df) This will fill the NaN value in the 'Age' column with the average age of Anna and Charles. A Real-World Example Let's consider a more realistic scenario where you have a dataset of survey responses, and not all questions were answered by every respondent. You might want to drop rows where crucial information is missing, like the respondent's age or gender, but keep rows where less important information is missing. # A more complex DataFrame survey_data = { 'Age': [25, None, 37, 22], 'Gender': ['F', 'M', 'F', None], 'Income': [50000, None, 80000, 75000], 'Satisfaction': [4, 3, None, 5] } survey_df = pd.DataFrame(survey_data) # Dropping rows where 'Age' or 'Gender' is NaN important_info_df = survey_df.dropna(subset=['Age', 'Gender']) print(important_info_df) This will keep rows where 'Income' or 'Satisfaction' might be NaN , but drop rows where 'Age' or 'Gender' is NaN . Conclusion: Keeping Your Data Clean Dropping NaN values in Pandas is like weeding a garden. You remove the unwanted elements to allow the rest of your data to flourish without interference. By using the dropna() method, you can ensure that your analyses are performed on complete cases, leading to more reliable results. Remember, though, that dropping data should not be done carelessly. Always consider the context of your data and whether dropping or imputing makes more sense for your specific situation. With the tools Pandas provides, you have the flexibility to handle missing data in a way that best suits your garden of information, helping it grow into a bountiful harvest of insights.
Markdown	[![Altcademy Blog](https://www.altcademy.com/blog/content/images/2022/07/for-enterprise--600---72-px--4.png)](https://www.altcademy.com/?ref=blog) - [Blog Home](https://www.altcademy.com/blog/) - [Featured](https://www.altcademy.com/blog/tag/featured/) - [Career](https://www.altcademy.com/blog/tag/career/) - [How-To](https://www.altcademy.com/blog/tag/how-to/) - [Glossary](https://www.altcademy.com/blog/tag/programming-glossary/) - [Enroll Now](https://www.altcademy.com/programs?ref=blog) #### [Altcademy](https://www.altcademy.com/?ref=blog) - a [![Forbes magazine logo](https://www.altcademy.com/blog/assets/images/forbes-logo-min.png?v=cb7a50b602) Best Coding Bootcamp 2023](https://www.forbes.com/advisor/education/best-coding-bootcamps/?award=best-coding-bootcamps-2023-altcademy) [How To](https://www.altcademy.com/blog/tag/how-to/) # How to drop nan values in Pandas ![Altcademy Team](https://www.gravatar.com/avatar/5a8fd729d773c26ed96e4d39bdd0bbc6?s=250&r=x&d=mp) #### [Altcademy Team](https://www.altcademy.com/blog/author/altcademy/) Jan 11, 2024 4 min ## Understanding NaN Values in Pandas When you're working with data in Python, using the Pandas library is like having a Swiss Army knife for data manipulation. However, sometimes your data isn't perfect. It might contain gaps or "holes", known as missing values. In Pandas, these missing pieces are often represented as `NaN`, which stands for "Not a Number". It's a special floating-point value recognized by all systems that use the standard IEEE floating-point representation. Think of `NaN` like a placeholder for something that is supposed to be a number but isn't there. Imagine you have a basket of fruits with labels on each fruit, but some labels have fallen off. Those fruits without labels could be thought of as `NaN` because, like the missing information, we know there's supposed to be something there, but it's just not. ## Why Drop NaN Values? Before we dive into how to drop `NaN` values, let's discuss why you might want to do this. `NaN` values can be problematic because they can distort statistical calculations and cause errors in machine learning models. It's like trying to make a fruit salad with some fruits missing; your salad won't be complete, and it won't taste as expected. Sometimes, you can fill in these missing values with estimates or other data, but other times it's better to just remove them. Removing `NaN` values simplifies the dataset and can make your analysis more straightforward. ## Dropping NaN Values with `dropna()` Pandas provides a powerful method called `dropna()` to deal with missing values. This method scans through your DataFrame (a kind of data table in Pandas), finds the `NaN` values, and drops the rows or columns that contain them. Here's a basic example: ``` import pandas as pd # Creating a DataFrame with NaN values data = {'Name': ['Anna', 'Bob', 'Charles', None], 'Age': [28, None, 30, 22], 'Gender': ['F', 'M', None, 'M']} df = pd.DataFrame(data) # Dropping rows with any NaN values cleaned_df = df.dropna() print(cleaned_df) ``` This code will output a DataFrame without any rows that had `NaN` values: ``` Name Age Gender 0 Anna 28.0 F 2 Charles 30.0 None ``` Notice that Charles's gender is still `None`. That's because `dropna()` by default drops entire rows where any `NaN` is present. If we want to be more specific, we can use parameters. ## Parameters of `dropna()` The `dropna()` method can be fine-tuned with parameters. Two commonly used parameters are `axis` and `how`. - `axis`: Determines whether to drop rows or columns. - `axis=0` or `axis='index'` (default): Drop rows with `NaN`. `axis=1` or `axis='columns'`: Drop columns with `NaN`. `how`: Determines if a row or column should be dropped when it has at least one `NaN` or only if all values are `NaN`. - `how='any'` (default): Drop if any `NaN` values are present. - `how='all'`: Drop if all values are `NaN`. Let's see `axis` and `how` in action: ``` # Dropping columns with any NaN values cleaned_df_columns = df.dropna(axis='columns') print(cleaned_df_columns) # Dropping rows where all values are NaN cleaned_df_all = df.dropna(how='all') print(cleaned_df_all) ``` The first print statement will give you a DataFrame without the 'Age' column since it's the only one with `NaN` values. The second print statement won't change anything in our example because there's no row where all values are `NaN`. ## Handling NaN Values in a Series A Series is like a single column in your DataFrame, a list of data with an index. Dropping `NaN` values from a Series is similar to dropping them from a DataFrame: ``` # Creating a Series with NaN values series = pd.Series([1, 2, None, 4, None]) # Dropping NaN values cleaned_series = series.dropna() print(cleaned_series) ``` This will output a Series without the `None` values: ``` 0 1.0 1 2.0 3 4.0 dtype: float64 ``` ## Filling NaN Values Instead of Dropping Sometimes, instead of dropping `NaN` values, you might want to replace them with a specific value. This is known as imputation. Pandas provides the `fillna()` method to do this. For example, you might want to replace all `NaN` values with the average of the non-missing values: ``` # Replace NaN with the mean of the 'Age' column df['Age'].fillna(df['Age'].mean(), inplace=True) print(df) ``` This will fill the `NaN` value in the 'Age' column with the average age of Anna and Charles. ## A Real-World Example Let's consider a more realistic scenario where you have a dataset of survey responses, and not all questions were answered by every respondent. You might want to drop rows where crucial information is missing, like the respondent's age or gender, but keep rows where less important information is missing. ``` # A more complex DataFrame survey_data = { 'Age': [25, None, 37, 22], 'Gender': ['F', 'M', 'F', None], 'Income': [50000, None, 80000, 75000], 'Satisfaction': [4, 3, None, 5] } survey_df = pd.DataFrame(survey_data) # Dropping rows where 'Age' or 'Gender' is NaN important_info_df = survey_df.dropna(subset=['Age', 'Gender']) print(important_info_df) ``` This will keep rows where 'Income' or 'Satisfaction' might be `NaN`, but drop rows where 'Age' or 'Gender' is `NaN`. ## Conclusion: Keeping Your Data Clean Dropping `NaN` values in Pandas is like weeding a garden. You remove the unwanted elements to allow the rest of your data to flourish without interference. By using the `dropna()` method, you can ensure that your analyses are performed on complete cases, leading to more reliable results. Remember, though, that dropping data should not be done carelessly. Always consider the context of your data and whether dropping or imputing makes more sense for your specific situation. With the tools Pandas provides, you have the flexibility to handle missing data in a way that best suits your garden of information, helping it grow into a bountiful harvest of insights. #### Read next [How to style two classes in ReactJS as under each other Getting Started Welcome to another tutorial, dear reader! Today, we'll be diving into the world of ReactJS, a popular library used for building interactive user interfaces. Specifically, we're going to explore how to style two classes in ReactJS as under each other. Now, you might be wondering, "What does it By Altcademy Team Nov 12, 2023](https://www.altcademy.com/blog/how-to-style-two-classes-in-reactjs-as-under-each-other/) [How to set options as values from a json object in ReactJS Understanding JSON and its Role in ReactJS Before diving into the main topic, let's quickly understand what JSON is. JSON, an acronym for JavaScript Object Notation, is a lightweight format for storing and transferring data. It's often used when data is sent from a server to a web page. It's By Altcademy Team Nov 12, 2023](https://www.altcademy.com/blog/how-to-set-options-as-values-from-a-json-object-in-reactjs/) [How to use ReactJS in atom Getting Started with ReactJS in Atom First and foremost, we need to understand what ReactJS and Atom are. ReactJS is a JavaScript library that helps us to build user interfaces (the parts of a website you interact with). Atom, on the other hand, is a text editor where we write By Altcademy Team Nov 12, 2023](https://www.altcademy.com/blog/how-to-use-reactjs-in-atom/) ## Learn to code in our 100% online programs Altcademy coding bootcamp offers beginner-friendly, online programs designed by industry experts to help you become a coder. 85%+ of [Altcademy alumni](https://www.altcademy.com/alumni?ref=blog) are hired within 6 months after graduation. See [how we teach](https://www.altcademy.com/how?ref=blog), or click on one of the following programs to find out more. [Most Popular Most Popular7 Courses FSWD Front-end Back-end Full-stack Web Development Learn full-stack development with HTML, CSS, JavaScript, React, Ruby and Rails, Computer science fundamentals & programming skills. VIEW DETAILS](https://www.altcademy.com/programs/fswd?ref=blog) [Upgrade FSWD to include Python, Data Science, AI Application, TypeScript and more.](https://www.altcademy.com/programs/fsdsai?ref=blog) [3 Courses FEWD HTML CSS JavaScript Front-end Web Development Learn front-end development with HTML, CSS, JavaScript, and jQuery. Computer science fundamentals & programming skills. VIEW DETAILS](https://www.altcademy.com/programs/fewd?ref=blog) [2 Courses BEWD Database API Testing Back-end Web Development Learn back-end development with Ruby and Rails, M-V-C. Computer science fundamentals with practical programming skills. VIEW DETAILS](https://www.altcademy.com/programs/bewd?ref=blog) ## Join the upcoming Cohort and learn web development online\! #### Altcademy Online Coding Bootcamp - Become a professional coder [Enroll now](https://www.altcademy.com/enroll?ref=blog) - [Back to Altcademy.com](https://www.altcademy.com/?ref=blog) - [Featured](https://www.altcademy.com/blog/tag/featured/) - [Career](https://www.altcademy.com/blog/tag/career/) - [Glossary](https://www.altcademy.com/blog/tag/programming-glossary/) - [JavaScript](https://www.altcademy.com/blog/tag/javascript/) - [React](https://www.altcademy.com/blog/tag/react/) - [Python](https://www.altcademy.com/blog/tag/python/) - [TypeScript](https://www.altcademy.com/blog/tag/typescript/) - [Enroll in Altcademy](https://www.altcademy.com/programs?ref=blog) Altcademy Blog © 2026. Powered by [Ghost](https://ghost.org/)
Readable Markdown	## Understanding NaN Values in Pandas When you're working with data in Python, using the Pandas library is like having a Swiss Army knife for data manipulation. However, sometimes your data isn't perfect. It might contain gaps or "holes", known as missing values. In Pandas, these missing pieces are often represented as `NaN`, which stands for "Not a Number". It's a special floating-point value recognized by all systems that use the standard IEEE floating-point representation. Think of `NaN` like a placeholder for something that is supposed to be a number but isn't there. Imagine you have a basket of fruits with labels on each fruit, but some labels have fallen off. Those fruits without labels could be thought of as `NaN` because, like the missing information, we know there's supposed to be something there, but it's just not. ## Why Drop NaN Values? Before we dive into how to drop `NaN` values, let's discuss why you might want to do this. `NaN` values can be problematic because they can distort statistical calculations and cause errors in machine learning models. It's like trying to make a fruit salad with some fruits missing; your salad won't be complete, and it won't taste as expected. Sometimes, you can fill in these missing values with estimates or other data, but other times it's better to just remove them. Removing `NaN` values simplifies the dataset and can make your analysis more straightforward. ## Dropping NaN Values with `dropna()` Pandas provides a powerful method called `dropna()` to deal with missing values. This method scans through your DataFrame (a kind of data table in Pandas), finds the `NaN` values, and drops the rows or columns that contain them. Here's a basic example: ``` import pandas as pd # Creating a DataFrame with NaN values data = {'Name': ['Anna', 'Bob', 'Charles', None], 'Age': [28, None, 30, 22], 'Gender': ['F', 'M', None, 'M']} df = pd.DataFrame(data) # Dropping rows with any NaN values cleaned_df = df.dropna() print(cleaned_df) ``` This code will output a DataFrame without any rows that had `NaN` values: ``` Name Age Gender 0 Anna 28.0 F 2 Charles 30.0 None ``` Notice that Charles's gender is still `None`. That's because `dropna()` by default drops entire rows where any `NaN` is present. If we want to be more specific, we can use parameters. ## Parameters of `dropna()` The `dropna()` method can be fine-tuned with parameters. Two commonly used parameters are `axis` and `how`. - `axis`: Determines whether to drop rows or columns. - `axis=0` or `axis='index'` (default): Drop rows with `NaN`. `axis=1` or `axis='columns'`: Drop columns with `NaN`. `how`: Determines if a row or column should be dropped when it has at least one `NaN` or only if all values are `NaN`. - `how='any'` (default): Drop if any `NaN` values are present. - `how='all'`: Drop if all values are `NaN`. Let's see `axis` and `how` in action: ``` # Dropping columns with any NaN values cleaned_df_columns = df.dropna(axis='columns') print(cleaned_df_columns) # Dropping rows where all values are NaN cleaned_df_all = df.dropna(how='all') print(cleaned_df_all) ``` The first print statement will give you a DataFrame without the 'Age' column since it's the only one with `NaN` values. The second print statement won't change anything in our example because there's no row where all values are `NaN`. ## Handling NaN Values in a Series A Series is like a single column in your DataFrame, a list of data with an index. Dropping `NaN` values from a Series is similar to dropping them from a DataFrame: ``` # Creating a Series with NaN values series = pd.Series([1, 2, None, 4, None]) # Dropping NaN values cleaned_series = series.dropna() print(cleaned_series) ``` This will output a Series without the `None` values: ``` 0 1.0 1 2.0 3 4.0 dtype: float64 ``` ## Filling NaN Values Instead of Dropping Sometimes, instead of dropping `NaN` values, you might want to replace them with a specific value. This is known as imputation. Pandas provides the `fillna()` method to do this. For example, you might want to replace all `NaN` values with the average of the non-missing values: ``` # Replace NaN with the mean of the 'Age' column df['Age'].fillna(df['Age'].mean(), inplace=True) print(df) ``` This will fill the `NaN` value in the 'Age' column with the average age of Anna and Charles. ## A Real-World Example Let's consider a more realistic scenario where you have a dataset of survey responses, and not all questions were answered by every respondent. You might want to drop rows where crucial information is missing, like the respondent's age or gender, but keep rows where less important information is missing. ``` # A more complex DataFrame survey_data = { 'Age': [25, None, 37, 22], 'Gender': ['F', 'M', 'F', None], 'Income': [50000, None, 80000, 75000], 'Satisfaction': [4, 3, None, 5] } survey_df = pd.DataFrame(survey_data) # Dropping rows where 'Age' or 'Gender' is NaN important_info_df = survey_df.dropna(subset=['Age', 'Gender']) print(important_info_df) ``` This will keep rows where 'Income' or 'Satisfaction' might be `NaN`, but drop rows where 'Age' or 'Gender' is `NaN`. ## Conclusion: Keeping Your Data Clean Dropping `NaN` values in Pandas is like weeding a garden. You remove the unwanted elements to allow the rest of your data to flourish without interference. By using the `dropna()` method, you can ensure that your analyses are performed on complete cases, leading to more reliable results. Remember, though, that dropping data should not be done carelessly. Always consider the context of your data and whether dropping or imputing makes more sense for your specific situation. With the tools Pandas provides, you have the flexibility to handle missing data in a way that best suits your garden of information, helping it grow into a bountiful harvest of insights.
Shard	23 (laksa)
Root Hash	13523176219864139623
Unparsed URL	com,altcademy!www,/blog/how-to-drop-nan-values-in-pandas/ s443