---
id: 5e9a093a74c4063ca6f7c15f
title: Data Cleaning Duplicates
challengeType: 11
videoId: kj7QqjXhH6A
---
## Description
<section id='description'>
<em>Instead of using notebooks.ai as shown in the video, you can use Google Colab.</em>
More resources:
- <a href="https://github.com/ine-rmotr-curriculum/data-cleaning-rmotr-freecodecamp" target="_blank" rel="noopener noreferrer">Notebooks on GitHub</a>
- <a href="https://colab.research.google.com/github/googlecolab/colabtools/blob/master/notebooks/colab-github-demo.ipynb" target="_blank" rel="noopener noreferrer">How to open Notebooks from GitHub using Google Colab.</a>
</section>
## Tests
<section id='tests'>
```yml
question:
  text: |
    The pandas method `.duplicated()` returns a boolean Series for your DataFrame. `True` is the return value for rows that:
  answers:
    - |
      contain a duplicate, where the value for the row contains the first occurrence of that value.
    - |
      contain a duplicate, where the value for the row is at least the second occurrence of that value.
    - |
      contain a duplicate, where the value for the row contains either the first or second occurrence.
  solution: 2
```
</section>
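
The behavior of `.duplicated()` asked about in the question can be checked with a small experiment. The following is a minimal sketch, assuming pandas is installed; the DataFrame `df` and its `name` and `team` columns are hypothetical sample data, not taken from the video or the notebooks. It compares the default behavior with the two other values of the `keep` parameter.

```python
import pandas as pd

# Hypothetical sample data: the row ("Ana", "red") appears three times.
df = pd.DataFrame({
    "name": ["Ana", "Ben", "Ana", "Cal", "Ana"],
    "team": ["red", "blue", "red", "green", "red"],
})

# Default (keep="first"): the first occurrence is left unflagged,
# and every later repeat of the same row is flagged True.
flags_default = df.duplicated()

# keep="last": every occurrence except the last one is flagged.
flags_last = df.duplicated(keep="last")

# keep=False: every row that has a duplicate anywhere is flagged.
flags_all = df.duplicated(keep=False)

print(flags_default.tolist())  # [False, False, True, False, True]
print(flags_last.tolist())     # [True, False, True, False, False]
print(flags_all.tolist())      # [True, False, True, False, True]
```

Passing the default flags to boolean indexing, as in `df[~flags_default]`, would keep one copy of each distinct row.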