2020-04-21 12:42:19 -04:00
---
id: 5e9a093a74c4063ca6f7c15f
title: Data Cleaning Duplicates
challengeType: 11
videoId: kj7QqjXhH6A
---
## Description
2020-08-04 20:56:41 +01:00
2020-04-21 12:42:19 -04:00
<section id='description'>
2020-09-03 12:48:42 -04:00
<em>Instead of using notebooks.ai like it shows in the video, you can use Google Colab instead.</em>
2020-07-17 05:12:45 -04:00
More resources:
2020-09-03 12:48:42 -04:00
- <a href="https://github.com/ine-rmotr-curriculum/data-cleaning-rmotr-freecodecamp" target="_blank" rel="noopener noreferrer">Notebooks on GitHub</a>
- <a href="https://colab.research.google.com/github/googlecolab/colabtools/blob/master/notebooks/colab-github-demo.ipynb" target="_blank" rel="noopener noreferrer">How to open Notebooks from GitHub using Google Colab.</a>
2020-04-21 12:42:19 -04:00
</section>
## Tests
2020-08-04 20:56:41 +01:00
2020-04-21 12:42:19 -04:00
<section id='tests'>
```yml
question:
2020-05-28 22:40:36 +09:00
text: |
The Python method `.duplicated()` returns a boolean Series for your DataFrame. `True` is the return value for rows that:
2020-08-04 20:56:41 +01:00
2020-04-21 12:42:19 -04:00
answers:
2020-08-04 20:56:41 +01:00
- |
contain a duplicate, where the value for the row contains the first occurrence of that value.
- |
contain a duplicate, where the value for the row is at least the second occurrence of that value.
- |
contain a duplicate, where the value for the row contains either the first or second occurrence.
2020-05-10 00:16:11 -05:00
solution: 2
2020-04-21 12:42:19 -04:00
```
</section>