Co-authored-by: Kristofer Koishigawa <scissorsneedfoodtoo@gmail.com> Co-authored-by: Oliver Eyton-Williams <ojeytonwilliams@gmail.com>
44 lines
1.1 KiB
Markdown
44 lines
1.1 KiB
Markdown
---
|
|
id: 5e9a093a74c4063ca6f7c15f
|
|
title: Data Cleaning Duplicates
|
|
challengeType: 11
|
|
videoId: kj7QqjXhH6A
|
|
bilibiliIds:
|
|
aid: 675611672
|
|
bvid: BV1VU4y1A7tu
|
|
cid: 409019368
|
|
dashedName: data-cleaning-duplicates
|
|
---
|
|
|
|
# --description--
|
|
|
|
*Instead of using notebooks.ai like it shows in the video, you can use Google Colab instead.*
|
|
|
|
More resources:
|
|
|
|
- [Notebooks on GitHub](https://github.com/ine-rmotr-curriculum/data-cleaning-rmotr-freecodecamp)
|
|
- [How to open Notebooks from GitHub using Google Colab.](https://colab.research.google.com/github/googlecolab/colabtools/blob/master/notebooks/colab-github-demo.ipynb)
|
|
|
|
# --question--
|
|
|
|
## --text--
|
|
|
|
The Python method `.duplicated()` returns a boolean Series for your DataFrame. `True` is the return value for rows that:
|
|
|
|
## --answers--
|
|
|
|
contain a duplicate, where the value for the row contains the first occurrence of that value.
|
|
|
|
---
|
|
|
|
contain a duplicate, where the value for the row is at least the second occurrence of that value.
|
|
|
|
---
|
|
|
|
contain a duplicate, where the value for the row contains either the first or second occurrence.
|
|
|
|
## --video-solution--
|
|
|
|
2
|
|
|