* feat(tools): add seed/solution restore script * chore(curriculum): remove empty sections' markers * chore(curriculum): add seed + solution to Chinese * chore: remove old formatter * fix: update getChallenges parse translated challenges separately, without reference to the source * chore(curriculum): add dashedName to English * chore(curriculum): add dashedName to Chinese * refactor: remove unused challenge property 'name' * fix: relax dashedName requirement * fix: stray tag Remove stray `pre` tag from challenge file. Signed-off-by: nhcarrigan <nhcarrigan@gmail.com> Co-authored-by: nhcarrigan <nhcarrigan@gmail.com>
40 lines
1.0 KiB
Markdown
40 lines
1.0 KiB
Markdown
---
|
|
id: 5e9a093a74c4063ca6f7c15f
|
|
title: Data Cleaning Duplicates
|
|
challengeType: 11
|
|
videoId: kj7QqjXhH6A
|
|
dashedName: data-cleaning-duplicates
|
|
---
|
|
|
|
# --description--
|
|
|
|
*Instead of using notebooks.ai like it shows in the video, you can use Google Colab instead.*
|
|
|
|
More resources:
|
|
|
|
- [Notebooks on GitHub](https://github.com/ine-rmotr-curriculum/data-cleaning-rmotr-freecodecamp)
|
|
- [How to open Notebooks from GitHub using Google Colab.](https://colab.research.google.com/github/googlecolab/colabtools/blob/master/notebooks/colab-github-demo.ipynb)
|
|
|
|
# --question--
|
|
|
|
## --text--
|
|
|
|
The Python method `.duplicated()` returns a boolean Series for your DataFrame. `True` is the return value for rows that:
|
|
|
|
## --answers--
|
|
|
|
contain a duplicate, where the value for the row contains the first occurrence of that value.
|
|
|
|
---
|
|
|
|
contain a duplicate, where the value for the row is at least the second occurrence of that value.
|
|
|
|
---
|
|
|
|
contain a duplicate, where the value for the row contains either the first or second occurrence.
|
|
|
|
## --video-solution--
|
|
|
|
2
|
|
|