---
id: 5e9a093a74c4063ca6f7c164
title: Parsing HTML and Saving Data
challengeType: 11
isHidden: false
videoId: bJaqnTWQmb0
---

## Description
<section id='description'>
More resources:

- <a href="https://notebooks.ai/rmotr-curriculum/rdp-reading-csv-and-txt-files-fb829f46" target='_blank'>Reading CSVs Notebook</a>
- <a href="https://notebooks.ai/rmotr-curriculum/rdp-reading-data-from-relational-databases-2a3a889b" target='_blank'>Reading SQL</a>
- <a href="https://notebooks.ai/rmotr-curriculum/rdp-reading-html-tables-eb9cca73" target='_blank'>Reading HTML</a>
- <a href="https://notebooks.ai/rmotr-curriculum/rdp-reading-excel-files-a6b99973" target='_blank'>Reading Excel files</a>
</section>

## Tests
<section id='tests'>

```yml
question:
  text: |
    What Python library has the `.read_html()` method that we can use for parsing HTML documents and extracting tables?
  answers:
    - |
      BeautifierSoupy
    - |
      WebReader
    - |
      HTTP-master
    - |
      Pandas
  solution: 4
```

</section>