diff --git a/guide/english/data-science-tools/pandas/index.md b/guide/english/data-science-tools/pandas/index.md index 42a5183c98..3ebbbec624 100644 --- a/guide/english/data-science-tools/pandas/index.md +++ b/guide/english/data-science-tools/pandas/index.md @@ -56,7 +56,7 @@ df.describe() ``` This will show the `count`, `mean`, `std`, `min`, `max` among others for numeric data. -## Subsetting +### Subsetting A data frame can be subset in many ways. One of the simplest is getting a single column. For instance, if the data frame `df` contains a column named `age`, we can extract it as follows: ```python ages = df["age"] @@ -74,6 +74,14 @@ Another option for subsetting a dataframe is using the loc and iloc methods. The ages = df.loc["age"] ``` +### Basic Statistics +Descriptive statistics can be performed on each column of a pandas dataframe. + +Get the mean of the values for the requested axis +```python +mean = df.mean() +``` + #### More Information: 1. [pandas](http://pandas.pydata.org/) 2. [read_csv](http://pandas.pydata.org/pandas-docs/stable/generated/pandas.read_csv.html?highlight=read_csv#pandas.read_csv)