- On data analysis
- 1st: Check whether the data contains duplicated rows
- The criterion for a duplicate can vary (e.g. two rows match on every column, or two rows match on just 2 chosen columns)
- 2nd: Remove the rows that are duplicated under the chosen criterion
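The two steps above can be sketched with `duplicated` and `drop_duplicates`. This is a minimal example on made-up data; the column names (`name`, `city`, `score`) are assumptions chosen only for illustration.

```python
import pandas as pd

# Hypothetical sample data; the column names are illustrative only.
df = pd.DataFrame({
    "name": ["Ann", "Bob", "Ann", "Ann"],
    "city": ["Seoul", "Busan", "Seoul", "Seoul"],
    "score": [90, 85, 90, 70],
})

# 1st: flag duplicates. By default a row counts as a duplicate only if
# every column matches an earlier row.
dup_all = df.duplicated()

# Narrower criterion: compare only the "name" and "city" columns.
dup_subset = df.duplicated(subset=["name", "city"])

# 2nd: drop the duplicates under each criterion.
# The first occurrence is kept by default (keep="first").
unique_all = df.drop_duplicates()
unique_subset = df.drop_duplicates(subset=["name", "city"])

print(dup_all.tolist())
print(dup_subset.tolist())
print(len(unique_all), len(unique_subset))
```

The choice of `subset` is what encodes the criterion, so the same data can yield a different number of rows depending on which columns are compared.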
- Visualizing data with the plotting tools built into pandas
- For more complex graphs, use seaborn & matplotlib
- Use df.plot()
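A small sketch of `df.plot()` follows. It assumes matplotlib is installed (pandas uses it as the default plotting backend), and the data and column names are invented for the example.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is needed

import pandas as pd

# Hypothetical monthly figures, for illustration only.
df = pd.DataFrame({
    "month": [1, 2, 3, 4],
    "sales": [10, 14, 9, 17],
    "costs": [7, 8, 8, 11],
})

# One line per column listed in y; df.plot() returns a matplotlib Axes.
ax = df.plot(x="month", y=["sales", "costs"], kind="line", title="Sales vs. costs")
ax.set_ylabel("amount")
```

Because the return value is a regular matplotlib Axes, it can be customized further with matplotlib calls when the pandas defaults are not enough.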
- Basic information about a DataFrame
- Statistical summary of a DataFrame (describe method)
- numerical
- count
- mean
- max
- 75%
- 50%
- 25%
- min
- std
- object (non-numerical)
- Preview of the data (head & tail)
- Number of rows & columns
- Number of values
- count
- count of unique values
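The inspection steps listed above might look like this in practice. The data is made up, and reading "number of rows & columns" as `shape` and "count of unique values" as `nunique` is an assumption about which calls the notes refer to.

```python
import pandas as pd

# Hypothetical data with one numeric and one text column.
df = pd.DataFrame({
    "age": [20, 30, 40, 50],
    "team": ["a", "b", "a", "a"],
})

# Statistical summary of the numeric columns:
# count, mean, std, min, 25%, 50%, 75%, max.
num_stats = df.describe()

# Summary of a non-numeric column: count, unique, top, freq.
team_stats = df["team"].describe()

# Preview the first and last rows.
first_rows = df.head(2)
last_rows = df.tail(2)

# Number of rows and columns as a (rows, columns) tuple.
n_rows, n_cols = df.shape

# Number of non-null values per column, and number of distinct values.
non_null = df.count()
n_teams = df["team"].nunique()

print(num_stats)
print(team_stats)
print(n_rows, n_cols, n_teams)
```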
- Statistical method
- Series with numbers
- Series with series
- Dataframe with numbers
- Dataframe with dataframe
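The four combinations above are read here as element-wise arithmetic between pandas objects and numbers; that reading is an assumption, since the notes do not name the operations. A minimal sketch with invented values:

```python
import pandas as pd

s1 = pd.Series([1, 2, 3], index=["a", "b", "c"])
s2 = pd.Series([10, 20, 30], index=["b", "c", "d"])

# Series with a number: the scalar is applied to every element.
s_plus_num = s1 + 10

# Series with Series: values are aligned by index label;
# labels present in only one Series produce NaN.
s_plus_s = s1 + s2

# The method form lets a missing label be treated as 0 instead.
s_filled = s1.add(s2, fill_value=0)

df1 = pd.DataFrame({"x": [1, 2], "y": [3, 4]})
df2 = pd.DataFrame({"x": [10, 20], "z": [30, 40]})

# DataFrame with a number: applied to every cell.
df_times_num = df1 * 2

# DataFrame with DataFrame: aligned on both row index and column names.
df_plus_df = df1 + df2

print(s_plus_s)
print(df_plus_df)
```

Alignment by label rather than by position is the main thing to watch for: columns or index labels that do not match on both sides come back as NaN.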
- Setting one or more columns as the index of a DataFrame (set index)
- Resetting the index to the default (reset index)
- Rearranging the index
- Sorting
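The index operations and sorting can be sketched as below. Mapping the notes to `set_index`, `reset_index`, `reindex`, `sort_index` and `sort_values` is an assumption, and the data is invented for the example.

```python
import pandas as pd

# Hypothetical data; "id" and "score" are illustrative names.
df = pd.DataFrame({
    "id": ["c", "a", "b"],
    "score": [70, 90, 80],
})

# Use the "id" column as the index.
indexed = df.set_index("id")

# Go back to the default integer index; "id" becomes a column again.
restored = indexed.reset_index()

# Rearrange rows to a given label order; "d" has no data, so it is NaN.
reordered = indexed.reindex(["a", "b", "c", "d"])

# Sort by index labels, or by the values of a column.
by_index = indexed.sort_index()
by_value = df.sort_values("score", ascending=False)

print(reordered)
print(by_value)
```

All of these return a new object by default and leave `df` unchanged, so the result has to be assigned to a name to be kept.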