Duplicated Data (pandas)

pandas

  • On data analysis
    • 1st : Check whether there are duplicated data
      • Standard can be various (ex. Values from all columns are same on two rows, Values from just 2 columns are same on two rows)
    • 2nd : remove the duplicated (corresponding to specific standard) row

Visualization (pandas)

pandas

  • Visualization of data using the tools of pandas
  • If we wanna make more complex graph, use seaborn & matplotlib
  • Use df.plot()

Looking at Data (pandas)

pandas

  • basic information about dataframe
  • statistical information about dataframe (describe method)
    • numerical
      • count
      • mean
      • max
      • 75%
      • 50%
      • 25%
      • min
      • std
    • object (non-numerical)
      • count
      • top
      • unique value
      • freq
  • preview of data (head & tail)
  • the number of rows & columns
  • the number of data
    • count
    • unique value’s count
  • Statistical method
    • mean
    • median
    • max
    • min
    • std
    • corr

Index (pandas)

pandas

  • Choose the columns as the index of dataframe (set index)
  • Initialization of index
  • Rearrange index
  • Sorting

Pagination


© 2017. All rights reserved.

Powered by Hydejack v조현진