Manage Data

Getting Information about your data

def get_info_data(df):
    print('\n ++++Head Dataframe:++++ \n')
    print(df.head(5))
    print('\n ++++Tail Dataframe:++++ \n')
    print(df.tail(5))
    print('\n ++++Info Dataframe:++++ \n')
    print(df.info())
    print('\n Shape Dataframe: ',df.shape)   
    print('\n ++++Describe Dataframe:++++ \n')
    print(df.describe())
    print('\n ++++IsNull in Dataframe:++++ \n')
    print(df.isnull().sum())  
    
    return 

Drop Duplicates

def drop_dupli(df, keeps='last'):
    df = df.drop_duplicates(keep=keeps)
    return df

Count the number of NaN's

āļĨāļšāļ„āđˆāļē NaN

Lower Column

āļ›āļĢāļąāļš Format

āļ—āļĻāļ™āļīāļĒāļĄ 2 āļ•āļģāđāļŦāļ™āđˆāļ‡

dataframe / Series

% Number (*100)

percent + % sign (Dataframe / Series)

Last updated

Was this helpful?