Pandas Tutorials

Sorting in Pandas 0

Sorting in Pandas

Program 1 Pandas Dataset # Sorting import pandas as pd myfile=”D://mypandas/employee.xlsx” df=pd.read_excel(myfile) #print(df) #print(df.sort_index(ascending=True)) #print(df.sort_values([’empname’],ascending=False)) #print(df.columns) #print(df.sort_values(‘totalsalary’)) Program 2 # Duplicate import pandas as pd myfile=”D://mypandas/employee.xlsx” df=pd.read_excel(myfile) print(“Before Remove: “) print(df.duplicated()) print(df.size) print(“————————————————“) df.drop_duplicates(inplace=True)...

Slicing Pandas Dataframe 0

Slicing Pandas Dataframe

Program 1 Pandas Dataset #Slicing in DataFrame import pandas as pd myfile=”D://mypandas/employee.xlsx” df=pd.read_excel(myfile) #df[start:stop:step] #print(df[[’empname’,’totalsalary’]]) #print(df[1:11:2]) #print(df[[’empname’,’totalsalary’]][1:11:2]) #print(df.head(3)) #print(df.tail(4)) # print(df.columns) # print(df[[’empname’,’totalsalary’]])    

Properties of Dataframe in Pandas 0

Properties of Dataframe in Pandas

Program 1 # Property of DataFrame # index , columns ,size, ndim,shape,value, memory_usage(),empty import pandas as pd data=[(1,’vivek’,8000),(2,’vikas’,38000),(3,’rahul’,78000),(4,’rajesh’,55000)] df=pd.DataFrame(data) print(df) print(df.empty) print(df.memory_usage()) # print(“SIZE: “,df.size) # print(“SHAPE: “,df.shape) # print(“Dimension: “,df.ndim) # ar=df.values #...

Handling Missing Data in Pandas 0

Handling Missing Data in Pandas

Program 1 Pandas Dataset 1 Pandas Dataset 2 # Missing Values import pandas as pd myfile=”D://mypandas/employee1.xlsx” df=pd.read_excel(myfile) print(df[[’empname’,’TA’]].dropna()) # print(“Before”) # print(df) # print(“After”) # df=df.dropna() # print(df) # df.to_excel(“D://mypandas/empnomissing.xlsx”) # print(“—————-Success————“) # print(“Before”)...

Methods of Pandas 0

Methods of Pandas

Program 1 #add,sub,multiply,div,mod,max,min,prod,cumprod,sum,cumsum ,mean import pandas as pd mylist=[15,52,10,42,5] sr1=pd.Series(mylist) sr1. print(sr1.median()) #print(sr1.cummax()) # print(sr1.cummin()) # print(sr1.cumsum()) # 1 3 6 10 15 # #print(sr1.sum()) # 1+2+3+4+5 # sr2=sr1.cumprod() # print(type(sr2)) #print(sr1.prod()) # 1*2*3*4*5...

Properties of Series in Pandas 0

Properties of Series in Pandas

Program 1 # Properities of Series # index,name,size values,empty,ndim, memory_usage() # nbytes ,shape ,array ,dtype import pandas as pd mylist=[10,20,33,40,53,60,79,80,93,100] mysr=pd.Series(mylist,dtype=int) print(type(mysr)) myar=mysr.array print(type(myar)) #print(“Total Memory : “,mysr.nbytes) # print(mysr.ndim) # print(mysr.shape) # print(mysr.empty)...

Creating a Pandas Series 0

Creating a Pandas Series

Program 1 #Empty Series # Create Series using array #Series using list #Series using dictonary # Create Series using numpy array import pandas as pd import array as arr import numpy as np #Series...

Practical Implementation of Pandas DataFrame Attributes 0

Practical Implementation of Pandas DataFrame Attributes

Program 1 import pandas as pd mydata=[(101,’Vishal’,9000),(102,’Amit’,7000),(103,’Rajesh’,8000),(104,’Vinit’,8000)] df=pd.DataFrame(mydata,index=[‘a”b”c”d’],columns=[‘A’,’B’,’C’]) #print(df.size) #print(df.ndim) #print(df.shape) #print(df.values) #print(df.memory_usage())  

How to Insert, Delete, Update in Pandas DataFrames 2

How to Insert, Delete, Update in Pandas DataFrames

Program 1 import pandas as pd emp=pd.read_excel(“E:\mypandas\employee.xlsx”) #print(emp) #emp=emp._append({‘id’:36,’empname’:’Rohit Sharma’,’empdept’:’CS’,’gender’:’M’,’age’:39,’HRA’:5000,’TA’:7000,’DA’:8000,’salary’:20000},ignore_index=True) # print(emp) # print(“—————After Update———————-“) # emp.loc[emp[‘salary’]==15000,’salary’]=25000 # print(emp[emp.salary==25000]) #print(emp) emp=emp.drop(emp[emp.salary==10000].index) print(emp) # _append() # loc() # drop()