week 3

 

Using Pandas to access data with Excel or CSV files. Use your approved dataset for this assignment.

Requirements: 1
Import the pandas library
Import the numpy library
Import the openpyxl library

Create a data frame and load the Excel or CSV file

Add the optional settings
pd.set_option(“display.max_columns”,None)
pd.set_option(“display.max_rows”,None)
pd.set_option(“max_colwidth”,None)
pd.set_option(‘expand_frame_repr’,False)

Requirements: 2

Print all the columns in the data frame
Print a statistic summary of the data frame
Print the first 4 records in the data frame
Print the last 7 records in the data frame

Requirements: 3

Print the index in the data frame
Print the data types in the data frame

Selecting Data with brackets []
Print any column in the data frame
Print the 3rd data element using an index for your selected column

Requirements: 4

Create a new numeric column (default: zero)

Create a temp dataset with your main dataset
Using a for loop:
Using a function (insert parameter):
Perform arithmetic with one of your numeric fields
Store the result in your new numeric column
Print the first 5 records in the data frame
Save the data into a csv file (Make sure your name is in the file title)

Requirements: 5

Create a filter using two of your columns from your dataset
Create a temp dataset with your main dataset and filter
Print the first 5 records in the data frame
Save the data into a csv file (Make sure your name is in the file title)

Requirements: 6

Create a new numeric column (default: zero)
Create another function (insert parameter) to perform arithmetic
Append a row to the main dataset; use the function to populate the numeric column created earlier
Save the data into a csv file (Make sure your name is in the file title)
Open the file and highlight the row you added

Requirements: 7

Create a new numeric column (default: negative -1)
Create a temp dataset with your main dataset
Using a for loop:
Using if statements:
Populate categorical codes in the numeric column
Print the first 5 records in the data frame
Save the data into a csv file (Make sure your name is in the file title)

Note: Please submit your py files, screenshot(s) of the output, and csv file in the same submission for the assignment. 

note: no copypaste, no plagirisum