In my previous article I addressed some of the common queries faced by a beginner while working with various dataset. This article is the continuation of my previous article.
I’ll be continuing to demonstrate further concepts using the same dataset(UFO) as used in the first part of this article.
When working with tabular data, such as data stored in spreadsheets or databases, pandas is the right tool for you. pandas will help you to explore, clean and process your data. In pandas, a data table is called a DataFrame. This Blog will pose some frequently asked queries and try to answer these queries.
To demonstrate the use of pandas, I’ll be using the UFO Dataset. The data is stored as an excel file by the name of ufo.xlsx.
The below code demonstrates how you read a csv file into your dataframe.
import pandas as pd
''' read a dataset of Chipotle orders directly from a URL and store the results in a DataFrame'''
orders = pd.read_table('http://bit.ly/chiporders',sep='\t')
In my previous article’s I addressed some of the common queries faced by a beginner while working with various datasets. This article is the continuation of my previous articles.
I’ll be continuing to demonstrate further concepts using the same dataset(UFO) as used in the first and second part of this article.
The categorical Features needs to be changed to Numerical one’s to fit it into a any specific model. Although it is beneficial and convenient to use Label Encoder , pandas provide a method to change the Categorical Features to Numerical ones using