Datathon

The dataset contains information about US taxpayers. There are 10 independent columns and 1 dependent column. This dataset includes attributes like household income, household debt level, if the taxpayer is married or not, how many cars their household has, if they filed their taxes in the last three years or not. Some of the attribute informations are given below: To load the training data in your jupyter notebook, use the below command: import pandas as pd tax_data = pd.read_csv("https://raw.githubusercontent.com/dphi-official/Datasets/master/tax_payers/train_set_label.csv" ) Test Dataset Load the test data (name it as test_data). You can load the data using the below command. test_data = pd.read_csv('https://raw.githubusercontent.com/dphi-official/Datasets/master/tax_payers/test_set_label.csv') Here the target column is deliberately not there as you need to predict it OBJECTIVE: You are required to build a machine learning model that would predict the political party to which a taxpayer belongs to.

6/20/2022
157 views

Tags:  

#python 

#classification 

#beginner 

#datathon 

#machine-learning